Gene Sama_1568 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSama_1568 
Symbol 
ID4603820 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella amazonensis SB2B 
KingdomBacteria 
Replicon accessionNC_008700 
Strand
Start bp1912255 
End bp1913292 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content52% 
IMG OID639780924 
Product4-hydroxyphenylpyruvate dioxygenase 
Protein accessionYP_927445 
Protein GI119774705 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins 
TIGRFAM ID[TIGR01263] 4-hydroxyphenylpyruvate dioxygenase 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAAGCG AAACCAATCC ACTGGGCCTG CTCGGCATCG AATTTACCGA GTTTGCCACC 
CCAGATAACG ACTTCATGCA CAAGGTTTTT CTGGACTTTG GCTTTTCCAT GCTGAAAAAG
CACAAGGAAA AAGACATCTA CTACTACCAG CAAAACGACA TCAACTTTTT GATGAACCGT
GACCGCGCCG GTTTCTCGGC CGGTTTTGCC AAGTCTCACG GCCCGGCCAT CACCTCCATG
GGCTGGCGCG TGGAAGATGC CGAATATGCC TACAAGCACG CGGTTGAACG TGGCGCCAAG
GCCGCCCCGG ATGACGTGAA AGACCTGCCC TACCCAGCCA TTTACGGCAT TGGTGACAGC
CTGATTTACT TCATCGACCG TTTCGGTGAT GACAACATCT ACGCCACCGA TTTTGTTGAT
CTGGATGAGC CTGTGATTGT GCAGGAAAAA GGCTTTATGG AAGTCGACCA TCTGACCAAC
AACGTCTACA AGGGCACCAT GGAACAGTGG TCAAACTTCT ATAAAGACGT TTTTGGCTTT
ACCGAAGTGC GCTACTTCGA CATCAAGGGC TCCCAGACTG CACTGATTTC TTACGCGCTG
CGTTCACCGG ATGGCAGCTT CTGTATCCCT ATCAACGAAG GTAAAGGCGA CGATCGTAAC
CAGATTGACG AATACCTGCG TGAATACAAT GGCCCGGGCG TTCAGCACCT GGCGTTCCGC
AGCCGTGACA TAGTTGCCTC GCTGGATGCA ATGGAAGGCT CGTCCATTGC GACACTGGAC
ATTATCCCTG AATACTACGA CACCATCTTC GAAAAACTGC CCCAGGTGAC CGAAGACCGT
GAGCGCATCA AGCATCACCA AATTCTGGTG GATGGCGATG AAAACGGCTA CCTGCTGCAG
ATTTTCACCA AGAACCTGTT TGGTCCTATC TTTATCGAAA TCATCCAGCG TAAGAACAAC
CTGGGTTTCG GTGAAGGTAA CTTCAAGGCG CTGTTTGAAT CTATCGAGCG CGATCAGGTC
CGCCGCGGCG TGCTTTAA
 
Protein sequence
MASETNPLGL LGIEFTEFAT PDNDFMHKVF LDFGFSMLKK HKEKDIYYYQ QNDINFLMNR 
DRAGFSAGFA KSHGPAITSM GWRVEDAEYA YKHAVERGAK AAPDDVKDLP YPAIYGIGDS
LIYFIDRFGD DNIYATDFVD LDEPVIVQEK GFMEVDHLTN NVYKGTMEQW SNFYKDVFGF
TEVRYFDIKG SQTALISYAL RSPDGSFCIP INEGKGDDRN QIDEYLREYN GPGVQHLAFR
SRDIVASLDA MEGSSIATLD IIPEYYDTIF EKLPQVTEDR ERIKHHQILV DGDENGYLLQ
IFTKNLFGPI FIEIIQRKNN LGFGEGNFKA LFESIERDQV RRGVL