Gene Tery_4458 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_4458 
Symbol 
ID4246111 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp6872163 
End bp6873455 
Gene Length1293 bp 
Protein Length430 aa 
Translation table11 
GC content44% 
IMG OID638109341 
Productglucose sorbosone dehydrogenase 
Protein accessionYP_723918 
Protein GI113477857 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2133] Glucose/sorbosone dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTCAAT ACAATTTTTT TTCTCAACCA GAACAAAAAA TACTTGATAT TTTCTATGGC 
TTGATCATAA AAAAAAGTAC AATATTATTT CGCACTGCAA TAGCGATCGC TCTCTTAGGC
ATTGTAAGTT GTAGGACCGA AATTAAACTT GAGTCATCTA CTGCTACTCA GACTCAAAAC
AAGTCTAGGA TAAACTCATC GGTCTCTGAT GCTGAAAACA TGACCTCTAT TTCAGACTAC
CAAAAAATAA CTTTAGTAAA AAACCTAGAA CATCCCTGGA GTATTGCCTG GTTACCAGAT
GGAAAAATAC TAATTACTGA AAGACCAGGG CGGTTGCGAA TTTTTCGTGA TGGAATTTTA
GAGCCAACTC CTATTTCAGG TGTTCCCCAA GTCTTTGCTT TTGGTCAAGG TGGTTTACTC
GATGTCTCTG CTCACCCGCG TTTTGCCGAA AACCGCTTTA TTTATTTAAC CTATTCCCAC
GGCGATCGCT CAAACAACCG CACTCGCATC GCCCGCGCTC GACTAGAGAA TAATACCTTG
GGGGATCTAA AAGTCATCTT TGAGGTTTCC CAGACGAAAC CAGGGGCACA ACATTTTGGT
TCACGCATTA TTTGGTTGCC TGATGGAACT CTAGTAGCAA GTATTGGTGA TGGCGGCAAC
CCACCTATTG AGTTCAATGG TGAATTTATT CGGCAACAGG CTCAAAACCG CAATAGTCAT
TTTGGCAAAG TGATCAGGTT GAATGACGAT GGATCTATAC CTTCAAATAA CCCTTTTGCT
ACCTCTACAG ATGCTAAACC TGCTCTCTGG AGTTATGGGC ATCGGAATAT TCAGGGTATC
ACCTTAGACC CCACAAAAAA TAGAGTCTGG GCTACTGAAC ATGGTTCCAG GGGTGGTGAT
GAACTGAATT TAATTGAGAG GGGAGAAAAC TATGGTTGGC CTGTAGTTAC TCACAGTCGT
GAATATTCTG GAGGGCTCAT CTCTCCAGAA ACATCTCGCC CTGGTCTGGT AGACCCGAAA
GTGATCTGGA CTCCTTCTAT AGCTCCCTCT GGTTTGGCTT TTTATAACGG CGATCGCTTT
CCCCAGTGGC GAGGCAATTT GTTTGCAGGT GGGTTAGTCT CCCAAGATAT TCGTCGCATT
CAGCTTGACC CTGGGGGTAA TGTTATCGCT CAAAATTCTA TTCCCATAGG TCAGAGAGTG
CGGGATGTCC GACAGGGACC TGATGGACTG TTGTATGTTC TCACCGATGA CCGAAACGGG
CAATTAATTC GACTAGAACC TATGGGGAAA TAA
 
Protein sequence
MFQYNFFSQP EQKILDIFYG LIIKKSTILF RTAIAIALLG IVSCRTEIKL ESSTATQTQN 
KSRINSSVSD AENMTSISDY QKITLVKNLE HPWSIAWLPD GKILITERPG RLRIFRDGIL
EPTPISGVPQ VFAFGQGGLL DVSAHPRFAE NRFIYLTYSH GDRSNNRTRI ARARLENNTL
GDLKVIFEVS QTKPGAQHFG SRIIWLPDGT LVASIGDGGN PPIEFNGEFI RQQAQNRNSH
FGKVIRLNDD GSIPSNNPFA TSTDAKPALW SYGHRNIQGI TLDPTKNRVW ATEHGSRGGD
ELNLIERGEN YGWPVVTHSR EYSGGLISPE TSRPGLVDPK VIWTPSIAPS GLAFYNGDRF
PQWRGNLFAG GLVSQDIRRI QLDPGGNVIA QNSIPIGQRV RDVRQGPDGL LYVLTDDRNG
QLIRLEPMGK