Gene Tery_0398 details


Gene Information

Locus tag: Tery_0398
Symbol:
ID: 4241962
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 620588
End bp: 621844
Gene Length: 1257 bp
Protein Length: 418 aa
Translation table: 11
GC content: 37%
IMG OID: 638105723
Product: UDP-sulfoquinovose synthase
Protein accession: YP_720337
Protein GI: 113474276
COG category: [G] Carbohydrate transport and metabolism
              [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0451] Nucleoside-diphosphate-sugar epimerases
TIGRFAM ID:
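
The coordinate and length fields above are consistent with one another, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length. Neither convention is stated in this record, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose coordinates are 1-based and inclusive (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(620588, 621844)
print(length)                     # 1257
print(protein_length_aa(length))  # 418
```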


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAGTCC TAGTAATTGG TGGCGATGGC TACTGCGGTT GGGCAACCGC TCTTTATCTT 
TCCAATCAAG GTTATGAGGT AGGTATTTTA GACAGTATGG TTAGGCGACA CTGGGATCTA
CAACTTCAAG TAGAAACCCT CACACCTATT GCCCCTATTC AACAACGCAT CCAACGTTGG
AAAGACCTCA CAGGCAAAAA AATTGATCTA TATATAGGAG ATATTACCAA CTACGATTTC
CTAAGTACAA CACTACATCA GTTTGAACCA GAATCTATAG TTCACTTTGG TGAACAACGT
TCTGCTCCAT TTTCTATGAT TGACCGGGAA CATGCTGTTA CGACTCAAGT TAATAATGTT
GTTGGTACTC TCAATATTCT TTATGCTATG AAAGAGGATT TTCCAGACTG CCATTTAGTT
AAACTGGGAA CTATGGGAGA GTATGGTACT CCTAATATAG ATATTGAAGA GGGTTATATC
AAAATTGAAC ATAATGGACG CACGGATACT CTACCCTATC CAAAACAACC AGGGTCTTTC
TATCATCTCA GTAAAGTTCA CGATAGCCAC AACATTCACT TTGCCTGCAA AATATGGGGT
ATCCGGGCTA CGGACCTTAA TCAAGGAATT GTATATGGTG TTGCTCTAAC TGGTCTTCTA
AATGATGAAA CAATCCAAGA TGAACTTTTG ATTAACCGTC TTGACTATGA TGGAGTTTTT
GGTACAGCTC TCAATAGATT TTGTATTCAA GCAGCAATTG GCCATCCGCT AACTGTTTAT
GGTACAGGTG GACAAACTCG TGGCTTTTTA GATATTAGAG ATACTGTCCG TTGTATGGAA
ATAGCGATCG CAAACCCAGC ACAACCAGGT GAATTCCGAG TATTTAACCA ATTTACTGAA
ATGTTTAGTG TACTAGACCT AGCAGAAATG GTAAAAACAG CGGGTAAAAC TATGGATTTA
GATGTACAAA TTAATCACTT GGATAATCCT AGAGTAGAGT TAGAGCAACA TTATTTCAAT
GCTAAAAATA CGAATTTATT AGAGCTTGGT TTAAAACCTC ATTATCTATC TGATTCTTTG
TTAGATTCTT TGCTCAATTT TGCTATTAAG TATAAAACAA GAGTAGATAA AAACCATATT
TTACCTAAAG TCTCTTGGCA TCGAGAAAAA ACTCAACAAT TAGATTCTGT AAAAAGTACA
TTAATATCCA AAACTGAGGA AAAAAACAAA CAATCAGATT CTGTAAAAGT ACAATAG
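
The gene sequence is printed in space-separated blocks of 10 bases, so the whitespace has to be removed before computing anything from it. The sketch below shows one way to do that and to compute a GC percentage. The helper names are hypothetical, and the example uses only the first line of the sequence, so its GC value will differ from the 37% reported for the full 1257 bp gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (spaces and line breaks) and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence as printed in this record.
first_line = "ATGAAAGTCC TAGTAATTGG TGGCGATGGC TACTGCGGTT GGGCAACCGC TCTTTATCTT"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Passing the full sequence text to clean_sequence should yield a string of 1257 bases, matching the Gene Length field.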
 
Protein sequence
MKVLVIGGDG YCGWATALYL SNQGYEVGIL DSMVRRHWDL QLQVETLTPI APIQQRIQRW 
KDLTGKKIDL YIGDITNYDF LSTTLHQFEP ESIVHFGEQR SAPFSMIDRE HAVTTQVNNV
VGTLNILYAM KEDFPDCHLV KLGTMGEYGT PNIDIEEGYI KIEHNGRTDT LPYPKQPGSF
YHLSKVHDSH NIHFACKIWG IRATDLNQGI VYGVALTGLL NDETIQDELL INRLDYDGVF
GTALNRFCIQ AAIGHPLTVY GTGGQTRGFL DIRDTVRCME IAIANPAQPG EFRVFNQFTE
MFSVLDLAEM VKTAGKTMDL DVQINHLDNP RVELEQHYFN AKNTNLLELG LKPHYLSDSL
LDSLLNFAIK YKTRVDKNHI LPKVSWHREK TQQLDSVKST LISKTEEKNK QSDSVKVQ
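
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. Translation table 11 assigns the same amino acids to codons as the standard genetic code and differs from it only in the set of permitted start codons. Because this gene starts with ATG, a plain standard-code lookup is enough here. The sketch below is a minimal illustration with hypothetical names, not the method used to produce this record, and it does not handle alternative start codons or ambiguous bases.

```python
from itertools import product

# Codons enumerated in TCAG order, with the matching amino-acid string for
# the standard code (table 11 uses the same amino-acid assignments).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 15 bases of the gene sequence in this record.
print(translate("ATGAAAGTCC TAGTAATTGG TGGCGATGGC"))  # MKVLVIGGDG
print(translate("GTAAAAGT ACAATAG"))                  # VKVQ
```

Both fragments agree with the first 10 and last 4 residues of the protein sequence listed above.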