Gene Tery_3212 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_3212 
Symbol 
ID4243807 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp4910934 
End bp4912403 
Gene Length1470 bp 
Protein Length489 aa 
Translation table11 
GC content34% 
IMG OID638108213 
Productcarotenoid oxygenase 
Protein accessionYP_722804 
Protein GI113476743 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3670] Lignostilbene-alpha,beta-dioxygenase and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.415925 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.375816 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAAATT TACAAATTCA ACAACCTAAA TCTTATACTA GTAAAGATTG GCAACAAGGA 
TATAAATCTC AACCACAAGA ATATAACTAT TGGATTGATG ATATAGAAGG TGAAATACCA
GAAGATTTAA ACGGTACTTT TTTCCGTAAC GGACCAGGTT TATTAGACAT TAATGGTCAA
CTTATTGCTC ATCCTTTTGA CGGAGATGGA ATGGTTTGTG CAATTAGTTT TAAAAACCGT
CGCGCCCACT TCCAAAATAG ATTTGTTAGA ACAGAAGGTT ATGTGGCAGA AAAAGCGGCA
GGAAAAATTC TCTATAGAGG TGTTTTTGGT ACTCAAAAAA CAGGTGGTTG GTTAGCTAAT
CTTTTTGATT TTAAACTTAA AAATATTGCC AATACTGGTA TTATTTATTG GGGTGATAAA
CTTTTGGCAT TGTGGGAAGG AGGGCAACCT CATCGTTTAA ATCCCCAGAA TTTAGAAACC
ATTGGCCTTA ACGATTTAGA TGGGCTTTTA CAACCAGGTC AAGCTTTTTC TGCTCATCCC
AGAATTGATA AAGGAAAGGA TGGAAAAGGA GATGTTTTAG TTAATTTTTC TGTCAAACCT
GGTTTATCAA GTACCATTAC TATTTTTGAA TTTAATAGTC AGGGAAAATT ACTCAAACGT
TACTCTAATT CTATTCCTGG TTTTGCCTTT TTACACGATA TGGTAATTAC ACCAAATTAC
TGTATTTTTT TTCAAAATCC TGTTGCTTTT AATCCTTTTC CTTTATTACT AGGGTTACGA
ACTCCAGGTC AATGTTTAGA GTTTTTACCT AATAATTCAA CACAAGTTAT TTTAATTCCT
CGTGATGGTA GTAAAGCTAT AAAAATTTTG AAAACGAAAC CTTGTTTTGT ATTTCATCAT
GCTAATGCTT GGGAAAAGGA CGGGGAAATT TATGTAGATT CTATTTGTTA TGAATCTGTC
TCACAAACTG ACCTAGGTGA TAATTTTCTG GAGGTGGATT TTGACTCAAT GACAGAAGGT
AAGTTATGGC GATTTAAGAT TAATTTATCA GAGAATAATG TGGAACATAA ATTGCTTGAA
AGTCGTTGTT GTGAGTTTCC GACTTTAAAT CCGAATAATG TAGGAAAAGC TTATCGATAT
TTATTTATTG GAGCAGCAGA TAAGCCTAGT GGAAATGCTC CTTTACAAGC AATATTAAAA
ATTGATTTGC ATACAGGAAA ACGTCAAACT TTTAGTGTCG CACCGCGAGG TTTTGCAGGA
GAACCTTTAT TTGTTCCTTT TCCAAATGGG GTGAATGAGG ATGATGGTTG GTTATTAATG
TTGATGTATG ATGCAGCAGA ACATCGGTCG GATATTGTGA TTTTGGATGC TCGTGATTTG
AATAAAAAAC CTGTGGCAAG ATTACATTTA AAGCATCATA TTCCTTATGG TTTACATGGT
AGTTTTACCC CTAATTATTT TCAAGAGTAA
 
Protein sequence
MTNLQIQQPK SYTSKDWQQG YKSQPQEYNY WIDDIEGEIP EDLNGTFFRN GPGLLDINGQ 
LIAHPFDGDG MVCAISFKNR RAHFQNRFVR TEGYVAEKAA GKILYRGVFG TQKTGGWLAN
LFDFKLKNIA NTGIIYWGDK LLALWEGGQP HRLNPQNLET IGLNDLDGLL QPGQAFSAHP
RIDKGKDGKG DVLVNFSVKP GLSSTITIFE FNSQGKLLKR YSNSIPGFAF LHDMVITPNY
CIFFQNPVAF NPFPLLLGLR TPGQCLEFLP NNSTQVILIP RDGSKAIKIL KTKPCFVFHH
ANAWEKDGEI YVDSICYESV SQTDLGDNFL EVDFDSMTEG KLWRFKINLS ENNVEHKLLE
SRCCEFPTLN PNNVGKAYRY LFIGAADKPS GNAPLQAILK IDLHTGKRQT FSVAPRGFAG
EPLFVPFPNG VNEDDGWLLM LMYDAAEHRS DIVILDARDL NKKPVARLHL KHHIPYGLHG
SFTPNYFQE