Gene Tery_1077 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_1077 
Symbol 
ID4241650 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp1694036 
End bp1695550 
Gene Length1515 bp 
Protein Length504 aa 
Translation table11 
GC content40% 
IMG OID638106307 
Productglucose-methanol-choline oxidoreductase 
Protein accessionYP_720919 
Protein GI113474858 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2303] Choline dehydrogenase and related flavoproteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.785546 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGGTTACG ATTTTGATGC AATTATTATT GGCAGCGGTG CAGGAGGAGG AACGATCGCC 
CATACTCTAG CAAAAGCTGG TAAACAGGTT CTTATAGTAG AACGTGGTCC AAGATTTAGT
GATGTTGAAG CTTTCCAAGA TGAGCAAAGA ATGTTAATAA ATAAAGAAGC TTTTGACGAT
CGCCAAATTG AAGTTAATGG ACGTAACGCC CAACTTCATA TTGCCGGAAT ATTAGGGGGT
GGTACATCTT TATATGGAGG TGTATTAATG CGTCCAAGTC CTTATGACTT CCACCCTGGT
AAATTCTATG ACCAATGGCT ACCCCGTCAC CTGTGGGATT GGCCCATTAC TTACGATGAT
ATGGCTCCCT ACTTCGAGCA AGCAGAAACA TTATTTCATG TTGCTGGAGA TCGGCCATAT
AAAATGCCCA ATGTAGGAAC ACCAAGCAAA GGTTATCCAG CTACAGTTCC TCCACTCGAG
CCAATTAACC AACAATTGGA AAAAGGTATA AAAGATGCCG GGTTAATGCC ATTTCATCTG
CCCTTTGGTA TAGATTTCAA AAGTTGTCTG CGTTGTTCAA AATGTCCTGG TTTCTACTGC
CCAAATGAAG CTCGTGCTTC TACGGTAGTA CGTACTATTG ATAAGGCCAC TCTTGATTAC
CACCTTCAAG TTCAAACTTT CACAGAAGCA GACCGTTTAG TTACTAACAG TCACCAAAAA
GTTACAGGTA TTCGCCTCCG TTGTCGCAAA ACTAACAAAG TTCAAGAACT GACAGCTAAA
CTTTATATTC TTTCTGCGGG AGCTTTAGGT AGTCCAATTA TTTTGATGAA GTCTGGCTTA
ACAGGACGTA GCGGACAAGT AGGGCGTAAT TATATGTATC ATTGTGGTGC TTTAGTGGCA
GGTTTATTCA AACAAGAGAC AGGAGGGGCA GATAAATTCA TTAAACAATT AGGGTTTACA
GACCTGTATC TGGGAAATAA AGAATTTACT CACAAATTGG GGTTAGCTCA AACAGTTCCT
ATTCCAGGAC TTCTTTCCCT ACAACAAAAT TTCCCCATAC CAATACCAAA AACCATCGCC
GAATTTCTGC TCAAGCGAAT GTTGGCAATA ACTGGCATTG TGGAAGATTT ACCCCAACAG
GAAAATCGGA TAGAATTAGG TAAAGATGGC AAAATTCGTC TATTTCACAA ATTTCATCCT
TATGATGTTT ACCGTTCTCA ATACTATAAG AGCAAATTAA AACAGGTATT CCGCTTGGCA
GGATCAATAT TATCTTTTGG AGCAACAGGT GATAAAGATG ATATTCATAC ATCCCACCAG
GTAGGTACAA CTAGGTTTGG TACAGATCCT AATACATCTG TGCTTAACAA AGATTGTCGT
TTACATGAGC ATGAAAATGT TTTTGTTGTG GATGGTGGAT TTATGCCTAA TGCTTTGGGT
GTTAGTCCAG CTTTGACTAT TGCGGCAAAT GCTCTGAGAG TTGCGGATAC TATATTACAA
AAAGCTATAA TTTGA
 
Protein sequence
MGYDFDAIII GSGAGGGTIA HTLAKAGKQV LIVERGPRFS DVEAFQDEQR MLINKEAFDD 
RQIEVNGRNA QLHIAGILGG GTSLYGGVLM RPSPYDFHPG KFYDQWLPRH LWDWPITYDD
MAPYFEQAET LFHVAGDRPY KMPNVGTPSK GYPATVPPLE PINQQLEKGI KDAGLMPFHL
PFGIDFKSCL RCSKCPGFYC PNEARASTVV RTIDKATLDY HLQVQTFTEA DRLVTNSHQK
VTGIRLRCRK TNKVQELTAK LYILSAGALG SPIILMKSGL TGRSGQVGRN YMYHCGALVA
GLFKQETGGA DKFIKQLGFT DLYLGNKEFT HKLGLAQTVP IPGLLSLQQN FPIPIPKTIA
EFLLKRMLAI TGIVEDLPQQ ENRIELGKDG KIRLFHKFHP YDVYRSQYYK SKLKQVFRLA
GSILSFGATG DKDDIHTSHQ VGTTRFGTDP NTSVLNKDCR LHEHENVFVV DGGFMPNALG
VSPALTIAAN ALRVADTILQ KAII