Gene Tery_0275 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_0275 
Symbol 
ID4241642 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp425178 
End bp426698 
Gene Length1521 bp 
Protein Length506 aa 
Translation table11 
GC content40% 
IMG OID638105615 
Productglucose-methanol-choline oxidoreductase 
Protein accessionYP_720230 
Protein GI113474169 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2303] Choline dehydrogenase and related flavoproteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0236777 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTATTG ACGACCAACA TTATGATGTA ATTATTGTCG GTACAGGAGC AGGCGGTGGT 
ACTTTAGCAT ATAAACTTGC TGCCACTGGC AAAAAAATTC TGATTCTGGA AAGAGGTGAT
TTTATGCCTC TAGAAATCCA AAATCGAGTT AATGTTGATA TCTTTAAGAA AGAACTTTAC
CGTGCCCATG AAAATTGGTA TAATGATGCA GGGGAAGCAT TTTCTCCTCA GACAAATTAT
GCTGTTGGTG GTAATACAAA AATTTATGGA GCAGCCCTAA TTCGGATGCG GGAAAAAGAC
TTTGAGGCAG TGGAACATCA AGAAGGAATT TCTCCAGAGT GGTGTCTCAA ATACAAGGAC
TTTGAACCTT ATTACACTGA AGCAGAACAA CTCTATAAAG TTCATGGTCA CTCAAGCAAT
AATTCTAATG AACCTCATCA TAGTCAAGAA TATCCTTATC CCGCAGTTGA CCACGACCCA
CAAATTCAAA CAGTTGTCAA CGCTATCGCC GGCCAAGGTC TACATCCAGA AAACATACCA
TTAACTCTGA CTCGGGAACA GGAAGACCCC ACAGGAGACT CAGAAGTATT TGGTATTGCA
CCACTGCTCA ATTCTCCTAA CATTACTATT AAAACTAATG CTAAAGTAGT CTGTTTACTT
ACCAACTCTT CTGGTAATAC TGTCAAAGCA GTAGAAGTTG AAATTGACGA ACGATCATTT
CTATTTTTTG GGGATATTGT TGTAGTTGCT TGTGGAGCTG TAAATTCAGC AGCATTGTTA
TTACGTTCTG CTAACGAAAA ACATCCTCAT GGTTTAGCAA ACAGTTCTGG ACAAGTTGGG
CGCAACCTGA TGAAACATCA AATGACTGCT GTAGTGCAAC CTAGTCCAAA ACCTAATTCT
GGCAACTTTC TCAGAAGCGT TTGTGTAAAT GACTTTTATT GGGGAGATGA AAATTATTCT
TATCCCATGG GTCACATTCA GAATACAGGT GGGTTACTTC AAGATATTAC TTTTGCTGAA
TCTCCACCTA TCTTGTCTAT TTTAGCAAAA GCAATGCCTG AGGTGGGTTT GAAGCGGTTA
GCATTACGTT CTGTTGGTTG GTGGACATAT TCGGAGGTAT TACCAGATCC TAATAACCGC
ATTGAAGTTA AGGGAGACAA ACTTTTCTTC CATTACACTC CCAATAACTT GGAAGCACAC
GATCGCCTAG TTCATCGTTG GATAGAGGTA TTAAAGTCAG TAGACAAAGC GGTTGGAAAT
TCTTTCTTTG CAAGATCAGG AGGTAATGTT TACCCTCGTG GAGAAGCTCC TATCGAAGTA
GTTGCCAACC AGTGCGGTAC TTGTAAGTTT GGGGAAGATC CAGCTACATC AGTTTTAGAT
ATTAACTGTC GTACCCACGA TGTAGATAAT TTATATGTTG TAGATAGCAG TTTTTTCCCA
TCAAGTTCTG CTATAACTCC TGCTTTAACA ATTATTGCTA ATGCTTTACG AGTAGGGGAA
CATTTAAAGG AGCGTCTATA G
 
Protein sequence
MIIDDQHYDV IIVGTGAGGG TLAYKLAATG KKILILERGD FMPLEIQNRV NVDIFKKELY 
RAHENWYNDA GEAFSPQTNY AVGGNTKIYG AALIRMREKD FEAVEHQEGI SPEWCLKYKD
FEPYYTEAEQ LYKVHGHSSN NSNEPHHSQE YPYPAVDHDP QIQTVVNAIA GQGLHPENIP
LTLTREQEDP TGDSEVFGIA PLLNSPNITI KTNAKVVCLL TNSSGNTVKA VEVEIDERSF
LFFGDIVVVA CGAVNSAALL LRSANEKHPH GLANSSGQVG RNLMKHQMTA VVQPSPKPNS
GNFLRSVCVN DFYWGDENYS YPMGHIQNTG GLLQDITFAE SPPILSILAK AMPEVGLKRL
ALRSVGWWTY SEVLPDPNNR IEVKGDKLFF HYTPNNLEAH DRLVHRWIEV LKSVDKAVGN
SFFARSGGNV YPRGEAPIEV VANQCGTCKF GEDPATSVLD INCRTHDVDN LYVVDSSFFP
SSSAITPALT IIANALRVGE HLKERL