Gene Tery_1926 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_1926 
Symbol 
ID4242675 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp2984383 
End bp2985750 
Gene Length1368 bp 
Protein Length455 aa 
Translation table11 
GC content40% 
IMG OID638107047 
Productaldehyde dehydrogenase 
Protein accessionYP_721654 
Protein GI113475593 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.515635 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.086374 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGATCG CTACAGTTAA TCCAGCAACA GGAGAAGTCC TGAAAACTTT TGAACAAATT 
ACAGATACAC AAATAGAGGC TAAACTAGAG TTAGCAGAAA AAACTTTTCG TGCCTATTGT
CAAACTTCTA TAACTCAACG TGGAGAATGG TTGTTAGCAG CAGCAGACAT TTTAGAAAAA
AATGCTGAGA AATTTGGTAA GATAATGACT CTAGAGATGG GTAAAACTAT AAGTGGAGCG
ATCGCCGAAG CTAAAAAATG TGCTCTAGTC TGTCGTTACT ATGCAGAAAA GGCTACTGAG
TTTCTGGCTG ATGTTCCTGC ACAAACTGAT GCTAGTAAAT CATTTGTTCG TTATCAACCA
ATTGGTCCAG TGCTAGCGGT TATGCCCTGG AATTTTCCTT TTTGGCAGGT TTTCCGTTTT
GCAGCACCAG CTTTAATGGC AGGAAATGTG GGTTTATTGA AACACGCTTC TAATGTTCCT
CAATGTGCTT TGGCTATTGA GGAAATTTTT CAAGAAGCAG GTTTCCCAGA AGGTGTATTT
CAAACTCTTT TGATCAGCTC AGATAAAGTG TCTGGTATTA TGATGGATGA CCGGGTCAAA
GCAGGAACTT TAACTGGCAG TGAACCTGCG GGTGCAAGTT TAGCGGCAAC AGCAGGTAGA
GCTATTAAAA AAACGGTCTT GGAACTTGGG GGTAGTGACC CTTTTATAGT ATTAGAAAGT
GCTGATTTAG AAACAGCAGT TACAACAGCA GTTACAGCTA GGATGCTAAA TAATGGTCAA
TCTTGTATTG CAGCTAAACG TTTTATTTTG GCAGATGCGA TCGCTGATCA ATTTCAAGAG
GGTTTGGTAG AAAAATTTGA GGCTTTAAAA GTAGGAGACC CTATGTTGCC AGATACTAAT
ATTGGTCCTT TGGCAACTCC ATCTATTCTT GAAGAGTTAA ATGCTCAAGT GGAAGCTTCT
GTGGAGAAAG GAGCGAAAAT TCTCACAGGT GGTCATCTTT TATCTGACCT TCCTGGAAAT
TTTTACCCTC CGACAATTTT AGCTGAGATA CCAATAAGTT CTCCTGCTTA TCAGGAAGAA
TTTTTTGGTC CGGTAGCTTT AGTCTTTCGC GTTGCTAATA TTGATGAAGC AATAAATTTG
GCAAATAATA CACCTTTTGG TTTAGGTGCA AGTGCATGGA CTAAAGATAC GGGAGAGACG
GAAAGATTAA TCTCAGAATT AGAGGCTGGT GCTGTTTTTA TCAACGGTTT AGTTAAGTCT
GATCCACGTC TGCCTTTTGG TGGAATTAAA CGTTCTGGTT ATGGTCGGGA ACTGAGCAGG
GAAGGAATTT TGGAATTTGT CAATATTAAG ACTGTTTGGG TTAAATAA
 
Protein sequence
MQIATVNPAT GEVLKTFEQI TDTQIEAKLE LAEKTFRAYC QTSITQRGEW LLAAADILEK 
NAEKFGKIMT LEMGKTISGA IAEAKKCALV CRYYAEKATE FLADVPAQTD ASKSFVRYQP
IGPVLAVMPW NFPFWQVFRF AAPALMAGNV GLLKHASNVP QCALAIEEIF QEAGFPEGVF
QTLLISSDKV SGIMMDDRVK AGTLTGSEPA GASLAATAGR AIKKTVLELG GSDPFIVLES
ADLETAVTTA VTARMLNNGQ SCIAAKRFIL ADAIADQFQE GLVEKFEALK VGDPMLPDTN
IGPLATPSIL EELNAQVEAS VEKGAKILTG GHLLSDLPGN FYPPTILAEI PISSPAYQEE
FFGPVALVFR VANIDEAINL ANNTPFGLGA SAWTKDTGET ERLISELEAG AVFINGLVKS
DPRLPFGGIK RSGYGRELSR EGILEFVNIK TVWVK