Gene Tery_2669 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_2669 
Symbol 
ID4245164 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp4135837 
End bp4136874 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content35% 
IMG OID638107736 
Productaldo/keto reductase 
Protein accessionYP_722335 
Protein GI113476274 
COG category[C] Energy production and conversion 
COG ID[COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.634348 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATACA ACAAGCTTGG CGATAGTAAC CTTAATGTTT CAGAAATTTG TTTAGGCACG 
ATGACCTATG GACTACAAAA TACTATTGAA GAAGCTCATC AACAACTTAA TTATGCTGTT
GCTGAAGGTA TTAATTTTAT TGATACGGCA GAAATGTATC CCGTGCCAAG TAGGGCTGAT
ACTCAAGGAA AAACAGAGGA ATATATTGGT GAATGGTTAG TTAAACAACA ACGGGATAAG
TTAATTATTG CTACAAAAGT TACTGGTCCT AGTCCTAGAA TTACATGGAT TCGTGGGGAA
AATCGTAAAG TTAACCGAGC TAATATTCAG CAAGCAATAG AAGATAGTTT GCGTGCTCTT
CAGACTGACT ATATTGACCT TTATCAAATC CATTGGCCTG ACCGTTATGT GCCTCTATTT
GGTGCCCCAG ATTATGATCC AAATAATGAG TGGGATTCGA CGCCTATTGC GGAGCAATTA
GAAGTATTTG CAGAGTTAAT TAAAGCTGGA AAAATTCGTT ATTTAGGAGT AAGTAATGAG
ACGGCTTGGG GACTGTGTGA ATTTTGTCAT CTAGCAGAAA AATTAGGTTT ACCAAAAATA
GTTTCGATTC AAAATGCTTT TAGTTTAGTG AATCGTGTTT TTCATATTAA TATAGCTGAA
GCTTGTCGAT TTAATAATGT GGGATTAATG GCTTATAGTC CTCTAGCTTT TGGTATTTTA
ACAGGTAAAT ATTTGCAAGG AGTTCCTGAG AATTCTCGTC TAGCTTTTTT CCCCGGATTT
GACCAACGTT ATCGTAAAAC AAATTTGACT GAAGCGATAA AAAGTTATAT AGAAATTGCC
AATAAAAATA ATATGACTCC GGCACAATTA GCATTAGCTT ATGTGAATTC TCGATGGTTT
GTAGCTAGTA CAATTATTGG AGCAACAACT ATGGAACAAC TGAAGGAAAA TATTAGTAGT
GTGGAGATAA GTTTGACTGA GGAAATTATT GCTGAAATTG ATACAGTTCA TGCTAAATAT
CCTAATCCTA CACCTTAG
 
Protein sequence
MKYNKLGDSN LNVSEICLGT MTYGLQNTIE EAHQQLNYAV AEGINFIDTA EMYPVPSRAD 
TQGKTEEYIG EWLVKQQRDK LIIATKVTGP SPRITWIRGE NRKVNRANIQ QAIEDSLRAL
QTDYIDLYQI HWPDRYVPLF GAPDYDPNNE WDSTPIAEQL EVFAELIKAG KIRYLGVSNE
TAWGLCEFCH LAEKLGLPKI VSIQNAFSLV NRVFHINIAE ACRFNNVGLM AYSPLAFGIL
TGKYLQGVPE NSRLAFFPGF DQRYRKTNLT EAIKSYIEIA NKNNMTPAQL ALAYVNSRWF
VASTIIGATT MEQLKENISS VEISLTEEII AEIDTVHAKY PNPTP