Gene Lferr_1067 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagLferr_1067 
Symbol 
ID6877038 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidithiobacillus ferrooxidans ATCC 53993 
KingdomBacteria 
Replicon accessionNC_011206 
Strand
Start bp1042242 
End bp1043648 
Gene Length1407 bp 
Protein Length468 aa 
Translation table11 
GC content61% 
IMG OID642788947 
ProductAldehyde Dehydrogenase 
Protein accessionYP_002219516 
Protein GI198283195 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCGCACTG TGGACGAATT GCTGGCGGTG CAAAAATCTG CGCTCCGGGA AAACGGCGCC 
ATCGACTACA GGCAACGTAG GACGCTGCTG CGGGATCTTG CGCAGATGCT GCGGCAACAC
GGCAGGGCAT TCAGCGAAGC CATCGCCCGG GATTTCGGCA GACGCCACCC CCGCGAAACG
GAAATCTATG AAATCTATCC CTTGCAGGCG GAAATCGCCT ATGTGCTCAG CCATCTCAGG
AACTGGACGC GGCCTCGCGC CGTGCATACC CGCTGGCCAT TCCTGCCGGC CCGCAGTCAG
ATCACTCCGC AACCCGTGGG TGTCGTCGGC ATTATCGGCG CCTGGAATTA TCCCCTGCTC
CTGACCCTGC TGCCGTTGAT CTCCGCCATC GCCGCAGGCA ACCGGGCGAT TATCAAAGGG
CCGCGTCTGG CGCCTCAGAC CATGACGTTA TTGGCGCAAT ACCTGCGCGA TGTGACATCT
GAGGATACCA TTGCGCTGGT TCAGGGATCA CCTGATGTGG ATCACGCGTT TCCCGGACTG
CCTTTCGATC ATCTGATTTT TTCGGGAGCC ACCCGTACCG GACGGGTGAT CGCCCGTGCC
GCTGCGCGGA ATCTGGTGCC CGTGACACTG TCCATGAGTG GCAAGAGCCC GGCCATCATC
CAGTGTGATT ATCCCCTGGC CACGGCGGCC CGATCCATCA TGGCGGGCAA ACTGGTGAAT
GCCGGGCAGA CCTGCATCGC CCCCGATTAT TGTCTGGTGG CGGCGGATCA GCGCGACGAT
TTCATCGCGC TGGCGAAGTC GGCCGCTTTG TCGCTCTATC CCCACTGGGC AGACAACCCG
GACTATACCA GCATCCCCAA TGTGCTGCTG TGGGAACGTC TGGAGGGCCT GCTGCAGGAT
GCCCAGAGGA AAGGAGCGAT ATTATGGCAG CCCTCGCCAG CACCAGCACT GGCCGATGGC
GCACAGCGGC CCTTCCCGCC CACCCTGCTC TGGGATGTGC AGCCTGGCAT GAAAATCCTG
GAAGAAGAAA CCTTCGGGCC GATCTTGGTG GTTCTGACCT ACGATGACAT CCAGGAGGCA
CTGGATTATG TGCGCGATCA CCCAGCGCCT CTGGCGCTGT ATTATTTCGA CCGGGATCAG
CGGCGCGCCC TGCGGCACTG TAAGGGAATT GCCGCGGGAG GCGTCACCAT CAATGACACC
ATTTTCCATG TCGCACAACC AGGCATTCCT TTCGGCGGTA TCGGCCTGAG CGGCATAGGC
CAGTATCGCG GTATTTATGG GTTTCAACGC CTCTCGCACT ACCAGGGCGT CTTCAGGCAG
AATCGTCTGA GCGCCTGCGA ATGGGTTCGC CCCCCTTACG GACGCTGGAC GCGGCTGCTC
ATCGCCTGGC TGTCACGCTG GGGTTAA
 
Protein sequence
MRTVDELLAV QKSALRENGA IDYRQRRTLL RDLAQMLRQH GRAFSEAIAR DFGRRHPRET 
EIYEIYPLQA EIAYVLSHLR NWTRPRAVHT RWPFLPARSQ ITPQPVGVVG IIGAWNYPLL
LTLLPLISAI AAGNRAIIKG PRLAPQTMTL LAQYLRDVTS EDTIALVQGS PDVDHAFPGL
PFDHLIFSGA TRTGRVIARA AARNLVPVTL SMSGKSPAII QCDYPLATAA RSIMAGKLVN
AGQTCIAPDY CLVAADQRDD FIALAKSAAL SLYPHWADNP DYTSIPNVLL WERLEGLLQD
AQRKGAILWQ PSPAPALADG AQRPFPPTLL WDVQPGMKIL EEETFGPILV VLTYDDIQEA
LDYVRDHPAP LALYYFDRDQ RRALRHCKGI AAGGVTINDT IFHVAQPGIP FGGIGLSGIG
QYRGIYGFQR LSHYQGVFRQ NRLSACEWVR PPYGRWTRLL IAWLSRWG