Gene Lferr_1744 details


Gene Information

Locus tag: Lferr_1744
Symbol:
ID: 6877726
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidithiobacillus ferrooxidans ATCC 53993
Kingdom: Bacteria
Replicon accession: NC_011206
Strand:
Start bp: 1720783
End bp: 1721790
Gene Length: 1008 bp
Protein Length: 335 aa
Translation table: 11
GC content: 58%
IMG OID: 642789613
Product: zinc-binding alcohol dehydrogenase family protein
Protein accession: YP_002220173
Protein GI: 198283852
COG category: [R] General function prediction only
COG ID: [COG1064] Zn-dependent alcohol dehydrogenases
TIGRFAM ID: [TIGR02822] zinc-binding alcohol dehydrogenase family protein
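
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length. The function names are illustrative and do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one uncounted stop codon."""
    if gene_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_bp // 3 - 1


length = gene_length_bp(1720783, 1721790)
print(length)                     # 1008
print(protein_length_aa(length))  # 335
```

Both results agree with the Gene Length and Protein Length values listed in the record.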


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.0255685
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCATGCCA TGGTTCTGGA AGAGGTGGGT AGGCCTTTGG TGCCCACTGA ATTGCCGCGA 
CCGCGGCCAC AGCCGGGCCA GGTCCTGGTA AAAATACTGG CGTGCGGAGT GTGCCGTACC
GATCTGCATG TAGTGGACGG TGAATTGCCC AACCCCAAAC TACCACTCGT TCCCGGCCAT
GAAGTAGTCG GCCAGATAGA ATCGGTGGGA AGCCCCGATA TTTCCTTACA GACCGGTCAG
ATGGTGGGAA TTCCCTGGCT CGCATGGACC TGCGGTGCCT GCGAATATTG TCGAGCAGGA
CGGGAGAATC TTTGTGACCA GGCGCGTTTT CATGGCTACA CCGTGGATGG CGGTTACGCC
GAGTATATGG TCGCCGATGC GCGTTACTGC TTCCCTCTTC CAGACATTTA CGCCAATCCG
GAAGGTGCCC CGCTGTTGTG TGCGGGGCTC ATCGGCTTTC GGGCCTTACG TTTTGCCGCG
GGAAGACGAC GCCTGGGTCT TTATGGATTC GGCGCTGCCG CGCATTTACT GATTCAGGTA
GCGCGTTATC AAGGCATGGA GGTCTATGCA TTTACCCGCC CGGGCGATAG CAAGGCACAG
GATCTAGCGA TCAAATTGGG GGCTGTGTGG GTAGGTGGAT CGGAGGTCCT GCCGCCACAA
CCGCTGGATG CGGCAATTTT GTTTGCACCG GTTGGAGCCC TGATACCTAT TGCTCTTCAG
GCGGTCAAGA AAGGGGGTGT TGTGATCAGT GCGGGGATTC ACATGTCGGA TATCCCAGCT
TTCCCCTATT CCTTACTCTG GGAGGAGCGA CAGGTGCGAT CGGTTGCCAA CCTGACGCGC
AAAGATGCCG AGGACTATTT CCCACTGGCA CGACGGGTCC CGGTGCAAAG CCATATCACG
ACGTATCCTT TGGCCATGGC GAATGTGGCA TTGGCGGATT TGAGAGGTGG CGCGGTCCAC
GGTGCCGCGG TACTGGTTAT GGGGGCATGG CAGGAACGCG AGACGTGA
 
Protein sequence
MHAMVLEEVG RPLVPTELPR PRPQPGQVLV KILACGVCRT DLHVVDGELP NPKLPLVPGH 
EVVGQIESVG SPDISLQTGQ MVGIPWLAWT CGACEYCRAG RENLCDQARF HGYTVDGGYA
EYMVADARYC FPLPDIYANP EGAPLLCAGL IGFRALRFAA GRRRLGLYGF GAAAHLLIQV
ARYQGMEVYA FTRPGDSKAQ DLAIKLGAVW VGGSEVLPPQ PLDAAILFAP VGALIPIALQ
AVKKGGVVIS AGIHMSDIPA FPYSLLWEER QVRSVANLTR KDAEDYFPLA RRVPVQSHIT
TYPLAMANVA LADLRGGAVH GAAVLVMGAW QERET
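
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a sketch only: it builds the standard codon assignments, which translation table 11 shares (table 11 differs only in the start codons it permits), and it translates the first 30 nucleotides of the gene rather than the full sequence. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace that groups the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


first_block = "ATGCATGCCA TGGTTCTGGA AGAGGTGGGT"
print(translate(first_block))  # MHAMVLEEVG
```

The printed residues match the first ten residues of the protein sequence above. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.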