Gene Lferr_1066 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagLferr_1066 
Symbol 
ID6877037 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidithiobacillus ferrooxidans ATCC 53993 
KingdomBacteria 
Replicon accessionNC_011206 
Strand
Start bp1040965 
End bp1042212 
Gene Length1248 bp 
Protein Length415 aa 
Translation table11 
GC content61% 
IMG OID642788946 
Productglucose-methanol-choline oxidoreductase 
Protein accessionYP_002219515 
Protein GI198283194 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2303] Choline dehydrogenase and related flavoproteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCGCAT CCGATGTGAT CATTGCCGGT TCCGGACCTG CTGGCGCCAT GCTGGCCAGA 
GACCTGACGC GCGCCGGGGC TAGCGTGCGG ATTCTGGAAC GGGGCGGGGA AACCCCGGCG
TCAGCGCCGA ATTTATGGAA GCTCTGGCGC CGCAAGGAGG CATTACATGT GGCGCCGGGG
GTTGCCCTGC TCCGTGGCAC ACGGGTTGGA GGTGGTTCCA CCGTTTTCTA CCACACGGCC
ATATCTCCTC CCCTGGAGAT GTTTTCCCAC CATGGCGTGG AAATGGCGCA GGATTTGGCG
GCTGTTCTTG CAGAACTACC GCACCAACCC CTGCAGGCGC CACTTCTGGG TCCCACTGCG
CAACGCATCA TCGACGCGGC GCGCTCTTTA GGGCTGCCCT GGCAGACGTT GCCGAAAATG
ATTGATCAGG AACTCTGCGG CCATGGGGAC TGCCCACCTG CAGCTTTCTG GTCTGCGGCG
TCGCTGCTGG CGCAAGCAAT GCAACTGGGC GCCGAACTGG AAACCGGCAT CCAGGTGAAC
AGGGTGCTTT TTCAGAATGG GCGTGCGGTG GGTGTGGAGG CCCTTCAGAA GGGGCAGCTA
CGCCGGTTCA TGGGAGGGAC GGTCATTCTG TCAGCGGGTG GTGTGGCAAG TCCCGTAATT
TTACAGCGCA GTGGGATCCG CGAGGCGGGG CGGGGATTTT TTTGCGATCC CTTGCGCGTT
GGTGTGGCAA TCGGTCGGGG AAATGGGCTC CAGGAGGCAG AAATGCCCAT GACAGCGGGA
TTCGTAGATC GGGAAGCCGG TTACATGCTG ACGGATATGA CTGTTCCGCC CAATTTTTAT
CGCGCTTTTG CGTGGGCAGC CGGCAGAGTC GACATGCTTG CGCATTACCG GCATAGTATG
ATGATCATGG TGAAAATCCG CGATGAAATC AGCGGTGAAG TGAATGCCAA CGGACGCGTC
TGGCGACACT TTTCCGCAGC CGAAAAAAAT CGCATGCGCA ACGGCATGGG ACTGGCTGCC
GACATCCTCC GGGCGGCGGG CGGTAGACGG ATATTCTTCT CGCCCTGGCT GGCCGCGCAC
CCCGGCGGCA GCGTGCGTCT GGGCGAACTG CTGGATGAGC GGCTATCCTG TTGCACCCCC
AATCTCCATG TCTGCGATGC GTCGGTGATC CCCGAACCCT GGGGTTTACC CCCGACGCTG
ACGGTATTAT CCCTGGCGAA GTATTTGGGC CGCATATTAC TTGGGTGA
 
Protein sequence
MPASDVIIAG SGPAGAMLAR DLTRAGASVR ILERGGETPA SAPNLWKLWR RKEALHVAPG 
VALLRGTRVG GGSTVFYHTA ISPPLEMFSH HGVEMAQDLA AVLAELPHQP LQAPLLGPTA
QRIIDAARSL GLPWQTLPKM IDQELCGHGD CPPAAFWSAA SLLAQAMQLG AELETGIQVN
RVLFQNGRAV GVEALQKGQL RRFMGGTVIL SAGGVASPVI LQRSGIREAG RGFFCDPLRV
GVAIGRGNGL QEAEMPMTAG FVDREAGYML TDMTVPPNFY RAFAWAAGRV DMLAHYRHSM
MIMVKIRDEI SGEVNANGRV WRHFSAAEKN RMRNGMGLAA DILRAAGGRR IFFSPWLAAH
PGGSVRLGEL LDERLSCCTP NLHVCDASVI PEPWGLPPTL TVLSLAKYLG RILLG