Gene Lferr_1064 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagLferr_1064 
Symbol 
ID6877035 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidithiobacillus ferrooxidans ATCC 53993 
KingdomBacteria 
Replicon accessionNC_011206 
Strand
Start bp1038587 
End bp1040470 
Gene Length1884 bp 
Protein Length627 aa 
Translation table11 
GC content60% 
IMG OID642788944 
Productthiamine biosynthesis protein ThiC 
Protein accessionYP_002219513 
Protein GI198283192 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0422] Thiamine biosynthesis protein ThiC 
TIGRFAM ID[TIGR00190] thiamine biosynthesis protein ThiC 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.932567 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTACCC GTCCCACGGA TATGGTGAAT ACTATGCAGG ACGCAGCGGT AACCTGCGGC 
CCGATCCCCG GCTCACACAA GCGTTACCTC GGTGGCGTGC GTTTCCCGGA ATTGCGCATT
CCCTTGCGGG AGATTCGGCA GAGCGACTCC CGGAAGCGGG ATGGCAGCCT CGAAGTCAAT
CCGGCCATCC CGGTCTATGA TTGCAGCGGT CCCTACACCG ACCCGGACGT GGTCATCGAT
ATTCATGCGG GCCTGCCCGC CATGCGCTGG CAATGGTCGC CGACGGATAC CGCCATCGAA
AGCCGCCCCG AGCCCGGTTC CGCCTATGGT CGCGCCCGTC TGGCGGATGT CCGCACGGCG
GATCTGCGTT TTCATCATCG CCGCGAAATT CGCCGGGCGG CCAACGGCGG CAACGTCTCC
CAGATGCACT ATGCGCGGCG CGGTATCATC ACCCCCGAGA TGGAGTTCGT CGCCATCCGC
GAGAATCAGG GTTTGGAGCA CCTGCGTGCC AGCCATCCGG CGCTGTTTCG CGCCCATCGC
GGCGAGTCCT TCGGCGCGCA GATCCCCGAC ACCATTACCC CGGAATTTGT CCGCGATGAA
ATCGCCCGGG GCCGCGCCAT CATCCCCGCC AATATCAATC ACCCGGAACT GGAGCCGATG
ATCATCGGCC GGAATTTCCT GGTGAAGATC AACGCCAATA TCGGCAACTC CGCGGTCACC
TCGTCCATTG CCGAAGAAGT GGAAAAAATG GTCTGGTCGA TCCGCTGGGG CGCCGATACG
GTCATGGATT TATCCACCGG CAGGCATATT CACGAAACCC GCGAATGGAT TATCCGCAAT
AGCCCGGTTC CCATCGGGAC CGTGCCGATT TATCAGGCCC TGGAGAAGGT CGATGGGAAG
GCCGAAGACC TCACCTGGGA TTTGTTCCGG GATACACTCA TTGAGCAGGC GGAGCAGGGC
GTCGATTATT TTACCATCCA TGCCGGTGTG CTGCTGCGTT ATATCCCGCT GACGGCGAAT
CGTCTTACCG GCATCGTGTC GCGCGGTGGT TCCATCATGG CCAAATGGTG CCTCGCGCAC
CACCAGGAGA GCTTTCTCTA CACCCATTTT GAAGAAATCT GCGAGATCAT GAAGGCCTAC
GACGTGAGCT TCTCCCTCGG CGATGGTCTG CGTCCCGGCT CTCTGGCCGA CGCCAATGAC
GAGGCGCAGT TCGCCGAGCT GCATACCCTG GGTGAACTCA CCCGGATTGC CTGGAAGCAC
GATGTGCAGG TCATGATCGA AGGCCCCGGC CATGTCCCCA TGCAACTCAT CAAGGCTAAT
ATGGAGGAGG AGCTACAGCA CTGTTTCGAG GCGCCGTTCT ACACGCTCGG ACCGCTGACC
ACGGATATTG CCCCCGGCTA TGACCATATT ACCAGCGCCA TCGGCGCCGC GCAGATCGGC
TGGTACGGGA CCGCCATGCT CTGCTACGTC ACCCCCAAGG AGCATCTCGG CCTGCCGGAT
AAAAATGACG TGCGGGAAGG GGCCATCACC TACAAGATTG CCGCCCATGC CGCCGACCTG
GCCAAGGGGC ATCCCGGCGC ACAAGTGCGG GATAACGCCC TGTCCAAGGC GCGTTTTGAG
TTCCGCTGGG AAGACCAGTT TCACCTCGGA CTGGACCCCG AAAAGGCGCG GGAATATCAC
GACGAGACCT TACCCCAGGA AGGCGCCAAG GCGGCGCACT TCTGTTCCAT GTGCGGTCCC
CATTTCTGCT CCATGAAAAT CTCCCAGGAT TTGCAGGAAT ACGCGCAATC CAAGGGTGAA
GATATCGAGA CGGCACGACT GGAAGGCCTG CAGGAAAAGG CAGAGGATTT CAAGCGGCTG
GGGAAGAATA TCTACCTTAG ATAA
 
Protein sequence
MATRPTDMVN TMQDAAVTCG PIPGSHKRYL GGVRFPELRI PLREIRQSDS RKRDGSLEVN 
PAIPVYDCSG PYTDPDVVID IHAGLPAMRW QWSPTDTAIE SRPEPGSAYG RARLADVRTA
DLRFHHRREI RRAANGGNVS QMHYARRGII TPEMEFVAIR ENQGLEHLRA SHPALFRAHR
GESFGAQIPD TITPEFVRDE IARGRAIIPA NINHPELEPM IIGRNFLVKI NANIGNSAVT
SSIAEEVEKM VWSIRWGADT VMDLSTGRHI HETREWIIRN SPVPIGTVPI YQALEKVDGK
AEDLTWDLFR DTLIEQAEQG VDYFTIHAGV LLRYIPLTAN RLTGIVSRGG SIMAKWCLAH
HQESFLYTHF EEICEIMKAY DVSFSLGDGL RPGSLADAND EAQFAELHTL GELTRIAWKH
DVQVMIEGPG HVPMQLIKAN MEEELQHCFE APFYTLGPLT TDIAPGYDHI TSAIGAAQIG
WYGTAMLCYV TPKEHLGLPD KNDVREGAIT YKIAAHAADL AKGHPGAQVR DNALSKARFE
FRWEDQFHLG LDPEKAREYH DETLPQEGAK AAHFCSMCGP HFCSMKISQD LQEYAQSKGE
DIETARLEGL QEKAEDFKRL GKNIYLR