Gene Lferr_1921 details

Gene Information

Locus tag: Lferr_1921
Symbol:
ID: 6877906
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidithiobacillus ferrooxidans ATCC 53993
Kingdom: Bacteria
Replicon accession: NC_011206
Strand:
Start bp: 1910524
End bp: 1913412
Gene Length: 2889 bp
Protein Length: 962 aa
Translation table: 11
GC content: 55%
IMG OID: 642789791
Product: SMP-30/Gluconolaconase/LRE domain protein
Protein accession: YP_002220349
Protein GI: 198284028
COG category: [S] Function unknown
COG ID: [COG3391] Uncharacterized conserved protein
TIGRFAM ID: [TIGR02276] 40-residue YVTN family beta-propeller repeat
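
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and rests on two assumptions that the record does not state: that Start bp and End bp are 1-based inclusive coordinates, and that Gene Length counts the terminal stop codon. The variable names are invented for this example.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1910524
end_bp = 1913412
gene_length_bp = 2889
protein_length_aa = 962

# Assumption: coordinates are 1-based and inclusive,
# so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the gene length includes the stop codon,
# so the protein is one residue shorter than the codon count.
codon_count = gene_length_bp // 3
expected_protein_aa = codon_count - 1

print(span_bp == gene_length_bp)                 # True
print(gene_length_bp % 3 == 0)                   # True
print(expected_protein_aa == protein_length_aa)  # True
```

Under those assumptions the three fields agree with one another: 2889 bp is 963 codons, and 963 codons minus the stop codon gives 962 amino acids.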


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATAATA TGATCGCGAA GGCGGTGATC TGCGTCATCC TTGCAGGACC GCTGTGTTAC 
TTACCCTCTG CCTTGGCCAA CCCCCTGAAT CGGCATGTTT TGAGTTCGGT TCAACTTGAC
TCACAAAATA TAGCAAATTA CACCGCCACG TTGCCCACGG GCCGCATAGT GACGCCGGTC
GGCGCAATCA ACGGCACCCC GAATTTCCCC ACGATGGTAG CAGCAGATGG CAATCGTATT
GCCGTGCTGG CTAACGGTGC AACACCTTTT CAAACCATTA CGTTTTATGA CGGGAATAAT
CTTCGGCGGG TAGATCGCCT GGCAGCGTTT TCGAAAGTGG CACCCGTTCA ACCGATGGCT
TACGCAACAC CTGATGGTAG CGGTATCGCG ATCAATCCGC AACATGCGGG TGCGGCAGGC
GTGGTGTATA TTCCAAAAGA AGGCGTTGCA CTTAAAATTG CCCAGGCCAA AGCCACGCTT
GCCGCCGAGA GCGATTCCCA AGCCATTCCC AGTTCGGTCA TTTCGCACAG CGACTTTTTC
CAAGGGCTTA CTGCCGGTCC GGATGGAACC TTCTACGCTA CAGGAGGTGA TTCTGATCAG
GTGGTGGCTT TGCGCAGTGT GCAGGGCAAG GTTGAGGTGA TTCATCGGTA TCCTTTGCAG
TGGCAGGCTT TTCCGAAGGA TCAATATCCC TATCAGTATC AGGGGAACCA GCAGAAGAAA
TATCTCTTTT ACCCGGACTC GGTGGTCGTC GGTCCCCAGA ATAAGCATCT CTATGTGACG
GGGATGCTCG CCAACAGTCT GGCACGTATC AGTATCGCCA GTGGCAAGAC CGAATACGTC
AATGTCGGTG CCTATCCCTT TGCGGTGGCT CTGGCCGATA ACGGGCAGCG GTTGGTGGTC
AGCGACTGGG CAGGCAACGG CGTTGTGGTG CTGGACCGGC AACGCCTGAA GGTTCTGGGC
GAAATTCCCA CTGGGCCGGC GGTCGGCCCG TCAACAGTGG CGGCGGGTGT GCACCCTACG
GCGATGGTTG CGGTGCCCGA CAGTCCCGAC GTCTTTGTGG CCGACGCCAA TGTCGACAAG
GTGGTGGAAG TCGATACGCA AAACTTGCGC CCGCTGCAGG TGGTGGACGA CAGCCCCTAT
CCCGACGCCC CTCCCGGTAG CTATCCGGAT GGACTTGCGG TTGCCGATGG CAAGCTCTAT
ATCGCCAACG CCGGTAATAA TGATGTGGCG GTGTATGACA TCCGAACGGG AAAGCGTCTG
GGCCTGATTC CCACGGCCTG GTACCCGACC AGCCTGACGG TGGCGAATGG TGCCTTATAT
ATCAGCTGTG CCAAGGGTCT GGGTGCCGGG CCCAATCTAC AGTGGCAGTA TATCGGCAAT
ATGATGCACG GGGTGATCCA AAAGGTGGCT CTCCAGGAAA TCCCGGCCCA TCTGGCCGAT
TGGACCGAGA AGTCTTTGCA CAACGATGGT TTTACGTCGG CCCAGCGCAG CGCCCGTCAC
GAGGGAAATG TCAAAACCAC CACCTATCTG CGCAAGCATA TTCACTACGT TGTATTCATC
CTGCGTGAAA ATAAGACCTT TGATGAAGAT TTAGGGGATT ACACGGCGGC TGGCAAATGG
GCGGATCCAC ATTTTGATCT GTATAATCAG AAAGAATTGC CCAACTTGTA TAATCTGGCC
CATCATTATG CGCTGTTCGG GAATTTCATG GCCGACGGGG AAGTCACCGC ACAGGGTCAT
CAGTGGACCG ATGGCGCTTC GGATTCCGAT GTGGTGCAGC GCTTGTGGCC CGAATATTAT
TCTAACCGCG GTTTGCTCTG GAACGCCGGT CCCGGCGGTA GCTCTTCCCT CAAGCCCACC
GCACAGGGCG CACACAACCC CTATGATATC TATCAGCCGC TGGGAGATCA TACCAACCCC
TGGATCAGCT ATCCTGAAAA GCTCTATTTG TTCAATGATC TTTTGGAACA TCATATCCGT
TTCGAGGATT TTGGCGAGAA CGTGACGCGA CGCCGGGATG GTGTCATTCG TTCTGGCCTA
CTGCAGCATA TGGATGTCCG TTATCCCTAT TGGGATCGGA TGATTCTGGA TACGGCTCGT
GTGAGACTGG CCGAACACTG GCTGAAGGCC CATCCGGGCG TAAAATTTCC GCATTTCATT
TATATCTGGA TACCGGACGA CCACACTGCA GGCTTGTCTC CCTGCTATTA CGCGCCAGAT
TATTACGTAG CCAATAACGA TTACGCCACA GCCAAATTCA TCCATTACCT GTCCACGACG
CCGCAATGGA AGCATATGGC GATTTTCCTG ACCGAGGACG ATGCGCAGTC CGGTGCCGAT
CATATCAATG GTCATCGCAC TTTTGCGCTG GTCATCAGCC CCTGGGTCAA AAAAGGCGTG
CTGGAAACGC ATCTGAATTC CCAGGTGAAT ATCGTCAAAA CCATTGAAGC AACCTTGGGT
CTGCCGCCCA TGTCCCAGTG GGATGCCAAT GCCTCGGTCA TTGCGGGTAT CTGGACAGAT
CATCCGGATT TCGCGCCGAC GCCGGCGGTG CTTCCGATTC AGGTGCCGGT GTCTTTCAAT
CCGGGTAAAT GCAGCAATCG AACATTGCTC AGAAGAGAGG CTGGGGCCAC CGGTCATATG
CTGACTGCGG AATGGTTGAA GGCGCATACC GATCCCCACG GGCGGCGTCT GGCGCCGGTT
AGTGCGGCGA ACGCCTATAC CCCCACTTCG CTGCTCAAGG TCAGCGGTCC GGAGCAGTTG
AAGCAGGAAT GGATCGCCAG CAAGGGGGTC AAGAGCTATG ACCATTTTAT GGCCTACCTA
CATCACTATG CCCAAGTCCA TGGGGCGACC ATCGCCAGCT ATGAAGCCAA CGAAGGAAAA
CTCCATTGA
 
Protein sequence
MNNMIAKAVI CVILAGPLCY LPSALANPLN RHVLSSVQLD SQNIANYTAT LPTGRIVTPV 
GAINGTPNFP TMVAADGNRI AVLANGATPF QTITFYDGNN LRRVDRLAAF SKVAPVQPMA
YATPDGSGIA INPQHAGAAG VVYIPKEGVA LKIAQAKATL AAESDSQAIP SSVISHSDFF
QGLTAGPDGT FYATGGDSDQ VVALRSVQGK VEVIHRYPLQ WQAFPKDQYP YQYQGNQQKK
YLFYPDSVVV GPQNKHLYVT GMLANSLARI SIASGKTEYV NVGAYPFAVA LADNGQRLVV
SDWAGNGVVV LDRQRLKVLG EIPTGPAVGP STVAAGVHPT AMVAVPDSPD VFVADANVDK
VVEVDTQNLR PLQVVDDSPY PDAPPGSYPD GLAVADGKLY IANAGNNDVA VYDIRTGKRL
GLIPTAWYPT SLTVANGALY ISCAKGLGAG PNLQWQYIGN MMHGVIQKVA LQEIPAHLAD
WTEKSLHNDG FTSAQRSARH EGNVKTTTYL RKHIHYVVFI LRENKTFDED LGDYTAAGKW
ADPHFDLYNQ KELPNLYNLA HHYALFGNFM ADGEVTAQGH QWTDGASDSD VVQRLWPEYY
SNRGLLWNAG PGGSSSLKPT AQGAHNPYDI YQPLGDHTNP WISYPEKLYL FNDLLEHHIR
FEDFGENVTR RRDGVIRSGL LQHMDVRYPY WDRMILDTAR VRLAEHWLKA HPGVKFPHFI
YIWIPDDHTA GLSPCYYAPD YYVANNDYAT AKFIHYLSTT PQWKHMAIFL TEDDAQSGAD
HINGHRTFAL VISPWVKKGV LETHLNSQVN IVKTIEATLG LPPMSQWDAN ASVIAGIWTD
HPDFAPTPAV LPIQVPVSFN PGKCSNRTLL RREAGATGHM LTAEWLKAHT DPHGRRLAPV
SAANAYTPTS LLKVSGPEQL KQEWIASKGV KSYDHFMAYL HHYAQVHGAT IASYEANEGK
LH
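
The sequences above are printed in blocks of 10 characters, 60 per line, so they need to be joined before any analysis. The sketch below shows one way to strip that layout and compute a GC percentage for comparison with the GC content field. The helper names `clean_sequence` and `gc_percent` are hypothetical and do not come from the source database, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence dump."""
    return "".join(text.split()).upper()


def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(sequence)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = seq.count("G") + seq.count("C")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence as printed in this record.
first_line = (
    "ATGAATAATA TGATCGCGAA GGCGGTGATC "
    "TGCGTCATCC TTGCAGGACC GCTGTGTTAC"
)

joined = clean_sequence(first_line)
print(len(joined))          # 60 bases once the block spacing is removed
print(joined.startswith("ATG"))  # True: the CDS begins with a start codon
print(round(gc_percent(first_line), 1))
```

Running `gc_percent` on the full 2889 bp gene sequence, pasted in as a single string, gives a value to compare against the 55% reported in the Gene Information fields.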