Gene Lferr_2600 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagLferr_2600 
Symbol 
ID6878599 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidithiobacillus ferrooxidans ATCC 53993 
KingdomBacteria 
Replicon accessionNC_011206 
Strand
Start bp2574719 
End bp2576221 
Gene Length1503 bp 
Protein Length500 aa 
Translation table11 
GC content48% 
IMG OID642790457 
Productextracellular solute-binding protein family 5 
Protein accessionYP_002221001 
Protein GI198284680 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATGTCA ACTCTTTGCT GCCGGTGGTC AATAATGATT CCAGTGCCAA TGCGCAGATA 
ATCAGCTTGA TGTTTCGGCC TCTTTTGTGG ATTGGCACCA ATCTGAAAAT CAACTGGCCG
CAATCCATTG CCAAAAGTAT TACCGTATCG CCGAACCGGC GAAAGTTTAT CATTCATCTC
AAGGTTTGGC GTTGGTCTGA CGGGAGACCG GTTACGGCTG AAGATACTTT GGCTTGTCTT
ACGCTGATTC GCCAATACGG GCCAAGATAC CCCAACGCGG GAATGGGAGG AATACCGAAT
ATTATCGAAA GCGCAGTGGT TATTGATCCT CGGACTCTGG AAATCACACT GAAACGTTCG
GTGAACCCAA CCTGGTTTGA ACTGAATGGA TTGTCTCAGT TGTTTCCAGT GCCCGCCTGG
CGCTGGAAAC ACTATTCCAT AGATACACTG TCCAAACTTC AGGATAACCC GGCTATGGTT
TCCGTGGTGG ATGGTCCATA CAACCTGCAA CGTTTCGTCC CTGGTCGGAG CATCAGCTTT
ATACGAAACC CTCATTATTC CGGAAATCCA TCCGCTTTGG AGCACCTACA TTTTAAAATG
TACACCTCTG ATTCCAGTGC CTTTTGGGCT CTGAAAACAG GCACCATTCA GGCGGGAATG
ATTCCGCATT ACTTGTATGC AGCGCGTAGC ATGGTAAAAA ACCTCAAAAC ATGCGTCAGT
AACGGTGGTT ATGGTTTTAA TTATGTGACG TTGAATCTCA CCAATCCCCA GGTAGCTTTT
TTTCGGAATG TAAAGGTGAG GCAGGCGCTT GCTCTGGCGA TCAACCAGAC ACAGATCATC
CAAATTGCAT TTCATGGATT AGGCGTCCCC AGTTTTAATC CGGTACCCAC TAATCCCGAT
ACGTATCTTT CTCCAGAGAT GAAGAAGCTC GTGGCACGCC CAGCCCTCGC CTATAATCCT
TCTGCTGCGA AACAGTTACT AGCGGAGGCG GGATGGCAAC CAGGTCTGGA TGGGGTCCGG
ATGCGAAATG GGCAGCGTCT TCAGTTTACG ATGATGGTTC CAGACACCAG CCAGACGCTG
ATAGCCGTGG CGGAAATGTT GAAAGCGGAC TGGCAGGCTG TTGGTATAGA TATGCGCCTG
CGCGTCCTGC CATTCAATCT GGAACTAGCT AAATTGCACC CCCATGGGAA ATGGGATGCT
TCCATGATCG TCTGGTCCTA TGATCCGGAT TACTATCCTA GCGGTGATGG TTTGTTTAAT
ACTGGTGGTG GTAGTAATTA TGGGGATTAT AGCAACTCCA TGATGGACAA GCTGGTTCGC
GATAGCACAG AAAAAAACAG TACAAAATTT TTGTATCAAT ATGAGAATTA TGCGTACGCT
CAGCAACCGG TGATTTTTCT ACCTTATCCG AAGTACGTTG TGAAATATAC TCAAGACTTG
ACCCACGCAC AGTTGATGGA AGGTGTTTAT TCTGTAGATT GCCATCCTCA ACGCCTACAC
TGA
 
Protein sequence
MNVNSLLPVV NNDSSANAQI ISLMFRPLLW IGTNLKINWP QSIAKSITVS PNRRKFIIHL 
KVWRWSDGRP VTAEDTLACL TLIRQYGPRY PNAGMGGIPN IIESAVVIDP RTLEITLKRS
VNPTWFELNG LSQLFPVPAW RWKHYSIDTL SKLQDNPAMV SVVDGPYNLQ RFVPGRSISF
IRNPHYSGNP SALEHLHFKM YTSDSSAFWA LKTGTIQAGM IPHYLYAARS MVKNLKTCVS
NGGYGFNYVT LNLTNPQVAF FRNVKVRQAL ALAINQTQII QIAFHGLGVP SFNPVPTNPD
TYLSPEMKKL VARPALAYNP SAAKQLLAEA GWQPGLDGVR MRNGQRLQFT MMVPDTSQTL
IAVAEMLKAD WQAVGIDMRL RVLPFNLELA KLHPHGKWDA SMIVWSYDPD YYPSGDGLFN
TGGGSNYGDY SNSMMDKLVR DSTEKNSTKF LYQYENYAYA QQPVIFLPYP KYVVKYTQDL
THAQLMEGVY SVDCHPQRLH