Gene Afer_1141 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAfer_1141 
Symbol 
ID8323211 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidimicrobium ferrooxidans DSM 10331 
KingdomBacteria 
Replicon accessionNC_013124 
Strand
Start bp1186377 
End bp1188215 
Gene Length1839 bp 
Protein Length612 aa 
Translation table11 
GC content64% 
IMG OID644952269 
Productextracellular solute-binding protein family 5 
Protein accessionYP_003109747 
Protein GI256371923 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.121856 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGAAGA TCACAGCACG CCACCGCCTC ATCGCGGGAC TTGGCACCAC CGCTGCCGCA 
GCCATGCTGC TCGCGGCGTG TGGCTCCTCG TCGAGCTCGA CCTCGTCCTC AAGCACGACC
AAGGTCTCGG GCGGTACGGC CTACTTCGCC GAGGCCGCCG GCGCAAACCC GAACTACATC
TTCCCCTTGA CCGGGGGCCC GTACTTCAGC GTCGACAACC TCGCCCAGTT CCAGATCCTG
ATGTACCGAC CGCTCTACTT CTTCGGGGTC GGCTCGACGC CAGAGATCAA CTACCAGCTC
TCCATCGGCA ACGCGCCCGT CTACTCCAAC AACGACACCA CCGTCACGAT CACCCTGAAG
CACTACATGT GGTCAGACGG CGAGCCGGTG ACCTCGCGCG ACGTCATCTT CTGGATGAAC
CTGCTCAAGG CGAACAAGGC CAACTGGGCC GCCTACGTGC CTGGCGCCTT CCCTGACAAC
GTCGCGTCGT ACTCGGCTCC GAATGCATCG ACCGTCGTCT TCCACCTCAC CGGCAAGGTC
AACCCGACGT GGTTCACCTA CAACGAACTG AGCCAGATCA CGCCGCTCCC CATCGCCTGG
GACCGCACCT CGCTCAGCCA GCCGGCACCC AGCCCGTCGG CCGCCAACTT GCCCGATACG
ACGACCTCGG GGGCCAAGGC GGTCTACACC TTCTTGAACT CTCAGGCCAC GAACGCCAAC
GCCTGGGCCT CGAGCCCGAT CTGGTCGGTG GTGGACGGCC CGTGGAAGCT GAAGAGCTCG
TCGACGACCG GGCGCATGGT GTTCGTGCCG AACCCGTCCT ACACCGGGCC GGTCAAGCCG
TCGCTGTCGG AGTTCGTGGA GCTGCCCTTC ACCTCAGACA CCGCTGAGTT CAACACGCTG
CGCGCCGGCA ACGGGGCGAT CACCTATGGT TACGTGCCGA CGGTGGACCT CAGCCAGGTG
CCTTACCTGA AGAGCATCGG CTATCGCATC GAGCCCTGGA TCGACTTCGG CTTCAACTAC
TTCGTCGAGA ACTTCAACAA CCCGACCTAC GGACCGCTGT TCAAGCAGAC GTACTTCCGT
CAGGCCTTCC AGCACCTCGT CGACCAGCCG CAGTGGATCT CGACCTTCCT GAAGGGCTAC
GGCGTACCGA GCTACTCGCC GGTGCCGCTC GCGCCGGCGA ATCCCTTCGC GGACGCCACC
TCGAAGACCA ACCCGTTCCC CTACTCGATC TCGGCAGCGA AGTCGCTCCT CACCAGCCAC
GGCTGGAAGA TGGTGAACGG CGTCATGACG TGCGAGAGCC CGGGCACGGC CGCCACCGAC
TGCGGTGCAG GCATCGCGAA GGGCTTCACG CTCACGCTGA ACCTGCAGTA CGCCTCGGGT
ACCACGTTCA TCACCCAGGA GATGGACGCG CTCCAGTCGG CAGCAGCGAG CGCGGGCATC
AAGATCAACC TCTCGCAGGC ACCCTTCAAC ACGGTCATCC ACAACGCCAC TCAGTGCTCC
GGTTCGAGCT GCACCTGGCA GATGGAGAAC TGGGGTGGCG GCTGGGAGTA CTCCCCGGAC
AACTACCCGA CCGGCGGTGA GATCTTCGGC ACCGGTGCTG GGTCGAACTT CGGCAACTGG
AACGATCCGA CCACCAACTC GCTGATCGCC GAGACCCACA CCTCGTCGAA TGCGCAGGCG
GCGCTCGATG CCTACCAGGA CTACATGGCC AAGGTCCTGC CGGTGGTCTA TCAGCCCGCC
GCCGATTACG CCATCTCCGC GATCTCGAAC AAGCTCCAGG GGGTCACTCA AAACCCGTAC
CTGAACTTGA CGCCCGAGAC GTGGTACTTC GTCAAGTAG
 
Protein sequence
MKKITARHRL IAGLGTTAAA AMLLAACGSS SSSTSSSSTT KVSGGTAYFA EAAGANPNYI 
FPLTGGPYFS VDNLAQFQIL MYRPLYFFGV GSTPEINYQL SIGNAPVYSN NDTTVTITLK
HYMWSDGEPV TSRDVIFWMN LLKANKANWA AYVPGAFPDN VASYSAPNAS TVVFHLTGKV
NPTWFTYNEL SQITPLPIAW DRTSLSQPAP SPSAANLPDT TTSGAKAVYT FLNSQATNAN
AWASSPIWSV VDGPWKLKSS STTGRMVFVP NPSYTGPVKP SLSEFVELPF TSDTAEFNTL
RAGNGAITYG YVPTVDLSQV PYLKSIGYRI EPWIDFGFNY FVENFNNPTY GPLFKQTYFR
QAFQHLVDQP QWISTFLKGY GVPSYSPVPL APANPFADAT SKTNPFPYSI SAAKSLLTSH
GWKMVNGVMT CESPGTAATD CGAGIAKGFT LTLNLQYASG TTFITQEMDA LQSAAASAGI
KINLSQAPFN TVIHNATQCS GSSCTWQMEN WGGGWEYSPD NYPTGGEIFG TGAGSNFGNW
NDPTTNSLIA ETHTSSNAQA ALDAYQDYMA KVLPVVYQPA ADYAISAISN KLQGVTQNPY
LNLTPETWYF VK