Gene Information |
Locus tag | Afer_1141 |
Symbol | |
ID | 8323211 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidimicrobium ferrooxidans DSM 10331 |
Kingdom | Bacteria |
Replicon accession | NC_013124 |
Strand | + |
Start bp | 1186377 |
End bp | 1188215 |
Gene Length | 1839 bp |
Protein Length | 612 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644952269 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003109747 |
Protein GI | 256371923 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
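The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS is complete and ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and chosen only for illustration.

```python
# Values copied from the Gene Information table above.
START_BP = 1186377
END_BP = 1188215
GENE_LENGTH_BP = 1839
PROTEIN_LENGTH_AA = 612


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one stop codon.

    Assumes an unspliced, complete CDS whose final codon is a stop codon.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(START_BP, END_BP)
computed_protein_length = expected_protein_length(computed_gene_length)

print("gene length matches:", computed_gene_length == GENE_LENGTH_BP)
print("protein length matches:", computed_protein_length == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks agree with the table: 1188215 - 1186377 + 1 = 1839 bp, and 1839 / 3 - 1 = 612 aa.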
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.121856 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGAAGA TCACAGCACG CCACCGCCTC ATCGCGGGAC TTGGCACCAC CGCTGCCGCA GCCATGCTGC TCGCGGCGTG TGGCTCCTCG TCGAGCTCGA CCTCGTCCTC AAGCACGACC AAGGTCTCGG GCGGTACGGC CTACTTCGCC GAGGCCGCCG GCGCAAACCC GAACTACATC TTCCCCTTGA CCGGGGGCCC GTACTTCAGC GTCGACAACC TCGCCCAGTT CCAGATCCTG ATGTACCGAC CGCTCTACTT CTTCGGGGTC GGCTCGACGC CAGAGATCAA CTACCAGCTC TCCATCGGCA ACGCGCCCGT CTACTCCAAC AACGACACCA CCGTCACGAT CACCCTGAAG CACTACATGT GGTCAGACGG CGAGCCGGTG ACCTCGCGCG ACGTCATCTT CTGGATGAAC CTGCTCAAGG CGAACAAGGC CAACTGGGCC GCCTACGTGC CTGGCGCCTT CCCTGACAAC GTCGCGTCGT ACTCGGCTCC GAATGCATCG ACCGTCGTCT TCCACCTCAC CGGCAAGGTC AACCCGACGT GGTTCACCTA CAACGAACTG AGCCAGATCA CGCCGCTCCC CATCGCCTGG GACCGCACCT CGCTCAGCCA GCCGGCACCC AGCCCGTCGG CCGCCAACTT GCCCGATACG ACGACCTCGG GGGCCAAGGC GGTCTACACC TTCTTGAACT CTCAGGCCAC GAACGCCAAC GCCTGGGCCT CGAGCCCGAT CTGGTCGGTG GTGGACGGCC CGTGGAAGCT GAAGAGCTCG TCGACGACCG GGCGCATGGT GTTCGTGCCG AACCCGTCCT ACACCGGGCC GGTCAAGCCG TCGCTGTCGG AGTTCGTGGA GCTGCCCTTC ACCTCAGACA CCGCTGAGTT CAACACGCTG CGCGCCGGCA ACGGGGCGAT CACCTATGGT TACGTGCCGA CGGTGGACCT CAGCCAGGTG CCTTACCTGA AGAGCATCGG CTATCGCATC GAGCCCTGGA TCGACTTCGG CTTCAACTAC TTCGTCGAGA ACTTCAACAA CCCGACCTAC GGACCGCTGT TCAAGCAGAC GTACTTCCGT CAGGCCTTCC AGCACCTCGT CGACCAGCCG CAGTGGATCT CGACCTTCCT GAAGGGCTAC GGCGTACCGA GCTACTCGCC GGTGCCGCTC GCGCCGGCGA ATCCCTTCGC GGACGCCACC TCGAAGACCA ACCCGTTCCC CTACTCGATC TCGGCAGCGA AGTCGCTCCT CACCAGCCAC GGCTGGAAGA TGGTGAACGG CGTCATGACG TGCGAGAGCC CGGGCACGGC CGCCACCGAC TGCGGTGCAG GCATCGCGAA GGGCTTCACG CTCACGCTGA ACCTGCAGTA CGCCTCGGGT ACCACGTTCA TCACCCAGGA GATGGACGCG CTCCAGTCGG CAGCAGCGAG CGCGGGCATC AAGATCAACC TCTCGCAGGC ACCCTTCAAC ACGGTCATCC ACAACGCCAC TCAGTGCTCC GGTTCGAGCT GCACCTGGCA GATGGAGAAC TGGGGTGGCG GCTGGGAGTA CTCCCCGGAC AACTACCCGA CCGGCGGTGA GATCTTCGGC ACCGGTGCTG GGTCGAACTT CGGCAACTGG AACGATCCGA CCACCAACTC GCTGATCGCC GAGACCCACA CCTCGTCGAA TGCGCAGGCG GCGCTCGATG CCTACCAGGA CTACATGGCC AAGGTCCTGC CGGTGGTCTA TCAGCCCGCC GCCGATTACG CCATCTCCGC GATCTCGAAC AAGCTCCAGG GGGTCACTCA AAACCCGTAC 
CTGAACTTGA CGCCCGAGAC GTGGTACTTC GTCAAGTAG
|
Protein sequence | MKKITARHRL IAGLGTTAAA AMLLAACGSS SSSTSSSSTT KVSGGTAYFA EAAGANPNYI FPLTGGPYFS VDNLAQFQIL MYRPLYFFGV GSTPEINYQL SIGNAPVYSN NDTTVTITLK HYMWSDGEPV TSRDVIFWMN LLKANKANWA AYVPGAFPDN VASYSAPNAS TVVFHLTGKV NPTWFTYNEL SQITPLPIAW DRTSLSQPAP SPSAANLPDT TTSGAKAVYT FLNSQATNAN AWASSPIWSV VDGPWKLKSS STTGRMVFVP NPSYTGPVKP SLSEFVELPF TSDTAEFNTL RAGNGAITYG YVPTVDLSQV PYLKSIGYRI EPWIDFGFNY FVENFNNPTY GPLFKQTYFR QAFQHLVDQP QWISTFLKGY GVPSYSPVPL APANPFADAT SKTNPFPYSI SAAKSLLTSH GWKMVNGVMT CESPGTAATD CGAGIAKGFT LTLNLQYASG TTFITQEMDA LQSAAASAGI KINLSQAPFN TVIHNATQCS GSSCTWQMEN WGGGWEYSPD NYPTGGEIFG TGAGSNFGNW NDPTTNSLIA ETHTSSNAQA ALDAYQDYMA KVLPVVYQPA ADYAISAISN KLQGVTQNPY LNLTPETWYF VK