Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Afer_1331 |
Symbol | |
ID | 8323410 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidimicrobium ferrooxidans DSM 10331 |
Kingdom | Bacteria |
Replicon accession | NC_013124 |
Strand | - |
Start bp | 1385356 |
End bp | 1386750 |
Gene Length | 1395 bp |
Protein Length | 464 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 644952461 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003109930 |
Protein GI | 256372106 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTCACG GAACCAACCG ACACGAGTTC GCGAGGGCGA CCGAGGGCCA GCAGACGACA CGGAGTCGGC GGCGACTGCG GCGCTCCGGT GCCGTCGGTG TCGTTCTCGC AGGGTCGATG GCGCTGCTCG CTGCCTGCGG GTCGTCGTCG ACGACCTCGT CGACCTCGTC GACGACGTCT GGCGTCAATC TATCCAGTGC GCACGGCACG ATCACCTGGT GGGCGAGCCC CATTGCGCAG GTAGGAATCC GCGCCGCGCT GATCCGCGCC TTCGAGAAGG CCTATCCCAA GATTCACGTC AACTTGGTGA GTGCACCCAC CAACACCGAC ACCAACCGTG CGGATCTCGT CAACACCATT TCCGGTGGTG CGTCGACGCC CGACGTGTTC ATGGGTGACG TCATCTGGCC GGCCGAGTTC GGGGCGCATG GCTTCGCCGT CCCACTGTCG AACTATTTGC CGTCGAGCTA CTGGGCGAAG TTCGCCCCCG GTCTCGTGCA AGGCGCGACC TACAAGGGCA AGGTGTACGG ATCGCCACTG TTCGAGGACG AGGGCTTCCT CTACTACCGC AAGGACCTGC TCGCCAAGTA TCATCTCCCG GTCCCGAAGA CCTGGGAGCA GCTCGAGAGC GATGCTGTGA CGCTGGTCCA TGACAACGCG GTCAAGTACG GCTTCGTCTG GGAGGGCAAC AGCTACGAGG GGTTGACCTG TGACTTCATG GAGTACCTCA CGAGCGCTGG CGGCAGCGTC GTGAACAGCT CCTATACGAA GGCCACCATC GACTCGCCGG CGGCGTTGCG CGCGCTCACG TTCATGCGCA GCTTGATCAC CTCGGGCGCG TCGCCTGCGG CCGTCACGAC CTTCGAAGAG CCTCAGGCGA TGGCGGTCTT CGACTCGGGT CAGGCGGCCT TCCTGCGCAA CTGGGACTAT GCGTGGGCCA ACTCCCAGAC CCCATCGGAC TCGAAGGTCG TCGGTGATGT GGGTGTCGCC CCGTTGCCGA CCTTCCAGGG CGAGTCCTAT CCCGGGTACT CGAACATCGG CGGCTGGAAC CTCTACATCA ACCCGCACTC GAAGAACCTC GCCGCCGACC TGGTCTTCAT CAAGTGGATG TCCTCGACAC AGGCTCAGGA CATCCTCGGC AAGACCTACT CGGAGATCCC GACGGTCGAG TCGGTGCGGC TCGCGCTCGC GAAGGACCCG TCGATCGCAC CCCCGATCAA GGTGGCTGCC GAGACGCGGC TCGTGCCGCG GCCCGCGGGC ACGCCGAACT ACCCGCAGGT GTCCTCGGCG ATTTACACCA ACGTGAACGC GGCCCTGGCG GGGTCGATGA GCCCGAGCGC TGCCTTGAAG ACTGCGGCGT CGCAGATCAA CACGGCGCTC GCTGGCGGCA TCTAG
|
Protein sequence | MSHGTNRHEF ARATEGQQTT RSRRRLRRSG AVGVVLAGSM ALLAACGSSS TTSSTSSTTS GVNLSSAHGT ITWWASPIAQ VGIRAALIRA FEKAYPKIHV NLVSAPTNTD TNRADLVNTI SGGASTPDVF MGDVIWPAEF GAHGFAVPLS NYLPSSYWAK FAPGLVQGAT YKGKVYGSPL FEDEGFLYYR KDLLAKYHLP VPKTWEQLES DAVTLVHDNA VKYGFVWEGN SYEGLTCDFM EYLTSAGGSV VNSSYTKATI DSPAALRALT FMRSLITSGA SPAAVTTFEE PQAMAVFDSG QAAFLRNWDY AWANSQTPSD SKVVGDVGVA PLPTFQGESY PGYSNIGGWN LYINPHSKNL AADLVFIKWM SSTQAQDILG KTYSEIPTVE SVRLALAKDP SIAPPIKVAA ETRLVPRPAG TPNYPQVSSA IYTNVNAALA GSMSPSAALK TAASQINTAL AGGI
|
| |