Gene Afer_1331 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAfer_1331 
Symbol 
ID8323410 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidimicrobium ferrooxidans DSM 10331 
KingdomBacteria 
Replicon accessionNC_013124 
Strand
Start bp1385356 
End bp1386750 
Gene Length1395 bp 
Protein Length464 aa 
Translation table11 
GC content65% 
IMG OID644952461 
Productextracellular solute-binding protein family 1 
Protein accessionYP_003109930 
Protein GI256372106 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTCACG GAACCAACCG ACACGAGTTC GCGAGGGCGA CCGAGGGCCA GCAGACGACA 
CGGAGTCGGC GGCGACTGCG GCGCTCCGGT GCCGTCGGTG TCGTTCTCGC AGGGTCGATG
GCGCTGCTCG CTGCCTGCGG GTCGTCGTCG ACGACCTCGT CGACCTCGTC GACGACGTCT
GGCGTCAATC TATCCAGTGC GCACGGCACG ATCACCTGGT GGGCGAGCCC CATTGCGCAG
GTAGGAATCC GCGCCGCGCT GATCCGCGCC TTCGAGAAGG CCTATCCCAA GATTCACGTC
AACTTGGTGA GTGCACCCAC CAACACCGAC ACCAACCGTG CGGATCTCGT CAACACCATT
TCCGGTGGTG CGTCGACGCC CGACGTGTTC ATGGGTGACG TCATCTGGCC GGCCGAGTTC
GGGGCGCATG GCTTCGCCGT CCCACTGTCG AACTATTTGC CGTCGAGCTA CTGGGCGAAG
TTCGCCCCCG GTCTCGTGCA AGGCGCGACC TACAAGGGCA AGGTGTACGG ATCGCCACTG
TTCGAGGACG AGGGCTTCCT CTACTACCGC AAGGACCTGC TCGCCAAGTA TCATCTCCCG
GTCCCGAAGA CCTGGGAGCA GCTCGAGAGC GATGCTGTGA CGCTGGTCCA TGACAACGCG
GTCAAGTACG GCTTCGTCTG GGAGGGCAAC AGCTACGAGG GGTTGACCTG TGACTTCATG
GAGTACCTCA CGAGCGCTGG CGGCAGCGTC GTGAACAGCT CCTATACGAA GGCCACCATC
GACTCGCCGG CGGCGTTGCG CGCGCTCACG TTCATGCGCA GCTTGATCAC CTCGGGCGCG
TCGCCTGCGG CCGTCACGAC CTTCGAAGAG CCTCAGGCGA TGGCGGTCTT CGACTCGGGT
CAGGCGGCCT TCCTGCGCAA CTGGGACTAT GCGTGGGCCA ACTCCCAGAC CCCATCGGAC
TCGAAGGTCG TCGGTGATGT GGGTGTCGCC CCGTTGCCGA CCTTCCAGGG CGAGTCCTAT
CCCGGGTACT CGAACATCGG CGGCTGGAAC CTCTACATCA ACCCGCACTC GAAGAACCTC
GCCGCCGACC TGGTCTTCAT CAAGTGGATG TCCTCGACAC AGGCTCAGGA CATCCTCGGC
AAGACCTACT CGGAGATCCC GACGGTCGAG TCGGTGCGGC TCGCGCTCGC GAAGGACCCG
TCGATCGCAC CCCCGATCAA GGTGGCTGCC GAGACGCGGC TCGTGCCGCG GCCCGCGGGC
ACGCCGAACT ACCCGCAGGT GTCCTCGGCG ATTTACACCA ACGTGAACGC GGCCCTGGCG
GGGTCGATGA GCCCGAGCGC TGCCTTGAAG ACTGCGGCGT CGCAGATCAA CACGGCGCTC
GCTGGCGGCA TCTAG
 
Protein sequence
MSHGTNRHEF ARATEGQQTT RSRRRLRRSG AVGVVLAGSM ALLAACGSSS TTSSTSSTTS 
GVNLSSAHGT ITWWASPIAQ VGIRAALIRA FEKAYPKIHV NLVSAPTNTD TNRADLVNTI
SGGASTPDVF MGDVIWPAEF GAHGFAVPLS NYLPSSYWAK FAPGLVQGAT YKGKVYGSPL
FEDEGFLYYR KDLLAKYHLP VPKTWEQLES DAVTLVHDNA VKYGFVWEGN SYEGLTCDFM
EYLTSAGGSV VNSSYTKATI DSPAALRALT FMRSLITSGA SPAAVTTFEE PQAMAVFDSG
QAAFLRNWDY AWANSQTPSD SKVVGDVGVA PLPTFQGESY PGYSNIGGWN LYINPHSKNL
AADLVFIKWM SSTQAQDILG KTYSEIPTVE SVRLALAKDP SIAPPIKVAA ETRLVPRPAG
TPNYPQVSSA IYTNVNAALA GSMSPSAALK TAASQINTAL AGGI