Gene Aaci_2049 details

Gene Information

Locus tag: Aaci_2049
Symbol:
ID: 8425571
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alicyclobacillus acidocaldarius subsp. acidocaldarius DSM 446
Kingdom: Bacteria
Replicon accession: NC_013205
Strand:
Start bp: 2098286
End bp: 2099602
Gene Length: 1317 bp
Protein Length: 438 aa
Translation table: 11
GC content: 57%
IMG OID: 645028166
Product: extracellular solute-binding protein family 1
Protein accession: YP_003185450
Protein GI: 258512016
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
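
The coordinate and length fields above are related by simple arithmetic. The sketch below shows one way to cross-check them, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon. The function names are hypothetical helpers, not part of any database API.

```python
# Values copied from the record above.
START_BP = 2098286
END_BP = 2099602


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 1317 438
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.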


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.266449
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAAACACA AAAACAAGGC CCTCATGGCG ACAGCCACAG CGGTGGTGAT GTGCGCGTGG 
ATGGCCACTG GATGCGGAGA CGAGGAGACC GGCGGCGGGG GCGGAATCCC CATGACCAAG
TCCTCCAACG CATCGGGACA AGCCGCGAGC ACCGCGCAAC CTGTGACCAT CACCTTTGAG
GAATCCATGC CTGGCAAGCT CGGGACCGAA CTCCAAAAGC TGACGAACGA GTTTGAGAAG
CAAAATCCCA ACATCCACGT GCAGCTCATC TTCAATGGCT CGTACAGCAC GTTGGAGCAG
AAGTTGACCG CCGCCATTGC GTCCGGCACA GAGCCCACGG TGGCGCAGGT CGAGGAGACG
TGGGAGACGA ATTACGTGCA GAACGGTCTG ATTGAGCCCC TCGACTCCGT CATTCCGAAA
TCCACGCAGA ACGATCTAAT CCCGATTTGG CGTCAGGACT CGACGTACAA CGGCAAGTTG
ATGTCCGTTC CCTTCAACAA GTCCGCCTAC GTCCTGTACT ACAATGTGGA CGATTTAAAG
AAGGCCGGGA TTTCTTCGCC GCCGACAACC TGGAGCCAGC TCGAACAGGA TGCCATCAAG
ATTCAAAAGA AAGAAGGCAT TCCGGGCCTC GGCCTGCAGG GCAACTATTA CACCTTTGAA
ATGCTCTTAA AACAAGCGGG TGGACAGATC CTGAACGCCA GAAACACCAA GGCGGCCTTC
GATAGCTCAG CCGGTCTCGC AGCGCTCCAT TTCATGAAGC GCTTGGTGGA TGAACACGCG
GCCAAGGTCA TCGGCGCGAA CGAATACCTC TCCGACGGTT TCAACACGAA CGAGTACGCG
ATGGACCTCG ACACGGTCGC CGCGATGTCG TTCATCAACA ACTCGAACCT CCATTGGAAA
GTTGCGCCGC TCCCAAAGGG CGTGACCTAC GCCGTCCCCA CGGCCGGTTT GAACTTGGTC
ATTTTCAACG CAGCGACATC CGCTCAGAAA GCGGCGGCGG CGAAGTATCT GAACTTCCTC
ATTTCTGTAC CATCGACCAT CGAGTGGGCG GAACAGACAG GCTACCTCCC CGTCCGCCAA
AGCGCACTCA CCAATTCGGC CTGGACGAGC TTCATCAAGA CCCATCCGAA TCAGGGGGTC
GCACCCAACG AGCTGAAATA CGCCTATTTC TCCCCTCGGC TCGCTTCGCT CTACTCCGCG
GAGCAAGAGA TGACCACACA GATTGGCAAC ATGCTGGCGG GTCGGCAGAC GCCGCAGGTG
ACCCTGCAGA ACATGGCGAA CATCACGAAC CAAGCGCTCG CCCAGGGCAA TTCATGA
 
Protein sequence
MKHKNKALMA TATAVVMCAW MATGCGDEET GGGGGIPMTK SSNASGQAAS TAQPVTITFE 
ESMPGKLGTE LQKLTNEFEK QNPNIHVQLI FNGSYSTLEQ KLTAAIASGT EPTVAQVEET
WETNYVQNGL IEPLDSVIPK STQNDLIPIW RQDSTYNGKL MSVPFNKSAY VLYYNVDDLK
KAGISSPPTT WSQLEQDAIK IQKKEGIPGL GLQGNYYTFE MLLKQAGGQI LNARNTKAAF
DSSAGLAALH FMKRLVDEHA AKVIGANEYL SDGFNTNEYA MDLDTVAAMS FINNSNLHWK
VAPLPKGVTY AVPTAGLNLV IFNAATSAQK AAAAKYLNFL ISVPSTIEWA EQTGYLPVRQ
SALTNSAWTS FIKTHPNQGV APNELKYAYF SPRLASLYSA EQEMTTQIGN MLAGRQTPQV
TLQNMANITN QALAQGNS
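
The protein sequence corresponds to the gene sequence translated with translation table 11. The following is a minimal standard-library sketch of that translation and of a GC-fraction calculation. It assumes that table 11 shares the standard codon assignments and differs only in which codons may initiate translation (an initiator such as GTG is read as Met). The helper names `clean`, `translate_cds` and `gc_fraction` are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiator codons assumed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spacing and line breaks used in the listing; upper-case."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a complete CDS; an initiator codon in first position is Met."""
    dna = clean(seq)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    protein = [CODON_TABLE[c] for c in codons]
    if codons and codons[0] in START_CODONS:
        protein[0] = "M"
    if protein and protein[-1] == "*":
        protein.pop()  # drop the terminal stop codon
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence listed above.
fragment = "GTGAAACACA AAAACAAGGC CCTCATGGCG"
print(translate_cds(fragment))  # prints: MKHKNKALMA
print(gc_fraction(fragment))    # prints: 0.5
```

Passing the full gene sequence to `translate_cds` and `gc_fraction` in the same way allows the Protein sequence and GC content fields to be checked against the listing.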