Gene Acry_0459 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcry_0459 
Symbol 
ID5160686 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidiphilium cryptum JF-5 
KingdomBacteria 
Replicon accessionNC_009484 
Strand
Start bp511322 
End bp512632 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content62% 
IMG OID640552375 
Productextracellular solute-binding protein 
Protein accessionYP_001233602 
Protein GI148259475 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.420805 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAACGAC AGATCAGGAT TGCGGTGGGC GGACTCGCCC TCGGTTGCCT CGGCATTACC 
GCCGCCCATG CGACGACGCT GACGATCGCG ACGGTGAACA ACCCCGACAT GCTGCAGATG
CAGAAGCTGT CGCCGCAGTT CACCAAGGAA ACCGGCATCA AGCTGAACTG GGTGGTGCTG
CCTGAAAACA CCCTGCGCCA GCGCGTCACC ACGGACATCG CAACGAATTC CGGCAATTTC
GACATCGTCA CCGTCGGCTC CTACGAAGTG CCGATCTGGG GCAAGGCCGG CTGGCTCGCG
CCGATCAAGA ACCTGCCGGC GAGCTACGAC GTGAAGGACC TGTTTCCCTC GGTACGCAAC
GGGCTGTCCT ACAAGGGCAC ACTCTACGCT CTCCCGTTCT ACGCCGAAAG CTCGGCGACC
TATTACCGCA AAGATCTCTT CAAGGCTGCC GGCCTGACCA TGCCGGCGCA TCCGACCTAT
ACCGAAATCG AGAAGTTTGC CGCCAAGATC AACGATCCTT CGAAGGGCAT CTACGGCATC
TGTCTGCGCG GCCTGCCTGG ATGGGGCGAG AACATGGCCT ATTTCACCAC GCTGGTGAAC
ACGTTCGGCG GCCGCTGGTT CAACATGAAG TGGCAGCCCC GGATCGACAG CCCGGCCTGG
AAGAAAGCTG CCAACTTCTA CGTGAATCTC GAGAAGAAGT ACGGCCCCCC CAACGTTACC
TCGAACGGCT TCACCGAGAA TCTCGCGCTG TTCAGCCAGG GCAAGTGCGG GATGTGGATC
GACAGCACGG TCGCCGCCGG CACGCTTTGG GATCCAAAGA CCTCGAAGGT CGCGAACGAG
GTCGGCATGG TATCCGCCCC GGTGGCGGTC ACGCCTCATG GCGCGCACTG GCTCTGGGCC
TGGGCGCTCG CGATGCCGAA GACCACGCGC CACAAGGCGG ACGACATGAA GTTCCTCGAA
TGGGCGACGT CGAAGGCCTA TCTCAAGCTC GTCGGCAAGA CCTTCGGCTG GGTCCAGGTC
CCGCCCGGCA CCCGCATCTC GACCTATGAC AACCCCGACT ACACCAAGGC CGCTCCCTTC
GCCTCGAAAG TGAAGGAAGC GATCCTGAGC GCCGATCCGA ATAATCCGAC GCTCAAGAAA
GTCCCCTATA CCGGCGTGCA GTTCGTCGCG ATCCCTCAGT TTGAGGGCAT CGGTACCGAA
GTTGGCCAGC AGATCGCGGC AGCTCTCGCC GGTCAGAAAT CGGTCGACGC GGCGCTCGCC
CAGGCCCAGA AGGCAACGGC GCGGACGATG AAGGAAGCCG GTTACCACTG A
 
Protein sequence
MKRQIRIAVG GLALGCLGIT AAHATTLTIA TVNNPDMLQM QKLSPQFTKE TGIKLNWVVL 
PENTLRQRVT TDIATNSGNF DIVTVGSYEV PIWGKAGWLA PIKNLPASYD VKDLFPSVRN
GLSYKGTLYA LPFYAESSAT YYRKDLFKAA GLTMPAHPTY TEIEKFAAKI NDPSKGIYGI
CLRGLPGWGE NMAYFTTLVN TFGGRWFNMK WQPRIDSPAW KKAANFYVNL EKKYGPPNVT
SNGFTENLAL FSQGKCGMWI DSTVAAGTLW DPKTSKVANE VGMVSAPVAV TPHGAHWLWA
WALAMPKTTR HKADDMKFLE WATSKAYLKL VGKTFGWVQV PPGTRISTYD NPDYTKAAPF
ASKVKEAILS ADPNNPTLKK VPYTGVQFVA IPQFEGIGTE VGQQIAAALA GQKSVDAALA
QAQKATARTM KEAGYH