Gene Acel_2049 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcel_2049 
Symbol 
ID4484728 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidothermus cellulolyticus 11B 
KingdomBacteria 
Replicon accessionNC_008578 
Strand
Start bp2320454 
End bp2321761 
Gene Length1308 bp 
Protein Length435 aa 
Translation table11 
GC content64% 
IMG OID639730845 
Productextracellular solute-binding protein 
Protein accessionYP_873807 
Protein GI117929256 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCTGCCT GCAGCTCCGG CAAGAGCACG GGAGGCTCCA GCGGAACGTC TCCCAGCGCC 
GGCGGGTCAA GCAGCACGAA CGCCAGCGCG TCGAGCAGCG GCGGGGCGTC ATCGTCCTGT
CCGTTCCCGA CCGACTCCGT CACGCTGACG TGGTGGCACA ACGCCACAGC CGATCCGGGG
AAGGCGGCGT GGCAGAAGAC GGCCGACGAT TTCCACGCAC AGCACCCGAA CGTGAGCTTC
AACATTGTGC CCATTCAAAA TGAGCAATTC ACGACAAAGG TGCCGGCCGC ACTCGAGTCC
AACAATCCGC CGTCTCTGTA CCAGCAGTGG GGCGGGGGAT CCGAAGCGAC GCAGGTCAAG
TCCGGCAAAC TGATGGACAT GACGGCCTGT GTCTCGAGCT GGGTCGACCG GCTCGGCCCG
TCGGCCAAGG GTTGGCAGGT CGACGGCAAG TGGTACGGAA TTCCGTACGA CTACCACATC
GTCGGCTTCT GGTACCGCAC GGACCTGTTC CAAAAGGCGG GCATCACCTC GCCGCCGAAG
ACGATGGACG AGCTGTACCA AGACATCGAC AAGCTGAAGG CGGCAGGGAT CACCCCGATC
GCGCTGGGCG GCAAGGACCG TTGGCCGGAC GCCTTCTACT GGGAGTACTT CGTGCTTCGG
GAATGCCCGA AGGACACGGT GACGTCGTCC ATCGCCAACG TCAAATTCTC TGACCCCTGC
TTCGTTAAGG CCGGTCAGGA CATGAAGAAG TTCCTTGACG CCAAGCCGTT CCAGACCGGA
TTCCTTGGCA CGCCCGCACA ACAAGGCGCC GGCAGCTCGG CCGGCTTGGT GGCTAACGGC
AAGGCGGCAA TGGAGCTGCA GGGTGACTGG GAAATCCTGG TCATGCCGTC GCTCACCCAG
GACAAGAACT TCGCGTCGAA ACTCGGCTGG TTCCCCTTCC CGTCGGTGTC CGGCGGTGCG
GGTGACCAGA ACGCCGGACT CGGCGGTGGC GACGGTTTCA GCTGCACCTA CAAGGCCACC
AACGCCTGCC CGGCGTTCCT GGAGTACATC ACCAGCGCCG ATGTCCAGCG CTACCTGGTG
AAGCAGAGCG CCGTCAGCCT GCCGTCCAAC AGCGAGGCAA GCGACGCCAT CACCGACCCC
ACGCTGAAGA CGGTCCTTCA GTACATCGGA ACGGTGTCGT ACAACCAGCT GTACTTCGAC
CAGGCGCTGC CGACCGATGC CGGACAGGCG CTTGACTCGG CGGTCGCCGA CTTCTTCGCG
GGTTCCGGCA GTCCGGAGAG CCTGGCGGCG TCGGTGTCGT CGAAGTAA
 
Protein sequence
MAACSSGKST GGSSGTSPSA GGSSSTNASA SSSGGASSSC PFPTDSVTLT WWHNATADPG 
KAAWQKTADD FHAQHPNVSF NIVPIQNEQF TTKVPAALES NNPPSLYQQW GGGSEATQVK
SGKLMDMTAC VSSWVDRLGP SAKGWQVDGK WYGIPYDYHI VGFWYRTDLF QKAGITSPPK
TMDELYQDID KLKAAGITPI ALGGKDRWPD AFYWEYFVLR ECPKDTVTSS IANVKFSDPC
FVKAGQDMKK FLDAKPFQTG FLGTPAQQGA GSSAGLVANG KAAMELQGDW EILVMPSLTQ
DKNFASKLGW FPFPSVSGGA GDQNAGLGGG DGFSCTYKAT NACPAFLEYI TSADVQRYLV
KQSAVSLPSN SEASDAITDP TLKTVLQYIG TVSYNQLYFD QALPTDAGQA LDSAVADFFA
GSGSPESLAA SVSSK