Gene Acel_1050 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcel_1050 
Symbol 
ID4484832 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidothermus cellulolyticus 11B 
KingdomBacteria 
Replicon accessionNC_008578 
Strand
Start bp1156395 
End bp1157927 
Gene Length1533 bp 
Protein Length510 aa 
Translation table11 
GC content64% 
IMG OID639729825 
ProductABC transporter related 
Protein accessionYP_872809 
Protein GI117928258 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1129] ABC-type sugar transport system, ATPase component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.322688 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.0473655 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGTCCG ACGCCCACGG CGAAGGAGGT TTCTCTTCCA TCCAGCTCGT CGGCGTCACC 
AAACGCTTCG GTGGTACCCG TGCGCTCCGC GAGGTCAATC TCACGGTGCG CGCGGGTACC
GTCCACGCGC TCATCGGTGA GAACGGCGCA GGAAAATCGA CGATAGCGAA GATTATCGGC
GGCGTCCTGG CGCCCGACAC GGGTGAACTC CGCATTGACG GTACGCCTGT GACCTTCCGC
TCACCTCGCG ATGCCCGTGC GCACGGCATT GCCACCATCG CCCAGGAACT CACCATCGTG
CCCAGTCTCT CCGTAGCCGC CAACGTCTTT CTCGGCACCG AGCCTCGGAC GACAGGAATA
ATCCGCAGCC GCCGGCTGCG CGCACAATTC GAAGAGTTGG CGGGCGAAGT GGGATTTGAG
CTTCCACCGG ACGTCCCTGC AGGCGCGCTG CGTCCAGCCG ATCAACAAAA AATAGAGATT
CTGCGCGCCC TGCGCAGCGG TGCTTCGCTG ATCGTCATGG ACGAGCCGAC CGCGACTCTG
CCGGACAACG AAGCCGCCCG CTTGCACGAC GTCATCAAGT CCCTCGCCCG CTCCGGGAAG
GCCGTGCTTC TGATCTCACA CTTCCTCTCG GAGGTTCTCG CACTGTCGGA CGTCATCACG
ATTCTGCGCG ACGGCCTCGT CGTTCGCACC TCGCCGGCGG GGAGGGAGAC ACGTGAATCC
CTGATCGAGG GCATGCTCGG CCGCACACTC ACGTCTGTCT TCCCGGAGAA ACGCCTTCCG
GCCGCCTCAG CCGCGACCAT TCTGGAAGCA CGACAGGTGA CGGCTCCCGG TGTCGCAGGA
GTGTCGTTCT CCCTCCGCCG CGGCGAGATC CTTGGGCTGG CGGGACTCGT CGGAGCCGGC
CGGAGTGAAA TCGCCCACGC GATTTACGGT TCGCGCCGAC CGCATGCGGG CGAACTCGTC
GTGCGCGGCC GCTCCCAGCG ATTCGCGACA CCCAGCGCAG CCTTACGGTC AGGAATCGCT
CTGATCCCCG AATCCCGCAA GGACCAAGGA CTGCTGTTGC GCCGATCCAT CCGCGAAAAC
GTCACCCTCA ACATCCTCCG CACCGTCAGC CGCGGCGGTT GGATCAACAA GCGACGTGAG
GCACGCGTCG TCCGCCACAT TCTGGACGCG CTCAACGTTC GCGCGGCCAA CATTGACGCA
CCCGTCGCCA CGCTCTCCGG CGGAAACCAA CAGAAGGTCC TCTTCGCCCG GGCCTTGTTA
AGCCAGCCGG CGGTGCTCAT TGCCGATGAA CCGACTCGAG GCGTCGATGT CGGTGCACGT
CACGCCATCT ACGAACTTCT TGCCGGTGAG GCCGACAGCG GGACGGCGAT TGTCGTCATT
TCATCGGACG TCGAAGAAAT TCTCGGTCTC TCGCATCGCG CCCTCGTCAT TCGAGCCGGA
AGAATCGCCG CGGAGCTCAC GGCCGATCAG CTCAACGAGC ATAACGTGAT CACTGCGGCG
TTCGCCGAGC ACTCTGGGAC GCGGGAGGAA TGA
 
Protein sequence
MKSDAHGEGG FSSIQLVGVT KRFGGTRALR EVNLTVRAGT VHALIGENGA GKSTIAKIIG 
GVLAPDTGEL RIDGTPVTFR SPRDARAHGI ATIAQELTIV PSLSVAANVF LGTEPRTTGI
IRSRRLRAQF EELAGEVGFE LPPDVPAGAL RPADQQKIEI LRALRSGASL IVMDEPTATL
PDNEAARLHD VIKSLARSGK AVLLISHFLS EVLALSDVIT ILRDGLVVRT SPAGRETRES
LIEGMLGRTL TSVFPEKRLP AASAATILEA RQVTAPGVAG VSFSLRRGEI LGLAGLVGAG
RSEIAHAIYG SRRPHAGELV VRGRSQRFAT PSAALRSGIA LIPESRKDQG LLLRRSIREN
VTLNILRTVS RGGWINKRRE ARVVRHILDA LNVRAANIDA PVATLSGGNQ QKVLFARALL
SQPAVLIADE PTRGVDVGAR HAIYELLAGE ADSGTAIVVI SSDVEEILGL SHRALVIRAG
RIAAELTADQ LNEHNVITAA FAEHSGTREE