Gene Acel_1049 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcel_1049 
Symbol 
ID4484831 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidothermus cellulolyticus 11B 
KingdomBacteria 
Replicon accessionNC_008578 
Strand
Start bp1155376 
End bp1156398 
Gene Length1023 bp 
Protein Length340 aa 
Translation table11 
GC content63% 
IMG OID639729824 
Productinner-membrane translocator 
Protein accessionYP_872808 
Protein GI117928257 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1172] Ribose/xylose/arabinose/galactoside ABC-type transport systems, permease components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.186781 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.0406542 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGGAGC ACTCGTCCGT GGCGGATTCC GTCAAGAGCC AGGACCGCGC GACTCCGGCG 
GTCCTGGCGG TCATCAGCGC GCTCCTGAGG CAACCGGCGC GGCTATGGCG CTCCGCCGCC
ATTCTCATGC CCTTTCTGGC CCTGTTCATC GCCCTGGCCG TGGCGAGTCC ACCCTTCCTG
TCAAAGACCA ATCTGCTCAA CATTCTCGAT CAACAGGCGG CGACACTCAT CATCGCCGCA
GCCGGCACTC TCGTCCTCAT CAGCGGCGGC CTCGACCTCT CCGTCGGTGC GACGTATGCG
CTCGCCGGGG TCGCGTCGGC GATGGTCGCG CAGCACCATC CGGCTCTCCT CGGCGCGCTG
ATCGCCGTTG CGATCGGAAT AGGCATCGGC GTCGTCAACG GAATCGTGAG CACGCTCTTC
CGCATCAACT CGCTCATCGC CACACTGGCG ATGTCCTTCA TCGTGAGCGG CCTGGCAGCT
CGCATTTCCA ACGGAAATCT CATCGTCCTC ACCAGCAATC GTGAATACGC ACGTATCGCG
CAGACCGAAT TCCTCACCAT TCGCACCTCG ATTTGGACAA TGCTGGTGGT CGTGCTGGCA
CTCGGGTTTC TCCTTGCCCG CACGACGTTC GGCCGATATC TATACGCCGT CGGCGGCAAC
ATCGAGGCTG CCCGGCTCGC CGGCGTACGG ATCAACGCCA TCCGAGTTGT CGCCTTCACC
CTCAGCGGCT TTGCCGCCGC CCTGGGCGGC GTCATCGACA CATCGCGCGT GCTCAGCGCC
CAAGCCAACA ACGGCAGCAC ACTCGCCTTC ACCGTGCTCG CAGGAATCGT CGTCGGGGGA
ACGTCAATCC TCGGCGGCGA AGGAGCGATC TGGCGCACGG TCGTCGGGGT ACTCTTCATC
GCCCTGATCG GAAACGGTTT CGATCTGCTC GGGCTGAACC CGCTCTACCA GCAGATGACC
CTCGGCGCCA TTCTTCTGCT TGCCGTCGGT ATCGACGCGT GGTATCGGTT ACGCCAGCCG
TAA
 
Protein sequence
MMEHSSVADS VKSQDRATPA VLAVISALLR QPARLWRSAA ILMPFLALFI ALAVASPPFL 
SKTNLLNILD QQAATLIIAA AGTLVLISGG LDLSVGATYA LAGVASAMVA QHHPALLGAL
IAVAIGIGIG VVNGIVSTLF RINSLIATLA MSFIVSGLAA RISNGNLIVL TSNREYARIA
QTEFLTIRTS IWTMLVVVLA LGFLLARTTF GRYLYAVGGN IEAARLAGVR INAIRVVAFT
LSGFAAALGG VIDTSRVLSA QANNGSTLAF TVLAGIVVGG TSILGGEGAI WRTVVGVLFI
ALIGNGFDLL GLNPLYQQMT LGAILLLAVG IDAWYRLRQP