Gene Acel_0421 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcel_0421 
Symbol 
ID4485979 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidothermus cellulolyticus 11B 
KingdomBacteria 
Replicon accessionNC_008578 
Strand
Start bp441859 
End bp443133 
Gene Length1275 bp 
Protein Length424 aa 
Translation table11 
GC content65% 
IMG OID639729188 
ProductABC transporter related 
Protein accessionYP_872181 
Protein GI117927630 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1134] ABC-type polysaccharide/polyol phosphate transport system, ATPase component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCAAA CCGCTACGTC CGATCGTCCA CCGGCGGTCA TCGTCGACCA CGTCACGAAG 
GATTTCAAGC TGTACCGGGA ACGCCGGACC AGCCTCAAGG AGCGGCTGAC GACTCGGCGG
CCGAGCCGTT ACGAGTTGTA CCGCGCGCTT AACGACGTCA GTCTCGTCGT GCCTGCGGGA
TCGACGTACG GCCTGATCGG AGCCAACGGA TCCGGCAAGA GCACGTTGCT GAAGTTGATG
GCGAACATCC ACCGCCCGAC CAGCGGCACT ATCACGCACA ACGGCAGGAT CACCGCGCTG
CTGGAGCTGG GTGCGGGCTT TCACCCGGAA CTTTCCGGCC GCGACAACGT CTACCTCAAT
GGGTCGATTC TGGGACTGAG CCGGAAGCAG GTGGCCGCGG CGATGGATGA CATCATCGAG
TTCAGCGGCC TCCGTGACTT CATTGACGCA CCGGTCAAGG TCTATTCGTC CGGAATGTAC
GTGCGTCTCG GCTTCTCGGT CGCGGTCAAC CTCGACCCGG AAATTCTCAT CATCGATGAG
GTCATTGCGG TGGGTGACGA GGAGTTTCAG CGCCGTTGCT TCGACCACCT GGCGAAGCTG
CGCCGGCGCG GCGTCACCAT CATTTTCGTC TCGCACAGCA TGCCGCTCGT GCAGACGCTC
TGCGATCGAG TGGCCTGGAT CGACCGGGGA GTGCTGCGGG CCGAAGGGGA TCCGAGCGAG
GTCGTCGACG CGTACCTCGC GGTGGTGAAC GCCGCCGAGC GGGAACGTCT CGCCGCGCAG
GGGGAAGCCG CGGCGTCCGG TGAGGCCGTG CGGCGCGGCA GCCGGGAGAT TGTCGTGAGC
AACGTGAGTT TTCTCGACGG CAACGGGGAA CCGACGCTCG CCGCCGTCGC CGGCGAGCGG
CTCGTCGTCC GGGTGCACTA CGCCGCCGCA CAACCGGTCA CCGCACCGGT CTTTGGCCTT
GCCTTCCATA CCGAAGGCGG CGTGCTCGTC AGCAGCCCGA ATACCGAATA TGCCGGCGTG
CGATTCGGCA CCCTGCACGG TGAGGGTTAC GTCGAGTACG TGCTCGACCA CCTTCCCCTC
ACCCCGGGGA CGTATCTCGT CTCGGCATCC ATCACGGACA CGTCGTTGCT GCACGTGTAC
GACCAGCGCG ACCGGGCGTT CACCTTGACC GTGCAACCTG GGCATTCCCT CGATCGCGGC
GGGGTGGTGA CCCTCGGTGG ACGCTGGTCG GTTGTCGGAT CAACGGTGGG CGAGCGGGTG
GAGGCAATGT CGTGA
 
Protein sequence
MTQTATSDRP PAVIVDHVTK DFKLYRERRT SLKERLTTRR PSRYELYRAL NDVSLVVPAG 
STYGLIGANG SGKSTLLKLM ANIHRPTSGT ITHNGRITAL LELGAGFHPE LSGRDNVYLN
GSILGLSRKQ VAAAMDDIIE FSGLRDFIDA PVKVYSSGMY VRLGFSVAVN LDPEILIIDE
VIAVGDEEFQ RRCFDHLAKL RRRGVTIIFV SHSMPLVQTL CDRVAWIDRG VLRAEGDPSE
VVDAYLAVVN AAERERLAAQ GEAAASGEAV RRGSREIVVS NVSFLDGNGE PTLAAVAGER
LVVRVHYAAA QPVTAPVFGL AFHTEGGVLV SSPNTEYAGV RFGTLHGEGY VEYVLDHLPL
TPGTYLVSAS ITDTSLLHVY DQRDRAFTLT VQPGHSLDRG GVVTLGGRWS VVGSTVGERV
EAMS