Gene Acel_0206 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcel_0206 
Symbol 
ID4485292 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidothermus cellulolyticus 11B 
KingdomBacteria 
Replicon accessionNC_008578 
Strand
Start bp220075 
End bp221874 
Gene Length1800 bp 
Protein Length599 aa 
Translation table11 
GC content68% 
IMG OID639728969 
Producthypothetical protein 
Protein accessionYP_871966 
Protein GI117927415 
COG category[S] Function unknown 
COG ID[COG2898] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.914985 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.140787 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCGTG AGGCTCGGAC GACGACCACC CGCGACGCGG CGCGGCAACC GGTGAGGCGC 
CGGGATCGGC GTCGCCACCG TCCCTGGGTG CCTGCGCTCG CGGGCTGGGT GGCGGCCCTC
GTCGGGATTC GGAATTTCAT CGTCGTCCTG CACCCCCACT GGTGGGAGCG GGTTCGTCCC
GTCGGCAAGG TGCTGCCCGC ACCGGGGGAA ACACCGCTGC TCCACGGGCT GGCGGCCGCT
GAGCTGGTCG GTTCAGGCGC GCTCATGCTG CTCCTCGCGC ACGCGCTCAA ACGCCGCAAG
CGGCGGGCAT GGCAGGTCGT CGTCGGCGTT CTCGCCGTCG GGCTGGCCCT GCACGTGGTG
CATCACCCGC CGCTGCACGG CATCCCGGGA TGGCTGGTCA TCGACGGCGG TTTCCTCATC
GGCCTGCTGG CTTTCCGGAA GGAATTCTTC GCCGAACCGG ATCCCCGGAC CCGGTGGAGC
GCCTTGGTCA CCTTCGTCGG ACTTGGCGTC GCGAGTTACG GGTTCGGCCT GGTCCTGGTG
GGGCTGCGCA AGGACCAGCT GGTCGGCCAC CCGTCGTGGA CGGACATTCT CCTGCACGTC
GGGTACGGAC TCGTCGGCAT CCCAGGCCCG CTCGTCTTTC GCACCGACGC CGGCGCGGAC
GTCGTCTCGG CCACCCTGCT CGCAATGGGC GCCCTGACGG TCTTCAGCAC CGCGTATCTG
CTCCTCCGTG CCGCGAAACC GCGACCGGCC CTCACGCCAG ACGACGAAGC GCGGATGCGC
GCGCTGCTTG CCGCGCACGG ACACCGCGAC TCTCTGGGTT ACTTTGCGTT GCGGCGGGAC
AAGAGCGTGG TCTGGTCCGA GACGGGCAAG TCCTGCATTG CATATCGCGT GGTCTCGGGC
GTGATGCTGG TGAGTGGTGA TCCGCTCGGT GATCCGGAGG CTTGGCCGGG CGCGATCGAC
GTCTTCCTCG AGCAGGCCGA ACGGCACGCT TGGGTGCCTG CCGTCCTGGG GTGCAGCGAA
CGCGCCGCGG AAACGTGGCT CCGGCATGCC GATCTTGCGG CTCTGGAAAT TGGCGACGAA
GCCGTCATTG ATGTCGCGGA TTTCTCCCTT GAGGGCCGGG CGATGCGCAA TGTGCGGCAA
ATGGTGCACC GGGTCGAGCG GGCCGGCTAT GACGCCGTCA TCGCCCGAAA TTGCGACCTG
GCCCCCGACC TGCGCGACCA GTTGCGGGCC GCCGCCGTGC GGTGGCGCGA CGGGGAGACC
GAACGCGGTT TCGCCATGGC GCTGGGCCGG CTCGGTGACG TCACGGACCC CGACTGTCTG
TTCGCGGTAG CCGTCAAGGA CGGCCGTCCC CATGCGTTCC TGCATTTCGT CCCGTGGGGA
CGCGACGGCC TCTCCCTTGA CGTCATGCGG TGGGATCGAA CGAGCCATCC CGGGCTGAAT
GAATTTCTCA TCGCCCGGGT GATACGCGCG GCGCCGCATC TGGGCATCCG CCGCATCTCG
CTGAATTTCG CGGTCTTCCG GTCGGCATTG GAGCGTGGCG GGCGGATCGG CGCCGGTCCC
ATCATCCGCG CGTGGCGAAG CATTTTGCTC ATTGTTTCTC GGTGGGTGCA GATCGAATCG
CTGTACCGCT TCAACGCGAA ATTCCGGCCG GAGTGGGTCT CCCGGTACTT GCTGTATCCC
GATCTGCTGG ACCTGCCGCG GATCGCCCTT GCCGCGCTGG AAGCCGAGGC CTTCATCGTC
TGGCCGACCC CGAGCCTTCG GCGATTGCAG CGGGTGCTGC GCCTTGGAGG TGAGCCGTGA
 
Protein sequence
MTREARTTTT RDAARQPVRR RDRRRHRPWV PALAGWVAAL VGIRNFIVVL HPHWWERVRP 
VGKVLPAPGE TPLLHGLAAA ELVGSGALML LLAHALKRRK RRAWQVVVGV LAVGLALHVV
HHPPLHGIPG WLVIDGGFLI GLLAFRKEFF AEPDPRTRWS ALVTFVGLGV ASYGFGLVLV
GLRKDQLVGH PSWTDILLHV GYGLVGIPGP LVFRTDAGAD VVSATLLAMG ALTVFSTAYL
LLRAAKPRPA LTPDDEARMR ALLAAHGHRD SLGYFALRRD KSVVWSETGK SCIAYRVVSG
VMLVSGDPLG DPEAWPGAID VFLEQAERHA WVPAVLGCSE RAAETWLRHA DLAALEIGDE
AVIDVADFSL EGRAMRNVRQ MVHRVERAGY DAVIARNCDL APDLRDQLRA AAVRWRDGET
ERGFAMALGR LGDVTDPDCL FAVAVKDGRP HAFLHFVPWG RDGLSLDVMR WDRTSHPGLN
EFLIARVIRA APHLGIRRIS LNFAVFRSAL ERGGRIGAGP IIRAWRSILL IVSRWVQIES
LYRFNAKFRP EWVSRYLLYP DLLDLPRIAL AALEAEAFIV WPTPSLRRLQ RVLRLGGEP