Gene Acel_0791 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcel_0791 
Symbol 
ID4486225 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidothermus cellulolyticus 11B 
KingdomBacteria 
Replicon accessionNC_008578 
Strand
Start bp872001 
End bp873452 
Gene Length1452 bp 
Protein Length483 aa 
Translation table11 
GC content67% 
IMG OID639729562 
ProductCBS domain-containing protein 
Protein accessionYP_872550 
Protein GI117927999 
COG category[R] General function prediction only 
COG ID[COG1253] Hemolysins and related proteins containing CBS domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.847387 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.143113 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGGACG CCGTCTTGTT CACCCTCGCC GCCCTGCTCG TCGTGTTCGC CGGGGCGTGC 
GCGATGGCCG ATTCCGCACT CTCCCGGGTA CCCCGGGTGC ACGTGGAGGA ATTCACCCGC
GACGGTCGGC GTGGCGCCAA GTCCCTGGCC AAGGTGATCA ATGACGCATC GCATTACCTC
AATCTCGTCT TGCTGCTGCG GGTGACCGCG GAGATGACGG CGGCGGTCAT TGTCGCGGTC
GCCGCGGTAC GGATTTTCGG TTTGCATTGG CCGGTCGTCG TCGTGGTGTC GCTCGTCATG
GTGGTGGTCA GTTACGTCGT CATCGGCGTG GCGCCGCGCA CCATCGGACG CCAGCACGCG
GACGGCGTTT CGCTCGCCGT CGCCCCGGTC GTCTACGGCC TGGCGCGTGT TCTCGGCCCG
CTGCCCCGGT TGCTCATCGC ATTCGGCAAC GCCATCACCC CCGGCCGGGG TTTCCGGGAG
GGGCCGTTCA CCACCGAAGC CGAGTTGCGT GACCTCGTGG ATTTAGCCGA AAAGGCCCGG
GTCATCGAGC ACGGCGAGCG GCAAATGATT CACTCGGTCT TCGAATTGGG GGACACGCTC
GTGCGTGAGG TGATGGTGCC GCGGACGGAC ATCGTCTTCA TCGAGAAAAC GAAGACCCTG
CGTCAGGCGA TGTCCTTGGC GCTGCGCAGC GGATTCTCGC GGATTCCCGT TGTCGGAGAG
AATGAAGACG ACGTCGTCGG AATCGTCTAC CTGCGCGACC TTGCAAAGCG CATTTACGAG
TATCGGGAAG CGGAGACCCT GGAGCGCGTC GAATCGATTA TGCGCCCGCC GGTGTTTGTG
CCGGACAGCA AACCCATCGA TGAATTGCTC CGCGAAATGC AGGCGGCGCG CAATCACGTC
GCGATCGTCG TGGACGAGTA CGGCGGCACG GCCGGGCTCG TCACGATCGA GGACATTCTC
GAGGAAATCG TCGGGGAGAT CACCGACGAG TACGACCTGG AGCGGCCGCG GGTCGAGCCG
CTTGAGGACG GCACGGTTCG GGTGACCGCG CGGCTGTCGA TTGACGAGCT CGAGGAGCTC
TTCGGCGTGC AGATCGACCG CACTGACCAC GAGGTGGAGA CCGTCGGGGG ATTGTTGGCC
CAGGTTCTGG GTCGAGTGCC GATACCGGGG GCGCGGGCGG CGGTCGCCGG CTTGGAGCTC
ATCGCCGAGG GCGCGGCGGG CCGGCGCAAC AAAATTGCCA CTGTTCTCGT CCGCCGGCTG
CCGGAGGCGC CGGAGGCGGC AGCGGAGACG GAGCACACCG CTGCTGCGGA GCCGGCCGAC
GCTCCGGGAC GCGGCCCTTC CCGGGCCGAC GGCGGCGACC CGGCCGTACC TAGCCAGCCG
GCAGAGAATG GCGCGCACGA CGCCGGTACG CCGGTCGCGG CGGCCCCGCG TGGTGACGGA
GTACGCCGGT GA
 
Protein sequence
MVDAVLFTLA ALLVVFAGAC AMADSALSRV PRVHVEEFTR DGRRGAKSLA KVINDASHYL 
NLVLLLRVTA EMTAAVIVAV AAVRIFGLHW PVVVVVSLVM VVVSYVVIGV APRTIGRQHA
DGVSLAVAPV VYGLARVLGP LPRLLIAFGN AITPGRGFRE GPFTTEAELR DLVDLAEKAR
VIEHGERQMI HSVFELGDTL VREVMVPRTD IVFIEKTKTL RQAMSLALRS GFSRIPVVGE
NEDDVVGIVY LRDLAKRIYE YREAETLERV ESIMRPPVFV PDSKPIDELL REMQAARNHV
AIVVDEYGGT AGLVTIEDIL EEIVGEITDE YDLERPRVEP LEDGTVRVTA RLSIDELEEL
FGVQIDRTDH EVETVGGLLA QVLGRVPIPG ARAAVAGLEL IAEGAAGRRN KIATVLVRRL
PEAPEAAAET EHTAAAEPAD APGRGPSRAD GGDPAVPSQP AENGAHDAGT PVAAAPRGDG
VRR