Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acel_0791 |
Symbol | |
ID | 4486225 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidothermus cellulolyticus 11B |
Kingdom | Bacteria |
Replicon accession | NC_008578 |
Strand | + |
Start bp | 872001 |
End bp | 873452 |
Gene Length | 1452 bp |
Protein Length | 483 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639729562 |
Product | CBS domain-containing protein |
Protein accession | YP_872550 |
Protein GI | 117927999 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.847387 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.143113 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTGGACG CCGTCTTGTT CACCCTCGCC GCCCTGCTCG TCGTGTTCGC CGGGGCGTGC GCGATGGCCG ATTCCGCACT CTCCCGGGTA CCCCGGGTGC ACGTGGAGGA ATTCACCCGC GACGGTCGGC GTGGCGCCAA GTCCCTGGCC AAGGTGATCA ATGACGCATC GCATTACCTC AATCTCGTCT TGCTGCTGCG GGTGACCGCG GAGATGACGG CGGCGGTCAT TGTCGCGGTC GCCGCGGTAC GGATTTTCGG TTTGCATTGG CCGGTCGTCG TCGTGGTGTC GCTCGTCATG GTGGTGGTCA GTTACGTCGT CATCGGCGTG GCGCCGCGCA CCATCGGACG CCAGCACGCG GACGGCGTTT CGCTCGCCGT CGCCCCGGTC GTCTACGGCC TGGCGCGTGT TCTCGGCCCG CTGCCCCGGT TGCTCATCGC ATTCGGCAAC GCCATCACCC CCGGCCGGGG TTTCCGGGAG GGGCCGTTCA CCACCGAAGC CGAGTTGCGT GACCTCGTGG ATTTAGCCGA AAAGGCCCGG GTCATCGAGC ACGGCGAGCG GCAAATGATT CACTCGGTCT TCGAATTGGG GGACACGCTC GTGCGTGAGG TGATGGTGCC GCGGACGGAC ATCGTCTTCA TCGAGAAAAC GAAGACCCTG CGTCAGGCGA TGTCCTTGGC GCTGCGCAGC GGATTCTCGC GGATTCCCGT TGTCGGAGAG AATGAAGACG ACGTCGTCGG AATCGTCTAC CTGCGCGACC TTGCAAAGCG CATTTACGAG TATCGGGAAG CGGAGACCCT GGAGCGCGTC GAATCGATTA TGCGCCCGCC GGTGTTTGTG CCGGACAGCA AACCCATCGA TGAATTGCTC CGCGAAATGC AGGCGGCGCG CAATCACGTC GCGATCGTCG TGGACGAGTA CGGCGGCACG GCCGGGCTCG TCACGATCGA GGACATTCTC GAGGAAATCG TCGGGGAGAT CACCGACGAG TACGACCTGG AGCGGCCGCG GGTCGAGCCG CTTGAGGACG GCACGGTTCG GGTGACCGCG CGGCTGTCGA TTGACGAGCT CGAGGAGCTC TTCGGCGTGC AGATCGACCG CACTGACCAC GAGGTGGAGA CCGTCGGGGG ATTGTTGGCC CAGGTTCTGG GTCGAGTGCC GATACCGGGG GCGCGGGCGG CGGTCGCCGG CTTGGAGCTC ATCGCCGAGG GCGCGGCGGG CCGGCGCAAC AAAATTGCCA CTGTTCTCGT CCGCCGGCTG CCGGAGGCGC CGGAGGCGGC AGCGGAGACG GAGCACACCG CTGCTGCGGA GCCGGCCGAC GCTCCGGGAC GCGGCCCTTC CCGGGCCGAC GGCGGCGACC CGGCCGTACC TAGCCAGCCG GCAGAGAATG GCGCGCACGA CGCCGGTACG CCGGTCGCGG CGGCCCCGCG TGGTGACGGA GTACGCCGGT GA
|
Protein sequence | MVDAVLFTLA ALLVVFAGAC AMADSALSRV PRVHVEEFTR DGRRGAKSLA KVINDASHYL NLVLLLRVTA EMTAAVIVAV AAVRIFGLHW PVVVVVSLVM VVVSYVVIGV APRTIGRQHA DGVSLAVAPV VYGLARVLGP LPRLLIAFGN AITPGRGFRE GPFTTEAELR DLVDLAEKAR VIEHGERQMI HSVFELGDTL VREVMVPRTD IVFIEKTKTL RQAMSLALRS GFSRIPVVGE NEDDVVGIVY LRDLAKRIYE YREAETLERV ESIMRPPVFV PDSKPIDELL REMQAARNHV AIVVDEYGGT AGLVTIEDIL EEIVGEITDE YDLERPRVEP LEDGTVRVTA RLSIDELEEL FGVQIDRTDH EVETVGGLLA QVLGRVPIPG ARAAVAGLEL IAEGAAGRRN KIATVLVRRL PEAPEAAAET EHTAAAEPAD APGRGPSRAD GGDPAVPSQP AENGAHDAGT PVAAAPRGDG VRR
|
| |