Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acel_0206 |
Symbol | |
ID | 4485292 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidothermus cellulolyticus 11B |
Kingdom | Bacteria |
Replicon accession | NC_008578 |
Strand | + |
Start bp | 220075 |
End bp | 221874 |
Gene Length | 1800 bp |
Protein Length | 599 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 639728969 |
Product | hypothetical protein |
Protein accession | YP_871966 |
Protein GI | 117927415 |
COG category | [S] Function unknown |
COG ID | [COG2898] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.914985 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.140787 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGCGTG AGGCTCGGAC GACGACCACC CGCGACGCGG CGCGGCAACC GGTGAGGCGC CGGGATCGGC GTCGCCACCG TCCCTGGGTG CCTGCGCTCG CGGGCTGGGT GGCGGCCCTC GTCGGGATTC GGAATTTCAT CGTCGTCCTG CACCCCCACT GGTGGGAGCG GGTTCGTCCC GTCGGCAAGG TGCTGCCCGC ACCGGGGGAA ACACCGCTGC TCCACGGGCT GGCGGCCGCT GAGCTGGTCG GTTCAGGCGC GCTCATGCTG CTCCTCGCGC ACGCGCTCAA ACGCCGCAAG CGGCGGGCAT GGCAGGTCGT CGTCGGCGTT CTCGCCGTCG GGCTGGCCCT GCACGTGGTG CATCACCCGC CGCTGCACGG CATCCCGGGA TGGCTGGTCA TCGACGGCGG TTTCCTCATC GGCCTGCTGG CTTTCCGGAA GGAATTCTTC GCCGAACCGG ATCCCCGGAC CCGGTGGAGC GCCTTGGTCA CCTTCGTCGG ACTTGGCGTC GCGAGTTACG GGTTCGGCCT GGTCCTGGTG GGGCTGCGCA AGGACCAGCT GGTCGGCCAC CCGTCGTGGA CGGACATTCT CCTGCACGTC GGGTACGGAC TCGTCGGCAT CCCAGGCCCG CTCGTCTTTC GCACCGACGC CGGCGCGGAC GTCGTCTCGG CCACCCTGCT CGCAATGGGC GCCCTGACGG TCTTCAGCAC CGCGTATCTG CTCCTCCGTG CCGCGAAACC GCGACCGGCC CTCACGCCAG ACGACGAAGC GCGGATGCGC GCGCTGCTTG CCGCGCACGG ACACCGCGAC TCTCTGGGTT ACTTTGCGTT GCGGCGGGAC AAGAGCGTGG TCTGGTCCGA GACGGGCAAG TCCTGCATTG CATATCGCGT GGTCTCGGGC GTGATGCTGG TGAGTGGTGA TCCGCTCGGT GATCCGGAGG CTTGGCCGGG CGCGATCGAC GTCTTCCTCG AGCAGGCCGA ACGGCACGCT TGGGTGCCTG CCGTCCTGGG GTGCAGCGAA CGCGCCGCGG AAACGTGGCT CCGGCATGCC GATCTTGCGG CTCTGGAAAT TGGCGACGAA GCCGTCATTG ATGTCGCGGA TTTCTCCCTT GAGGGCCGGG CGATGCGCAA TGTGCGGCAA ATGGTGCACC GGGTCGAGCG GGCCGGCTAT GACGCCGTCA TCGCCCGAAA TTGCGACCTG GCCCCCGACC TGCGCGACCA GTTGCGGGCC GCCGCCGTGC GGTGGCGCGA CGGGGAGACC GAACGCGGTT TCGCCATGGC GCTGGGCCGG CTCGGTGACG TCACGGACCC CGACTGTCTG TTCGCGGTAG CCGTCAAGGA CGGCCGTCCC CATGCGTTCC TGCATTTCGT CCCGTGGGGA CGCGACGGCC TCTCCCTTGA CGTCATGCGG TGGGATCGAA CGAGCCATCC CGGGCTGAAT GAATTTCTCA TCGCCCGGGT GATACGCGCG GCGCCGCATC TGGGCATCCG CCGCATCTCG CTGAATTTCG CGGTCTTCCG GTCGGCATTG GAGCGTGGCG GGCGGATCGG CGCCGGTCCC ATCATCCGCG CGTGGCGAAG CATTTTGCTC ATTGTTTCTC GGTGGGTGCA GATCGAATCG CTGTACCGCT TCAACGCGAA ATTCCGGCCG GAGTGGGTCT CCCGGTACTT GCTGTATCCC GATCTGCTGG ACCTGCCGCG GATCGCCCTT GCCGCGCTGG AAGCCGAGGC CTTCATCGTC TGGCCGACCC CGAGCCTTCG GCGATTGCAG CGGGTGCTGC GCCTTGGAGG TGAGCCGTGA
|
Protein sequence | MTREARTTTT RDAARQPVRR RDRRRHRPWV PALAGWVAAL VGIRNFIVVL HPHWWERVRP VGKVLPAPGE TPLLHGLAAA ELVGSGALML LLAHALKRRK RRAWQVVVGV LAVGLALHVV HHPPLHGIPG WLVIDGGFLI GLLAFRKEFF AEPDPRTRWS ALVTFVGLGV ASYGFGLVLV GLRKDQLVGH PSWTDILLHV GYGLVGIPGP LVFRTDAGAD VVSATLLAMG ALTVFSTAYL LLRAAKPRPA LTPDDEARMR ALLAAHGHRD SLGYFALRRD KSVVWSETGK SCIAYRVVSG VMLVSGDPLG DPEAWPGAID VFLEQAERHA WVPAVLGCSE RAAETWLRHA DLAALEIGDE AVIDVADFSL EGRAMRNVRQ MVHRVERAGY DAVIARNCDL APDLRDQLRA AAVRWRDGET ERGFAMALGR LGDVTDPDCL FAVAVKDGRP HAFLHFVPWG RDGLSLDVMR WDRTSHPGLN EFLIARVIRA APHLGIRRIS LNFAVFRSAL ERGGRIGAGP IIRAWRSILL IVSRWVQIES LYRFNAKFRP EWVSRYLLYP DLLDLPRIAL AALEAEAFIV WPTPSLRRLQ RVLRLGGEP
|
| |