Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acel_0036 |
Symbol | |
ID | 4484531 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidothermus cellulolyticus 11B |
Kingdom | Bacteria |
Replicon accession | NC_008578 |
Strand | + |
Start bp | 41319 |
End bp | 42296 |
Gene Length | 978 bp |
Protein Length | 325 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 639728798 |
Product | restriction endonuclease |
Protein accession | YP_871798 |
Protein GI | 117927247 |
COG category | [V] Defense mechanisms |
COG ID | [COG1715] Restriction endonuclease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAGTCG TGACCATTCC GCAGTATCAC GAGTTGATGT GGCCGGTGCT GATAGCGCTT CGGAAGCGTG GTGGATCCGC CACGATCAAA GAGATCTACG AGCAGGTCGT CGAAGACGAG CACTTCTCTG AGGAACAGCA GGCTGTCCCG ACAAAGGATG GTCGCATGTC CGAGCTGGAG TACCGGCTCC ATTGGGCGAG GACGCACCTC AAGGGTATCG GGGCGATCCA GAACAGTACG CGTGGGGTGT GGTCGCTGAC CGAAAAGGGT CAGACGATCA CGCCGGAGCA GATGCGCCGC GATACGAAGG CGTACCGAGA CGAACTGTTG CGTCGAGCGA AACAGAAGCA GCCTTCGAAC AGCGCAACCG ACCAAGCCAG CGACGATGAG CTGGATTCTT GGAAGGAGCA GCTTATTGCC CGACTGCTAC GCCTCCCGCC TCACGGCTTC GAGCGACTCG CGCAACGACT CCTTCGGGAA GCGGGATTCG TATACGTCAC CGTCCTCGGC ACGAGCGGGG ATGGAGGCAT TGACGGTGTC GGAGTGTACC GGCTGTCTCC GGTCTCGTTC CCCGTCTATT TCCAGTGCAA GCGTTACAAG GGATCCGTGA CGGCCGGTGT CGTGCGGGAC TTTCGAGGCG CGATGGCAGG CCGCGGAGAC AAGGTTCTTT TGATCACGAC CGGCTCGTTT ACGAAGGATG CTCAGAACGA AGCGAGCCGT GACGGAGCGC CGCCTGTCGA GTTGATCGAT GGAGATCGGC TCTGTGACCT CCTGCGCGAC TACCGGCTTG GCGTGGACGT CCGAGGCGTA TCGAGTATGA GGTTATCGTC AATGCGTCGT TCTTCGACGA GTACGACTCC GCCGGCACCG CCACGTCGAC GTATGCCTGA CACTGGCGCC AGTCGCTTCA CTAATTATGG CCGTGGCGGC ACGAGCGAGC GGTTTATCGC GGACCGATGC CGAGCTGTGC ACGACTGA
|
Protein sequence | MAVVTIPQYH ELMWPVLIAL RKRGGSATIK EIYEQVVEDE HFSEEQQAVP TKDGRMSELE YRLHWARTHL KGIGAIQNST RGVWSLTEKG QTITPEQMRR DTKAYRDELL RRAKQKQPSN SATDQASDDE LDSWKEQLIA RLLRLPPHGF ERLAQRLLRE AGFVYVTVLG TSGDGGIDGV GVYRLSPVSF PVYFQCKRYK GSVTAGVVRD FRGAMAGRGD KVLLITTGSF TKDAQNEASR DGAPPVELID GDRLCDLLRD YRLGVDVRGV SSMRLSSMRR SSTSTTPPAP PRRRMPDTGA SRFTNYGRGG TSERFIADRC RAVHD
|
| |