Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0695 |
Symbol | |
ID | 4810313 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 853324 |
End bp | 854193 |
Gene Length | 870 bp |
Protein Length | 289 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 640106112 |
Product | agmatinase |
Protein accession | YP_001037123 |
Protein GI | 125973213 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0010] Arginase/agmatinase/formimionoglutamate hydrolase, arginase family |
TIGRFAM ID | [TIGR01230] agmatinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.109058 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTTTAA GAAACGGGAA TTCTACATTG CCGACCAAAT TTATGGCAAG CATTGACAGC TACGATGATG CATCCATAGT CATGGCCGGT GTTCCAATGG ACTTTACCTG CAGCTTCAGA CCGGGAACAA GGTTTGGACC GCAGAAAATC AGGGAAGTAT CCATAGGCAT TGAAGAATAC AGCGTATACA TGGACCGGGA TTTGACCCAG TGTTCTTTCT TTGATGCCGG TGACTTGGAT CTTCCCTTCG GTGACGTGGA TAAAAGCTTG AAACTTATCG GGGACGTGGC AGAAGAGATA CTCAGCGACA ATAAATTTCC ACTCTTTATC GGTGGAGAGC ATCTTATCAG CGTACCGGTA ATAAAAAAGG TGTATGAAAA ATACGGACCT GAACTGATAG TGGTGCAGTT TGACGCCCAT GCAGACCTCA GGGAAGGATA TCTGGGATGC CCAAACTCTC ATGCTTCGGC TGTAAGACGC TTGATTGACT TTATGCCGGG AAAAAATATT TATCAGTTCG GTATAAGGTC CGGAACGAAA GACGAGTTTG AATATGCAAA AAAACATACC AACATGTATA CTATCGACGT TTTTGAGCCG TTAAGCCGTG TCTTAGACGA TATCAAGGAC AAGCCCATAT ACATAACCCT TGACATCGAC GTGGTAGACC CTGCATACGC GAACGGTACT GGTACACCGG AGCCCGGCGG GATCAGCTCC AGGGAACTTT TGGATTCAAT ACATCTGTTC AAGGGAGCAA ATCTTGTCGG ATTTGACATT GTGGAAGTAT CACCCCACTA TGATCAATCT GACCGGACGG CGCTCCTTGC GGCAAAAATT ATCAGAGAAA TCATTATGAT GGTCGGATGA
|
Protein sequence | MSLRNGNSTL PTKFMASIDS YDDASIVMAG VPMDFTCSFR PGTRFGPQKI REVSIGIEEY SVYMDRDLTQ CSFFDAGDLD LPFGDVDKSL KLIGDVAEEI LSDNKFPLFI GGEHLISVPV IKKVYEKYGP ELIVVQFDAH ADLREGYLGC PNSHASAVRR LIDFMPGKNI YQFGIRSGTK DEFEYAKKHT NMYTIDVFEP LSRVLDDIKD KPIYITLDID VVDPAYANGT GTPEPGGISS RELLDSIHLF KGANLVGFDI VEVSPHYDQS DRTALLAAKI IREIIMMVG
|
| |