Gene Information |
Locus tag | Cthe_1517 |
Symbol | |
ID | 4810555 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1841721 |
End bp | 1842830 |
Gene Length | 1110 bp |
Protein Length | 369 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 640106937 |
Product | type I phosphodiesterase/nucleotide pyrophosphatase |
Protein accession | YP_001037938 |
Protein GI | 125974028 |
COG category | [R] General function prediction only |
COG ID | [COG1524] Uncharacterized proteins of the AP superfamily |
TIGRFAM ID | |
| |
Plasmid Coverage Information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACACAA ACATTGTTTA TCCGGATTAC AGCAATTCAA TAGCGAATCT TGCAAATTCA ATTCTGAAAA AATGGGGACT TCCGACTAAT GGTAAAACTC TAGAGCTTCT CGACCGGTAT CTTGCGAAAG ATTATAAGAA TGTGGTGGTT ATTCTTCTTG ATGGAATGGG CAGATGCATT ATTGAGCGCA ATCTCGAAAA AGACGGCTTT TTCAATACCC ACTTGGCGGG AACATACAGC TCGACATTTC CATCAACTAC AGTAGCGGCT ACAACATCAA TCGATAGCGG TCTTACTCCC TGTGAACATG GATGGCTTGG ATGGGACTGT TACTTTCCGC AGATAGACCG AAATGTAACG GTATTTCATA ATACGGACAC CGAGACCGGA GAGAAGGTGG CAGAGGAGAG TGTTGCATGG AAGTACTGCT GGTATTCAAG CGTGATTAAC AGGATTGATT CAGCGGGAGG AAAAGCATAT TATGCTATAC CGTTTGTTTC GCCTTATCCG GCAACATTTG AGGAAAGATG CGAACTGATA AAAAAATATT GTGATGAACC CGGGCAAAAG TATATCTACT GTTATTGTGA CGAACCTGAC AAAACTATGC ATCTGACCGG TTGCTACAGT GAAGAATCAA GGAAAGTGAT TTCATGGCTT GAAAGAAAAA TAGAGAGTCT TACTACCGAA CTAAGGGACA CTCTTGTGAT AATAACTGCC GACCATGGCC ATGTGAACAC AAAACGTGTG TGTATTAAGG ATTATCCAAA TATTATGAAT TGTCTGAAAA GAATTCCCAC TATTGAACCC AGAGCTTTGA ATTTGTTTGT GAAGGAAGAC AGAAGAGACG AATTTGAGAA AGAATTTACT TGTGAATTTG GCGGCAAGTT TCTCCTTTTG CCAAAAGAAA AAGTACTCGA AATGAAATTA TTCGGATACG GAACAGAGCA TAAAGACTTT CGCAATATGC TGGGAGATTA TCTTGCTGTT GCAACAGATG ATTTGTCTAT TTTTAACACA AAAGAAAAGA AAGAGAAATT CGTTAGCTCT CATGGGGGAC TTACAGAAGA CGAGATGATT ATTCCGTTGA TTATTGTGGA AAAGAAATAG
Protein sequence | MNTNIVYPDY SNSIANLANS ILKKWGLPTN GKTLELLDRY LAKDYKNVVV ILLDGMGRCI IERNLEKDGF FNTHLAGTYS STFPSTTVAA TTSIDSGLTP CEHGWLGWDC YFPQIDRNVT VFHNTDTETG EKVAEESVAW KYCWYSSVIN RIDSAGGKAY YAIPFVSPYP ATFEERCELI KKYCDEPGQK YIYCYCDEPD KTMHLTGCYS EESRKVISWL ERKIESLTTE LRDTLVIITA DHGHVNTKRV CIKDYPNIMN CLKRIPTIEP RALNLFVKED RRDEFEKEFT CEFGGKFLLL PKEKVLEMKL FGYGTEHKDF RNMLGDYLAV ATDDLSIFNT KEKKEKFVSS HGGLTEDEMI IPLIIVEKK
| |