Gene Cthe_1517 details


Gene Information

Locus tag: Cthe_1517
Symbol:
ID: 4810555
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 1841721
End bp: 1842830
Gene Length: 1110 bp
Protein Length: 369 aa
Translation table: 11
GC content: 40%
IMG OID: 640106937
Product: type I phosphodiesterase/nucleotide pyrophosphatase
Protein accession: YP_001037938
Protein GI: 125974028
COG category: [R] General function prediction only
COG ID: [COG1524] Uncharacterized proteins of the AP superfamily
TIGRFAM ID:
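
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the gene record above.
start_bp = 1841721
end_bp = 1842830
gene_length_bp = 1110
protein_length_aa = 369


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Codon count minus the stop codon, assuming an unspliced CDS."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp) == gene_length_bp)              # True
print(expected_protein_length(gene_length_bp) == protein_length_aa)  # True
```

Both checks pass for this record: 1842830 - 1841721 + 1 = 1110 bp, and 1110 / 3 - 1 = 369 aa.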


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACACAA ACATTGTTTA TCCGGATTAC AGCAATTCAA TAGCGAATCT TGCAAATTCA 
ATTCTGAAAA AATGGGGACT TCCGACTAAT GGTAAAACTC TAGAGCTTCT CGACCGGTAT
CTTGCGAAAG ATTATAAGAA TGTGGTGGTT ATTCTTCTTG ATGGAATGGG CAGATGCATT
ATTGAGCGCA ATCTCGAAAA AGACGGCTTT TTCAATACCC ACTTGGCGGG AACATACAGC
TCGACATTTC CATCAACTAC AGTAGCGGCT ACAACATCAA TCGATAGCGG TCTTACTCCC
TGTGAACATG GATGGCTTGG ATGGGACTGT TACTTTCCGC AGATAGACCG AAATGTAACG
GTATTTCATA ATACGGACAC CGAGACCGGA GAGAAGGTGG CAGAGGAGAG TGTTGCATGG
AAGTACTGCT GGTATTCAAG CGTGATTAAC AGGATTGATT CAGCGGGAGG AAAAGCATAT
TATGCTATAC CGTTTGTTTC GCCTTATCCG GCAACATTTG AGGAAAGATG CGAACTGATA
AAAAAATATT GTGATGAACC CGGGCAAAAG TATATCTACT GTTATTGTGA CGAACCTGAC
AAAACTATGC ATCTGACCGG TTGCTACAGT GAAGAATCAA GGAAAGTGAT TTCATGGCTT
GAAAGAAAAA TAGAGAGTCT TACTACCGAA CTAAGGGACA CTCTTGTGAT AATAACTGCC
GACCATGGCC ATGTGAACAC AAAACGTGTG TGTATTAAGG ATTATCCAAA TATTATGAAT
TGTCTGAAAA GAATTCCCAC TATTGAACCC AGAGCTTTGA ATTTGTTTGT GAAGGAAGAC
AGAAGAGACG AATTTGAGAA AGAATTTACT TGTGAATTTG GCGGCAAGTT TCTCCTTTTG
CCAAAAGAAA AAGTACTCGA AATGAAATTA TTCGGATACG GAACAGAGCA TAAAGACTTT
CGCAATATGC TGGGAGATTA TCTTGCTGTT GCAACAGATG ATTTGTCTAT TTTTAACACA
AAAGAAAAGA AAGAGAAATT CGTTAGCTCT CATGGGGGAC TTACAGAAGA CGAGATGATT
ATTCCGTTGA TTATTGTGGA AAAGAAATAG
 
Protein sequence
MNTNIVYPDY SNSIANLANS ILKKWGLPTN GKTLELLDRY LAKDYKNVVV ILLDGMGRCI 
IERNLEKDGF FNTHLAGTYS STFPSTTVAA TTSIDSGLTP CEHGWLGWDC YFPQIDRNVT
VFHNTDTETG EKVAEESVAW KYCWYSSVIN RIDSAGGKAY YAIPFVSPYP ATFEERCELI
KKYCDEPGQK YIYCYCDEPD KTMHLTGCYS EESRKVISWL ERKIESLTTE LRDTLVIITA
DHGHVNTKRV CIKDYPNIMN CLKRIPTIEP RALNLFVKED RRDEFEKEFT CEFGGKFLLL
PKEKVLEMKL FGYGTEHKDF RNMLGDYLAV ATDDLSIFNT KEKKEKFVSS HGGLTEDEMI
IPLIIVEKK
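
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch, not the tool that generated this record. It relies on translation table 11 assigning the same amino acids to codons as the standard code (the tables differ in permitted start codons, which does not matter here because the gene starts with ATG). The `translate` helper and the constant names are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order, third base varying fastest.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above.
first_line = "ATGAACACAA ACATTGTTTA TCCGGATTAC AGCAATTCAA TAGCGAATCT TGCAAATTCA"
print(translate(first_line))  # MNTNIVYPDYSNSIANLANS
```

The output matches the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` in the same way allows the remaining residues to be compared against the record.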