Gene Cthe_2411 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2411 
Symbol 
ID4808126 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2877937 
End bp2879082 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content41% 
IMG OID640107824 
Productmetallophosphoesterase 
Protein accessionYP_001038806 
Protein GI125974896 
COG category[L] Replication, recombination and repair 
COG ID[COG0420] DNA repair exonuclease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones38 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATTCAT TAAAGTTTCT GCATTTTTCG GATATTCATC TTGACGCACC TTTCAGCTCT 
TTGGGCTCAA AATTTGCGGC TGAACAAAGA AGACGGGACC TTCTTGAAGT GTTTGGCCGG
ATAATTGACC TTGCGAAAAA GGAAGCTGTG GACATCATAC TGATAAGCGG AGATCTTTAC
GAACATGAGT ATGTAAGAAA GTCTACAATT CATTACATAA ATAAAAAATT CAGCGAGATT
CCCGAAACAA AAGTGTTTAT TGTTCCGGGA AACCATGATC CGTGTATTTC CAATTCCTAT
TACCAAAACT TTGAGTGGAG CAAAAATGTC TGTATTCTGT CGGAGAACAG GACGAAAGTT
TTTCTTGAGG AGCACAATGC CTGTGTGTAT GGTGCGGGCT TTTCAAATTT CCATGAAGGA
ACAAGTCTTA TAAACAAAAT TGAGCCGGCA GACCCAAGGT ATATTAATAT TTTGCTGGTT
CACGGCACCG TGGATTTGGA TTTCAAAGAC AGCCGGTATA ATCCCATGTC AAGCGGCGAA
CTTGCACTTT TGGGCATGGA TTATATTGCT TTGGGGCATT TTCATAATAC TCTGCGGGGT
GTTGGAAAGA GTGAAAACAT ATACAACCCC GGAAGCCCGG AACCTCTGGG ATTTGATGAG
GAGGGGGAGC ACGGTGTCTT TATTGGCAGA ATAGACTTTG TGTCAAAAGA GGAAAAAAAA
CTTCAGGTAA AGTTTGAGAA GACCTGCAAA AGGCAGTACA AATCTTTTGA GATAAAATCC
GACTGTTTTG AGAGTGACGA CCAAATTATA GACGAAATTT TTCGTAAAGC CGTAAAAGAT
GAAAACCGGA ACGACCTTGT ACATATAACT TTGAAGGGCT ATACCATGCC TGGATATCGT
ATTAATGCTG CAAACATAGC CAGTGCTATT GAGGACAGCT TTTTTTATGC GGTTGTAAAT
GATGAAACCG TAAATCAATA TAATTACGAG GAGCTTATGA ACGAACCGGG ATTAAAGGGG
CTGTTTGTGA GGAAAATGTT TTCGCTTATA GACAAAGCTG AAAATGAAAA AGAAAAGCAT
CTTTTGATGA AGGCCATGCA ATACGGTGTC GAAGCCCTGG AACAGGGCAA AGTGGAAGTG
CTGTAA
 
Protein sequence
MNSLKFLHFS DIHLDAPFSS LGSKFAAEQR RRDLLEVFGR IIDLAKKEAV DIILISGDLY 
EHEYVRKSTI HYINKKFSEI PETKVFIVPG NHDPCISNSY YQNFEWSKNV CILSENRTKV
FLEEHNACVY GAGFSNFHEG TSLINKIEPA DPRYINILLV HGTVDLDFKD SRYNPMSSGE
LALLGMDYIA LGHFHNTLRG VGKSENIYNP GSPEPLGFDE EGEHGVFIGR IDFVSKEEKK
LQVKFEKTCK RQYKSFEIKS DCFESDDQII DEIFRKAVKD ENRNDLVHIT LKGYTMPGYR
INAANIASAI EDSFFYAVVN DETVNQYNYE ELMNEPGLKG LFVRKMFSLI DKAENEKEKH
LLMKAMQYGV EALEQGKVEV L