Gene Cthe_1961 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1961 
Symbol 
ID4810744 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2336838 
End bp2339288 
Gene Length2451 bp 
Protein Length816 aa 
Translation table11 
GC content43% 
IMG OID640107377 
Productnucleotidyl transferase 
Protein accessionYP_001038372 
Protein GI125974462 
COG category[G] Carbohydrate transport and metabolism
[J] Translation, ribosomal structure and biogenesis
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1109] Phosphomannomutase
[COG1208] Nucleoside-diphosphate-sugar pyrophosphorylase involved in lipopolysaccharide biosynthesis/translation initiation factor 2B, gamma/epsilon subunits (eIF-2Bgamma/eIF-2Bepsilon) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.333619 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGCTA TTATAATGGC CGGCGGGGAA GGCTCAAGGC TTCGGCCTCT TACATGCGAC 
TTGCCCAAGC CTATGGTCCC AATAATGAAC ATTCCAATAA TGGAACACAT AATCAACCTA
CTGAAAAAAC ATGGAATTAC AGAGATCGGC GTTACGCTCA TGTACCTTCC CCAAAAGATT
AAGGATTATT TTGGGAATGG TTCCAATTTT GGAGTCAACA TAACCTACTT TACCGAAGAT
ACTCCTTTAG GTACGGCGGG AAGTGTAAAA AACGCCGAAG ATTTCCTTGA TGAAACCTTT
ATAGTAATAA GCGGGGATTC CCTTACCAAT ATGAATATTA CCAAAGCAAT CGAGTTTCAC
AGGATGAAAA ACTCAAAAGC CACTCTGGTT TTGACGAGGG TTGACGTTCC CCTTGAATAC
GGTGTAGTTA TCACGGACAA GTCCGGTGCC ATAACCGGCT TTTTGGAAAA ACCTAGCTGG
GGAGAAGTTT TCAGTGACAC CGTAAACACC GGAGCCTATA TTTTGGAGCC TGAAATATTA
AAATACCTGG AAAAGGGCAA AAAAGTGGAT TTCAGCCAGG ACCTCTTTCC CTATCTCCTT
TTGAAGAAGG AACCCATGTA TGGTTACGTC ATGGACGACT ATTGGTGTGA CATCGGCGAC
CTTCAGGCTT ATCTTCAGGC CCATTATGAT GTCCTTGAAG GCAAAATCCA GCTTGATATA
AACGGAACTG AAATTCAAAA AGGTGTTTGG GTCGGCTCGG GTGCAATCAT CGAGCCCGGT
GCGATACTCA ACCCCCCCTG TGTTATCGGA GACAACTGCC GCATAGAAAG CGGTGCGGTA
ATCGACAGCT TGAGTGTCAT CGGAAATAAT AATGTGATTG AAAGGGACAG TTCCGTAAAA
CGCAGTGTTA TTTGGGACGG CAACTACATT GAGTATGGCT CGAAAATTCG TGGTGCCATT
TTGTGCAGCA AGACAAACCT CAAGCGCTAT GTACATATAT TTGAAAATGC CATTGTGGGA
GACAATTGTT TGATTAACGA AAGGGTCGTT ATCAAACCCA ATATAAAAAT ATGGCCGCAA
AAAACCGTTG AACCCTTCGC CATAGTGGAC AGAAACATTA TCTGGGGATC AAAGCACTCA
AAAAGTATCT TTGGTGAAAA CGGTCTTTCC GGAATTATCA ACGTTGACAT ATCCCCTGAG
TTTGCAACAA GGCTTGGCGC GGCATACGGT TCCATATTCA AAAAGGGTTC CAAGGTTGTT
GTAAGTTCCA CCACCGCCAA CTCCGCCAGA ATGTTCAAAC ATGCCTTTAT ATCAGGTATT
CTTTCGGTCG GTGTGGAAGT ATTCAACTTA AGCAGCCTTC TCACCCCTTT GGCCCGCCAT
GCAATCAATT TCCTTTCCGT TGAGGGAGGA ATTCACATCA AGCTCAGTGA GGACAATCCA
AACAAGCTCA AGGTTGATTT TATGGACTCC AAAGGGGCAA GCATCAGCAG GGTTACAGAA
AGGAAAATAG AAAACTCTTT TGCCCGCGAG GACTTTAAAC GCTGCTCCGG AGACGAAGTC
AGCAGACTTA ACAATATTAC GGACTTCAAG AACTATTATG TACGTTCCAT ATTGAATGAA
GTAAATGTTG AAGCCATAAA AAACAATCCG CCAAAGTTAT GCATAGTTTC ACCGTCAGAT
TTTGTAATAT CCATTGTAGT ACCGATGCTC ACGGATTTAG GCTGCAAGGT TGCCAGTTTC
TCCTCCACCA ACGTGAGTGA AGTTGACACC ATTGTTGACG AGATAAAAGA CAATAATGCC
AGCTTTGCGG CTTTCATTGA CAGCAACGGT GAAACACTGG TGCTGATAGA CAAAAACGGC
AATGTGGTAA AGGATGATTT GTTCCTCTGT CTGACATCCC TTATTACTTT TAAGTCGGTT
CCAAATTCAA AGGTGGTTGT GCCCATAACG GCACCATCCA TTATTGAAAC ATTGGCCGAG
CGCTACAACG GCAAGGTTGT GAGGACCAAA ACCTCACCCC AGGCAGTAAT GGAGCAAATG
TTAAACCACA ACCTTTTCAA AAATCGTGAA AACATGTATC AATTTCTGCT CAATTTCGAT
GCAATTGCAG GCCTGGTAAA AATAATTGAA TTCTTATGCC TCCAAAATAC GACGTTGACA
GAAACCATCA AGGAAATACC TGATTTCTAC GTCAGCAAAA AGAAGATTTT CTGTCCGTGG
GAGTTGAAAG GCCGGGTAAT GAGAACCCTT ATAACCGAAA AAGACCAGGA AAAAGTGGAA
CTTTTGGACG GCGTGAAATT TATCCTGGAG AACGGTTGGG CCCTTGTCCT GCCCGACGCG
GATATGCCCC TTTGCAGGGT TTACTCCGAA GGGGTGACAC CTGAAGTGGC GGAAACCATT
TCAGACAAAT ATATTGACAA AATAAAAGCA ATAATAAATG ATAAGAAGTA G
 
Protein sequence
MKAIIMAGGE GSRLRPLTCD LPKPMVPIMN IPIMEHIINL LKKHGITEIG VTLMYLPQKI 
KDYFGNGSNF GVNITYFTED TPLGTAGSVK NAEDFLDETF IVISGDSLTN MNITKAIEFH
RMKNSKATLV LTRVDVPLEY GVVITDKSGA ITGFLEKPSW GEVFSDTVNT GAYILEPEIL
KYLEKGKKVD FSQDLFPYLL LKKEPMYGYV MDDYWCDIGD LQAYLQAHYD VLEGKIQLDI
NGTEIQKGVW VGSGAIIEPG AILNPPCVIG DNCRIESGAV IDSLSVIGNN NVIERDSSVK
RSVIWDGNYI EYGSKIRGAI LCSKTNLKRY VHIFENAIVG DNCLINERVV IKPNIKIWPQ
KTVEPFAIVD RNIIWGSKHS KSIFGENGLS GIINVDISPE FATRLGAAYG SIFKKGSKVV
VSSTTANSAR MFKHAFISGI LSVGVEVFNL SSLLTPLARH AINFLSVEGG IHIKLSEDNP
NKLKVDFMDS KGASISRVTE RKIENSFARE DFKRCSGDEV SRLNNITDFK NYYVRSILNE
VNVEAIKNNP PKLCIVSPSD FVISIVVPML TDLGCKVASF SSTNVSEVDT IVDEIKDNNA
SFAAFIDSNG ETLVLIDKNG NVVKDDLFLC LTSLITFKSV PNSKVVVPIT APSIIETLAE
RYNGKVVRTK TSPQAVMEQM LNHNLFKNRE NMYQFLLNFD AIAGLVKIIE FLCLQNTTLT
ETIKEIPDFY VSKKKIFCPW ELKGRVMRTL ITEKDQEKVE LLDGVKFILE NGWALVLPDA
DMPLCRVYSE GVTPEVAETI SDKYIDKIKA IINDKK