Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_3113 |
Symbol | |
ID | 4809744 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3673753 |
End bp | 3674802 |
Gene Length | 1050 bp |
Protein Length | 349 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640108546 |
Product | nucleotidyl transferase |
Protein accession | YP_001039501 |
Protein GI | 125975591 |
COG category | [J] Translation, ribosomal structure and biogenesis [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1208] Nucleoside-diphosphate-sugar pyrophosphorylase involved in lipopolysaccharide biosynthesis/translation initiation factor 2B, gamma/epsilon subunits (eIF-2Bgamma/eIF-2Bepsilon) |
TIGRFAM ID | [TIGR01208] glucose-1-phosphate thymidylylransferase, long form |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.409116 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGTGA AGGCATTATT TTTAGCAGGT GGTTTAGGAA CTCGTTTAAG ACCCATTACC AATGATTTGC CCAAGCCAAT GGTGCCCATT ATGGGAAAGC CACTGTTGGA AAGAAATATA GAAAAGCTAA AAAGCTATGG AATTGATGAG GTTGTATTGA GTACATGCTA TAAGCCACAT AAAATTGACA AATATTTCGG AGACGGGAAA AAATTCGGTG TTAAAATAAG CTACATAACA GAGGATAAGC CTTTGGGGAC TGCAGGAGCC ATAAAAAACG CAGAGGAGCT TTTAAGTGAC ACGTTCCTGG TGTTTAACGC CGATATATTG AGCGATATAG ACATAGCTAA CATGATACGT TTCCACAAGG AAAAAGGGGC ACTTGCAACC ATTGCCGTAA CCAAGGTTGA CAATCCGTCG GCGTATGGAG TCATTGAACA TGACGATGAT AATTTTATTA CGGCCTTTAA AGAAAAGCCT CAGCCTCATG AGAGCAAATC CAATTTAATT AATGCAGGAG TATATATCTT TGAAAAGGAA CTTTTAAACC ATATTCCTCG CGGAAGGGCT GTGTCCATTG AAAGAGAAAC CTATCCTTTG CTGCTTGAAA AAGGATACAA GATGGCAGTG TACAATAAAT GCGGCTACTG GCTTGATTTG GGCACGCCGG GAAAATATCT TAAGGTACAC AAGGACATAC TCAAAGGTCT TGTACCAATT GGAAATTATG ATTTCGGACA GAACCGCACA TACATCAGCA AAAGTGCTAA AATTGACCGG AGCGCAAAAA TAAGAGGGCC GGTATACATT GGTGAAAATG TTGTAATCGG CCCCTCGGCG GTGATAGGTC CTAATGCGGT TTTATTCGAT GACGCTGTTG TCGGAATGGG AGCAAAGGTT GTGGACAGCG TGGTTTGGGA CAATGTTAAT GTGGAGAGAG GAGCAACGGT TGTAAATTCT GTAATTATGT CAAATTGCAG AGTTGATGAA GACAGTGAAA AATACAATTC CGTTTTGACA GAAAATTTCA GCGAGCCGAT AGCGGTATAA
|
Protein sequence | MNVKALFLAG GLGTRLRPIT NDLPKPMVPI MGKPLLERNI EKLKSYGIDE VVLSTCYKPH KIDKYFGDGK KFGVKISYIT EDKPLGTAGA IKNAEELLSD TFLVFNADIL SDIDIANMIR FHKEKGALAT IAVTKVDNPS AYGVIEHDDD NFITAFKEKP QPHESKSNLI NAGVYIFEKE LLNHIPRGRA VSIERETYPL LLEKGYKMAV YNKCGYWLDL GTPGKYLKVH KDILKGLVPI GNYDFGQNRT YISKSAKIDR SAKIRGPVYI GENVVIGPSA VIGPNAVLFD DAVVGMGAKV VDSVVWDNVN VERGATVVNS VIMSNCRVDE DSEKYNSVLT ENFSEPIAV
|
| |