Gene Cthe_3113 details


Gene Information

Locus tag: Cthe_3113
Symbol:
ID: 4809744
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 3673753
End bp: 3674802
Gene Length: 1050 bp
Protein Length: 349 aa
Translation table: 11
GC content: 41%
IMG OID: 640108546
Product: nucleotidyl transferase
Protein accession: YP_001039501
Protein GI: 125975591
COG category: [J] Translation, ribosomal structure and biogenesis; [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1208] Nucleoside-diphosphate-sugar pyrophosphorylase involved in lipopolysaccharide biosynthesis/translation initiation factor 2B, gamma/epsilon subunits (eIF-2Bgamma/eIF-2Bepsilon)
TIGRFAM ID: [TIGR01208] glucose-1-phosphate thymidylyltransferase, long form
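
The start, end, gene length, and protein length fields above are tied together by simple arithmetic. The following is a minimal Python sketch of that relationship. It assumes 1-based inclusive coordinates and an unspliced CDS whose last codon is a stop codon; the function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(3673753, 3674802)
print(length)                     # 1050
print(protein_length_aa(length))  # 349
```

Both results agree with the Gene Length and Protein Length values listed in the record.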


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.409116
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACGTGA AGGCATTATT TTTAGCAGGT GGTTTAGGAA CTCGTTTAAG ACCCATTACC 
AATGATTTGC CCAAGCCAAT GGTGCCCATT ATGGGAAAGC CACTGTTGGA AAGAAATATA
GAAAAGCTAA AAAGCTATGG AATTGATGAG GTTGTATTGA GTACATGCTA TAAGCCACAT
AAAATTGACA AATATTTCGG AGACGGGAAA AAATTCGGTG TTAAAATAAG CTACATAACA
GAGGATAAGC CTTTGGGGAC TGCAGGAGCC ATAAAAAACG CAGAGGAGCT TTTAAGTGAC
ACGTTCCTGG TGTTTAACGC CGATATATTG AGCGATATAG ACATAGCTAA CATGATACGT
TTCCACAAGG AAAAAGGGGC ACTTGCAACC ATTGCCGTAA CCAAGGTTGA CAATCCGTCG
GCGTATGGAG TCATTGAACA TGACGATGAT AATTTTATTA CGGCCTTTAA AGAAAAGCCT
CAGCCTCATG AGAGCAAATC CAATTTAATT AATGCAGGAG TATATATCTT TGAAAAGGAA
CTTTTAAACC ATATTCCTCG CGGAAGGGCT GTGTCCATTG AAAGAGAAAC CTATCCTTTG
CTGCTTGAAA AAGGATACAA GATGGCAGTG TACAATAAAT GCGGCTACTG GCTTGATTTG
GGCACGCCGG GAAAATATCT TAAGGTACAC AAGGACATAC TCAAAGGTCT TGTACCAATT
GGAAATTATG ATTTCGGACA GAACCGCACA TACATCAGCA AAAGTGCTAA AATTGACCGG
AGCGCAAAAA TAAGAGGGCC GGTATACATT GGTGAAAATG TTGTAATCGG CCCCTCGGCG
GTGATAGGTC CTAATGCGGT TTTATTCGAT GACGCTGTTG TCGGAATGGG AGCAAAGGTT
GTGGACAGCG TGGTTTGGGA CAATGTTAAT GTGGAGAGAG GAGCAACGGT TGTAAATTCT
GTAATTATGT CAAATTGCAG AGTTGATGAA GACAGTGAAA AATACAATTC CGTTTTGACA
GAAAATTTCA GCGAGCCGAT AGCGGTATAA
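
The GC content field in the record can be recomputed from a gene sequence like the one above. This is a rough Python sketch, not the database's own method: it assumes GC content means the fraction of G and C among the A/C/G/T bases, and `gc_content` is a hypothetical helper name. Spaces and line breaks from the 10-base block layout are ignored.

```python
def gc_content(seq: str) -> float:
    """Fraction of G/C among A/C/G/T bases; whitespace and other symbols are skipped."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no A/C/G/T bases found")
    return sum(b in "GC" for b in bases) / len(bases)


# Usage on the first line of the listing; paste the full listing to check the whole gene.
first_line = "ATGAACGTGA AGGCATTATT TTTAGCAGGT GGTTTAGGAA CTCGTTTAAG ACCCATTACC"
print(f"{gc_content(first_line):.0%}")
```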
 
Protein sequence
MNVKALFLAG GLGTRLRPIT NDLPKPMVPI MGKPLLERNI EKLKSYGIDE VVLSTCYKPH 
KIDKYFGDGK KFGVKISYIT EDKPLGTAGA IKNAEELLSD TFLVFNADIL SDIDIANMIR
FHKEKGALAT IAVTKVDNPS AYGVIEHDDD NFITAFKEKP QPHESKSNLI NAGVYIFEKE
LLNHIPRGRA VSIERETYPL LLEKGYKMAV YNKCGYWLDL GTPGKYLKVH KDILKGLVPI
GNYDFGQNRT YISKSAKIDR SAKIRGPVYI GENVVIGPSA VIGPNAVLFD DAVVGMGAKV
VDSVVWDNVN VERGATVVNS VIMSNCRVDE DSEKYNSVLT ENFSEPIAV
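
The protein sequence corresponds to the gene sequence read codon by codon. Here is a small Python sketch of that translation, under a few assumptions: translation table 11 assigns the same amino acids to codons as the standard code (it differs in which start codons are allowed), the gene starts with ATG as it does here, and `translate` is a hypothetical helper rather than any library's API. Alternative start codons such as GTG or TTG are not handled specially.

```python
from itertools import product

# Codon table in TCAG order; amino-acid assignments shared by tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string from its first base, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces and line breaks of the listing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGAACGTGA AGGCATTATT TTTAGCAGGT"))  # MNVKALFLAG
```

The printed residues match the first ten residues of the protein sequence listed above.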