Gene Cthe_1245 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1245 
Symbol 
ID4809750 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1508881 
End bp1510140 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content45% 
IMG OID640106668 
Productphosphoribosylamine--glycine ligase 
Protein accessionYP_001037670 
Protein GI125973760 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0151] Phosphoribosylamine-glycine ligase 
TIGRFAM ID[TIGR00877] phosphoribosylamine--glycine ligase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.495633 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGGTAC TGGTTGTAGG AAGCGGTGGC CGTGAGCATG CATTGGTATG GAAAATTGTT 
CAAAGTCCCA GGGTGGAGAA AGTGTACTGT GCTCCCGGCA ACGGAGGAAT TTCCGCAATA
GCAGAGTGTG TGCCGATAAA GGCAATGGAT ATAGAGGGAA TTGTAAACTT TTCGAAGGAA
AAGAAAATTG ACATGGTAGT GGTTGCGCCT GACGACCCTC TTGCAGCCGG TATGGTTGAT
GCTCTTGAGG AAGCGGGAAT CAGAGCGTTT GGACCAAACA AGGCGGCGGC CGTGATTGAA
AGCAGCAAGG CTTTTGCGAA AAATCTTATG AAGAAATACA ATATACCCAC TGCCAGATAT
GAAATTTTTG AAAACAGCGC CGATGCCATA AATTATTTAC AGGATCAGAA ATATCCGGTG
GTTGTGAAAG CCGATGGTTT GGCGCTGGGC AAAGGTGTAA TAATTGCACA GAATTTTGAC
GAGGCCAAAC AAGCGGTTCA AAGCATCATG GAGGACAAGG TCTTTGGTGA AGCCGGAAAC
AAGGTGGTAA TTGAAGAGTT TTTGGTGGGT CAGGAGGTAT CAATGCTGGC CTTTACCGAC
GGAAAAACCA TAAAGACCAT GGTTTCCTCC CAGGACCACA AGAGGGCTTT GGACAATGAC
CAGGGGCTTA ATACCGGAGG TATGGGAACA TTTTCGCCCA GCAGGATATA TACAGAGGAA
ATTGACAGGT ACTGCATGGA AAGAATCTAC AAGCCCACGA TTGAGGCTAT GGAGAAAGAA
GGCAGGAAGT TTAAGGGAGT ATTGTACTTT GGGCTTATTA TCACAAAAGA CGGCCCGAAA
GTACTGGAGT ATAATGCAAG ATTCGGAGAT CCGGAGACCC AGGTGGTGCT TCCAAGGCTT
AATACCGACA TAATTGATAT ATTTGAAGCT GTTATTGATG AAAGATTGGA TGAAGTTGAA
ATAAGCTGGA ATGACAGTGC ATGTGTGTGC GTAATCATGG CTTCGGGAGG ATACCCAAAA
GAGTATAAAA CCGGCTATGA GATATCAGGT ATTGAAGATG CGGAAAGAGA TGCCAACATA
GTGGTCTTCC ACGCAGGTAC AAAGCGTGAA AACGGCAAAT ATTACACCGC AGGAGGACGT
GTGCTGGGAG TTACGGCCAT GGAAAACACT TTGGATGAGG CAATAAAGAA GGCTTACGAG
GGCGTAGGAA AGATTAAGTT CCAGGATATG CACTATCGAA AGGATATAGG AAAAAAGTAG
 
Protein sequence
MKVLVVGSGG REHALVWKIV QSPRVEKVYC APGNGGISAI AECVPIKAMD IEGIVNFSKE 
KKIDMVVVAP DDPLAAGMVD ALEEAGIRAF GPNKAAAVIE SSKAFAKNLM KKYNIPTARY
EIFENSADAI NYLQDQKYPV VVKADGLALG KGVIIAQNFD EAKQAVQSIM EDKVFGEAGN
KVVIEEFLVG QEVSMLAFTD GKTIKTMVSS QDHKRALDND QGLNTGGMGT FSPSRIYTEE
IDRYCMERIY KPTIEAMEKE GRKFKGVLYF GLIITKDGPK VLEYNARFGD PETQVVLPRL
NTDIIDIFEA VIDERLDEVE ISWNDSACVC VIMASGGYPK EYKTGYEISG IEDAERDANI
VVFHAGTKRE NGKYYTAGGR VLGVTAMENT LDEAIKKAYE GVGKIKFQDM HYRKDIGKK