Gene Cthe_0372 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0372 
Symbol 
ID4808449 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp467460 
End bp468854 
Gene Length1395 bp 
Protein Length464 aa 
Translation table11 
GC content46% 
IMG OID640105786 
Productsulfide dehydrogenase (flavoprotein) subunit SudA 
Protein accessionYP_001036803 
Protein GI125972893 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG0493] NADPH-dependent glutamate synthase beta chain and related oxidoreductases 
TIGRFAM ID[TIGR01316] glutamate synthase (NADPH), homotetrameric 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.63559 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCTAACA TGTCACCGAA AAAAGTTCCC ATGCCGGAGC AGGACCCAAA CGTCAGAATC 
AAAAACTTTT TGGAGGTTGC TTTAGGATAT ACCGAGCAAA TGGCAATGGA AGAAGCTCAA
AGGTGTCTTA ACTGCAAGCA CAAACCTTGT GTTTCCGGCT GTCCCGTAAA CGTAAAAATT
CCTGAGTTTG TACAGCTTAT CGCTCAGGGA AAATTTGAGG AAGCCTACAA TAAAATAAGA
GAAACCAACA ACCTTCCGGC AATATGCGGC AGAGTCTGTC CGCAGGAAAA CCAGTGTGAA
AAGTTCTGTG TAAGGGGTAT AAAAGGTGAA CCTGTTGCCA TAGGAAGGCT TGAAAGATTT
GCGGCGGACT GGCACATGAA AAACGGCACC ACTTCTTATG AAAAGCCTGA AAAAAACGGC
AAAAGGGTGG CAGTAATAGG TTCGGGACCT GCAAGCCTTA CCTGTGCAAG CGACCTGGCC
AAACTCGGCT ACGAAGTAAC AATCTTCGAA GCCTTTCACG TGCCCGGCGG AGTGTTGATG
TACGGTATTC CGGAATTCAG GCTTCCAAAG AAACTGGTTC AGGAGGAAAT TGAAACCATA
AAGCAGCTGG GTGTGGAAAT TAAAACAAAT ATGGTTATAG GAAAGGTTTA TTCCATTGAC
GAACTCAAAG CTGAAGGATA TGATGCCATA TTTATAGGCT CGGGTGCCGG ATTGCCTTCA
TTTATGAAAA TTCCCGGAGA AAACCTCAAC GGAGTTTACT CGGCAAATGA GTTTCTCACA
AGAATAAACC TCATGAAGGC TTATGAATTC CCCAACTGCG ATACTCCCGT GAAAGTAGGA
AAGAATGTCG CCGTTGTGGG CGGAGGAAAT GTCGCAATGG ACGCCGCAAG AAGCGCAAAA
AGACTTGGCG CGGAAAACGT TTATATAGTA TACAGGCGTT CGGAAGCGGA AATGCCCGCA
AGACTTGAAG AAATTCATCA CGCAAAGGAA GAAGGAATTT TGTTCAAATT CCTTACAAAC
CCCACAAGAA TTCTTGGCAC CGACGACGGC TGGGTCAAAG GCATGGAGTG CATAGAGATG
GAGCTGGGCG AACCTGATGA ATCCGGAAGA AGAAGACCCG TGCCAAAGCC GGGATCCGAA
CATGTAATTG ATGTTGAAAC GGTTATTATC GCCATCGGCC AAACTCCAAA TCCGTTAATT
GCCTCAACAA CCCCGGGGCT GGCCACTCAA AAATGGGGCG GAATTATTGT CGATGAAAAC
ACCGGCGCCA CCAACATAGA AGGTGTATAT GCCGGCGGAG ATGCGGTAAC CGGTGCCGCA
ACCGTCATTC TTGCAATGGG AGCAGGCAAA AAAGCCGCAA AGGCAATTGA CGAATATCTT
AAAAACAAAA AATAG
 
Protein sequence
MPNMSPKKVP MPEQDPNVRI KNFLEVALGY TEQMAMEEAQ RCLNCKHKPC VSGCPVNVKI 
PEFVQLIAQG KFEEAYNKIR ETNNLPAICG RVCPQENQCE KFCVRGIKGE PVAIGRLERF
AADWHMKNGT TSYEKPEKNG KRVAVIGSGP ASLTCASDLA KLGYEVTIFE AFHVPGGVLM
YGIPEFRLPK KLVQEEIETI KQLGVEIKTN MVIGKVYSID ELKAEGYDAI FIGSGAGLPS
FMKIPGENLN GVYSANEFLT RINLMKAYEF PNCDTPVKVG KNVAVVGGGN VAMDAARSAK
RLGAENVYIV YRRSEAEMPA RLEEIHHAKE EGILFKFLTN PTRILGTDDG WVKGMECIEM
ELGEPDESGR RRPVPKPGSE HVIDVETVII AIGQTPNPLI ASTTPGLATQ KWGGIIVDEN
TGATNIEGVY AGGDAVTGAA TVILAMGAGK KAAKAIDEYL KNKK