Gene Cthe_0325 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0325 
SymbolnadE 
ID4808471 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp410598 
End bp412535 
Gene Length1938 bp 
Protein Length645 aa 
Translation table11 
GC content44% 
IMG OID640105736 
ProductNAD synthetase 
Protein accessionYP_001036756 
Protein GI125972846 
COG category[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG0171] NAD synthase
[COG0388] Predicted amidohydrolase 
TIGRFAM ID[TIGR00552] NAD+ synthetase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0404544 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAACG GTTTTTTCAG AGTGGGAGCG GCTGTTCCCA GATTGAAAGT AGGGGGATGC 
CGTTACAATT CCGATCAAAT AATCGGACTT ATTGGAAAAG GAGAAAAAGC AGGCATACAA
ATACTGGTTT TCCCTGAACT TTCCATAACG GGATATACAT GCGGGGATTT GTTTCACCAG
GAAACTTTGC TTGACGATGC CAAAGTGCAG TTGGGAAGGA TTCTGGAGGA GACTAAAAAC
TCCTCCTGTA TATCCCTGAT TGGCATGCCG CTGGGCATTG ACAATCAGCT TTTTAACTGC
GCCGTTGCAA TACAAAAAGG AAGAATACTT GGTGTTGTTC CAAAGACATA TGTTCCCAAC
TACAGTGAGT TTTACGAGCA AAGGTGGTTT TCTTCCGGCA GAAACGCTCT GAGGGATACA
ATTATGCTTT GCGGGCAGGA AGTACCCTTC GGGGATGATT TGCTGTTCGA GGACGAAAAA
GGGGAAATGT GCTTTGGAAT TGAGATTTGT GAAGATTTGT GGGTGCCTGT TCCTCCAAGC
TCTTTTCAGG CGATGGCCGG AGCGTTGGTT ATTTTTAATC TTTCTGCCAG CAATGAAATT
GTAGGCAAAT ATGAGTACAG AAAGGAACTG GCAAGGCAGC AGTCGGCAAG GTGTATAGCA
GGTTATGTTT ACACATCTTC GGGCGTGGAT GAATCGACCA CGGATGTTGT TTTCGGAGGT
CACGCAATGA TCTTCGAAAA CGGAAGTCTG CTTTGCGAAT CCGAAAGATT TTTGATTGAC
GAGCAGCTAA TTTTTTCTGA AATTGATATC CAGAAGCTGA TGAATGACAG AAGAAAAAAC
ACCAGCTTTA TGGAGCTTTG GAGAGATAAC GTAAGAGAGT TCCGGAAGGT GAAGTTTGAA
ATTGAGGAAT TTGAAGCGGA AAACATAACA AGATATGTGC CACCTCATCC TTTTGTGCCT
TCAGACGGGA GCAGCCGTGA CAGAAGATGC AGCGAGATTT TTGCAATTCA GACCTCCGCT
TTGGCAAAGA GAATCAGGCA TACTGGGCTG AAACGGGCTG TTATAGGCAT ATCGGGAGGC
CTTGATTCCA CACTGGCACT TTTGGTAACC GCTAAAGCTT TTGATTTGCT AAATATCCCT
AGAAAAAATA TTTTGGCAAT TACCATGCCG GGCTTTGGAA CTTCCGATGT GACTTATACC
AACGCCATGG AGTTTATGAA GTCAATGGAC GTGGAAATAC GGGAAATAAA TATTAAGGAT
GCATGTCTTC AGCATTTTAA GGATATCGGT CACGATCCCA GCATACATGA CATTACTTAT
GAAAATGTTC AGGCGAGGGA ACGTACGCAG ATATTAATGG ATATCGCAAA CAAGGAAGGC
GGTCTTGTAA TCGGGACCGG TGACCTTTCG GAGCTGGCAT TGGGCTGGTG TACCTATAAC
GGGGATCATA TGTCGATGTA TGCAGTAAAC GCCAGCATTC CCAAGACATT GGTGAGTTTT
CTTGTAAGAT GGGTTGCGGA CAATATGTTG GAAAGCAAAG CAAAGGATGT ATTGTACAGA
ATACTTGACA CTCCTATATC TCCGGAGCTT CTTCCTCCCG ATGCAAAGGG TGAAATTAAT
CAAAAGACGG AAGATATTAT AGGACCTTAT GAACTTCATG ATTTCTTCCT GTATCATATG
TTAAGGTACG GCGCAGCTCC GGGAAAAATA CTGATTCTTG CAAAGAAAGC CTTCGAAGGC
AAATACACGG ACGATACAAT AAAGAAATGG TTAAAGGTAT TTGTGAAACG CTTTTTCAGC
CAGCAGTTTA AAAGATCGTG TCTGCCTGAC GGGCCAAAGG TCGGTACAAT CAGCCTTTCA
CCAAGGGGAG ACTGGCGTAT GCCCAGTGAT GCCGTTGCAG ATTCGTGGTT GTCTGAATTG
GAAAGCATGC AGGAATAA
 
Protein sequence
MKNGFFRVGA AVPRLKVGGC RYNSDQIIGL IGKGEKAGIQ ILVFPELSIT GYTCGDLFHQ 
ETLLDDAKVQ LGRILEETKN SSCISLIGMP LGIDNQLFNC AVAIQKGRIL GVVPKTYVPN
YSEFYEQRWF SSGRNALRDT IMLCGQEVPF GDDLLFEDEK GEMCFGIEIC EDLWVPVPPS
SFQAMAGALV IFNLSASNEI VGKYEYRKEL ARQQSARCIA GYVYTSSGVD ESTTDVVFGG
HAMIFENGSL LCESERFLID EQLIFSEIDI QKLMNDRRKN TSFMELWRDN VREFRKVKFE
IEEFEAENIT RYVPPHPFVP SDGSSRDRRC SEIFAIQTSA LAKRIRHTGL KRAVIGISGG
LDSTLALLVT AKAFDLLNIP RKNILAITMP GFGTSDVTYT NAMEFMKSMD VEIREINIKD
ACLQHFKDIG HDPSIHDITY ENVQARERTQ ILMDIANKEG GLVIGTGDLS ELALGWCTYN
GDHMSMYAVN ASIPKTLVSF LVRWVADNML ESKAKDVLYR ILDTPISPEL LPPDAKGEIN
QKTEDIIGPY ELHDFFLYHM LRYGAAPGKI LILAKKAFEG KYTDDTIKKW LKVFVKRFFS
QQFKRSCLPD GPKVGTISLS PRGDWRMPSD AVADSWLSEL ESMQE