Gene Information |
Locus tag | Cthe_0325 |
Symbol | nadE |
ID | 4808471 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 410598 |
End bp | 412535 |
Gene Length | 1938 bp |
Protein Length | 645 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640105736 |
Product | NAD synthetase |
Protein accession | YP_001036756 |
Protein GI | 125972846 |
COG category | [H] Coenzyme transport and metabolism; [R] General function prediction only |
COG ID | [COG0171] NAD synthase; [COG0388] Predicted amidohydrolase |
TIGRFAM ID | [TIGR00552] NAD+ synthetase |
|
|
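The coordinate and length fields in the table above can be cross-checked against each other. The sketch below is only an illustration: it assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `span_length` and `coding_codons` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 410598
END_BP = 412535
GENE_LENGTH_BP = 1938
PROTEIN_LENGTH_AA = 645


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


computed_length = span_length(START_BP, END_BP)
computed_protein = coding_codons(computed_length)

# Both comparisons should hold if the assumptions above are right.
print(computed_length == GENE_LENGTH_BP)
print(computed_protein == PROTEIN_LENGTH_AA)
```

Under these assumptions the span 410598 to 412535 gives 1938 bp, and 1938 / 3 minus one stop codon gives 645 codons, which agrees with the Gene Length and Protein Length fields.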
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0404544 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAACG GTTTTTTCAG AGTGGGAGCG GCTGTTCCCA GATTGAAAGT AGGGGGATGC CGTTACAATT CCGATCAAAT AATCGGACTT ATTGGAAAAG GAGAAAAAGC AGGCATACAA ATACTGGTTT TCCCTGAACT TTCCATAACG GGATATACAT GCGGGGATTT GTTTCACCAG GAAACTTTGC TTGACGATGC CAAAGTGCAG TTGGGAAGGA TTCTGGAGGA GACTAAAAAC TCCTCCTGTA TATCCCTGAT TGGCATGCCG CTGGGCATTG ACAATCAGCT TTTTAACTGC GCCGTTGCAA TACAAAAAGG AAGAATACTT GGTGTTGTTC CAAAGACATA TGTTCCCAAC TACAGTGAGT TTTACGAGCA AAGGTGGTTT TCTTCCGGCA GAAACGCTCT GAGGGATACA ATTATGCTTT GCGGGCAGGA AGTACCCTTC GGGGATGATT TGCTGTTCGA GGACGAAAAA GGGGAAATGT GCTTTGGAAT TGAGATTTGT GAAGATTTGT GGGTGCCTGT TCCTCCAAGC TCTTTTCAGG CGATGGCCGG AGCGTTGGTT ATTTTTAATC TTTCTGCCAG CAATGAAATT GTAGGCAAAT ATGAGTACAG AAAGGAACTG GCAAGGCAGC AGTCGGCAAG GTGTATAGCA GGTTATGTTT ACACATCTTC GGGCGTGGAT GAATCGACCA CGGATGTTGT TTTCGGAGGT CACGCAATGA TCTTCGAAAA CGGAAGTCTG CTTTGCGAAT CCGAAAGATT TTTGATTGAC GAGCAGCTAA TTTTTTCTGA AATTGATATC CAGAAGCTGA TGAATGACAG AAGAAAAAAC ACCAGCTTTA TGGAGCTTTG GAGAGATAAC GTAAGAGAGT TCCGGAAGGT GAAGTTTGAA ATTGAGGAAT TTGAAGCGGA AAACATAACA AGATATGTGC CACCTCATCC TTTTGTGCCT TCAGACGGGA GCAGCCGTGA CAGAAGATGC AGCGAGATTT TTGCAATTCA GACCTCCGCT TTGGCAAAGA GAATCAGGCA TACTGGGCTG AAACGGGCTG TTATAGGCAT ATCGGGAGGC CTTGATTCCA CACTGGCACT TTTGGTAACC GCTAAAGCTT TTGATTTGCT AAATATCCCT AGAAAAAATA TTTTGGCAAT TACCATGCCG GGCTTTGGAA CTTCCGATGT GACTTATACC AACGCCATGG AGTTTATGAA GTCAATGGAC GTGGAAATAC GGGAAATAAA TATTAAGGAT GCATGTCTTC AGCATTTTAA GGATATCGGT CACGATCCCA GCATACATGA CATTACTTAT GAAAATGTTC AGGCGAGGGA ACGTACGCAG ATATTAATGG ATATCGCAAA CAAGGAAGGC GGTCTTGTAA TCGGGACCGG TGACCTTTCG GAGCTGGCAT TGGGCTGGTG TACCTATAAC GGGGATCATA TGTCGATGTA TGCAGTAAAC GCCAGCATTC CCAAGACATT GGTGAGTTTT CTTGTAAGAT GGGTTGCGGA CAATATGTTG GAAAGCAAAG CAAAGGATGT ATTGTACAGA ATACTTGACA CTCCTATATC TCCGGAGCTT CTTCCTCCCG ATGCAAAGGG TGAAATTAAT CAAAAGACGG AAGATATTAT AGGACCTTAT GAACTTCATG ATTTCTTCCT GTATCATATG TTAAGGTACG GCGCAGCTCC GGGAAAAATA CTGATTCTTG CAAAGAAAGC CTTCGAAGGC AAATACACGG ACGATACAAT AAAGAAATGG TTAAAGGTAT TTGTGAAACG CTTTTTCAGC 
CAGCAGTTTA AAAGATCGTG TCTGCCTGAC GGGCCAAAGG TCGGTACAAT CAGCCTTTCA CCAAGGGGAG ACTGGCGTAT GCCCAGTGAT GCCGTTGCAG ATTCGTGGTT GTCTGAATTG GAAAGCATGC AGGAATAA
|
Protein sequence | MKNGFFRVGA AVPRLKVGGC RYNSDQIIGL IGKGEKAGIQ ILVFPELSIT GYTCGDLFHQ ETLLDDAKVQ LGRILEETKN SSCISLIGMP LGIDNQLFNC AVAIQKGRIL GVVPKTYVPN YSEFYEQRWF SSGRNALRDT IMLCGQEVPF GDDLLFEDEK GEMCFGIEIC EDLWVPVPPS SFQAMAGALV IFNLSASNEI VGKYEYRKEL ARQQSARCIA GYVYTSSGVD ESTTDVVFGG HAMIFENGSL LCESERFLID EQLIFSEIDI QKLMNDRRKN TSFMELWRDN VREFRKVKFE IEEFEAENIT RYVPPHPFVP SDGSSRDRRC SEIFAIQTSA LAKRIRHTGL KRAVIGISGG LDSTLALLVT AKAFDLLNIP RKNILAITMP GFGTSDVTYT NAMEFMKSMD VEIREINIKD ACLQHFKDIG HDPSIHDITY ENVQARERTQ ILMDIANKEG GLVIGTGDLS ELALGWCTYN GDHMSMYAVN ASIPKTLVSF LVRWVADNML ESKAKDVLYR ILDTPISPEL LPPDAKGEIN QKTEDIIGPY ELHDFFLYHM LRYGAAPGKI LILAKKAFEG KYTDDTIKKW LKVFVKRFFS QQFKRSCLPD GPKVGTISLS PRGDWRMPSD AVADSWLSEL ESMQE
|
| |
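The sequences in this record are printed in space-separated blocks of 10 residues, so they need to be normalized before any computation. The following is a minimal sketch of how that might be done, together with a GC fraction calculation and a basic start/stop codon check. The helper names (`normalize_sequence`, `gc_fraction`, `has_atg_start`, `ends_with_stop`) are hypothetical, and the stop codon set assumes translation table 11 uses TAA, TAG and TGA. The demo strings are only the first two and the last two blocks of the Gene sequence row, not the full gene.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # assumed stop codons for translation table 11


def normalize_sequence(raw: str) -> str:
    """Remove whitespace between blocks and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = normalize_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def has_atg_start(raw: str) -> bool:
    """True if the sequence begins with the ATG start codon."""
    return normalize_sequence(raw).startswith("ATG")


def ends_with_stop(raw: str) -> bool:
    """True if the last three bases form a stop codon."""
    return normalize_sequence(raw)[-3:] in STOP_CODONS


# First two and last two blocks of the Gene sequence row (not the full gene).
head = "ATGAAAAACG GTTTTTTCAG"
tail = "GAAAGCATGC AGGAATAA"

print(gc_fraction(head))
print(has_atg_start(head), ends_with_stop(tail))
```

Running `gc_fraction` on the complete Gene sequence value should give a number comparable to the GC content field, subject to how the database rounds it.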