Gene TM1040_2278 details

Gene Information

Locus tag: TM1040_2278
Symbol: nadE
ID: 4078462
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 2394154
End bp: 2395815
Gene Length: 1662 bp
Protein Length: 553 aa
Translation table: 11
GC content: 61%
IMG OID: 638007600
Product: NAD synthetase
Protein accession: YP_614272
Protein GI: 99082118
COG category: [H] Coenzyme transport and metabolism; [R] General function prediction only
COG ID: [COG0171] NAD synthase; [COG0388] Predicted amidohydrolase
TIGRFAM ID: [TIGR00552] NAD+ synthetase
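
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon. Neither assumption is stated in the record, and the function names are hypothetical.

```python
# Values copied from the gene record above.
START_BP = 2394154
END_BP = 2395815
GENE_LENGTH_BP = 1662
PROTEIN_LENGTH_AA = 553


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumes 1-based, inclusive)."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


gene_len = span_length(START_BP, END_BP)
protein_len = expected_protein_length(gene_len)

# Report whether the derived values agree with the listed fields.
print("gene length matches:", gene_len == GENE_LENGTH_BP)
print("protein length matches:", protein_len == PROTEIN_LENGTH_AA)
```

Under these assumptions, both derived values agree with the Gene Length and Protein Length fields listed in the record.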


Plasmid Coverage information

Number of covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Number of covering fosmid clones: 15
Fosmid unclonability p-value: 0.601825
Fosmid hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGGCCG ATAGTTTTCG CATCACCCTT GCGCAATTGA ATCCGACGGT TGGGGATTTG 
GCGGGCAATG CCGCCAAGGC TCTCTCGGCC TGGCAGGAGG GTCGCGCGGC GGATGCGGAT
CTCGTGGCGC TGCCGGAGAT GTTCATCACC GGCTACAACA CGCAGGATCT GGTGATGAAA
CCGGCCTTTC ATCAGGCTGC GATCGCTGAG GTCGAGGCGC TTGCGAAGGC CACTGCAGAG
GGTCCTGCCT TGGCCATCGG CAGCCCCTGG GTCGAGGACG GCAAGCTCTA CAATGCCTAT
CTGATCCTCA AGGGCGGCAA GATCGCCTCC AAATGCCTCA AGCACCATCT TCCCAATGAG
ACGGTGTTTG ACGAGGTTCG GATCTTTGAC GCAGGCCCGC TGGGCGGGCC GTATTCCGTG
GGCAACACCC GCATCGGCAG CCCGATCTGC GAGGATGGCT GGCACGAGGA TGTCGCCGAG
ACGCTGGAAG AAACCGGCGC TGAGTTCCTG CTCATTCCCA ATGGCTCGCC CTATTACCGC
GGCAAGATGG AAACCCGCAT CAACCACATG GTTGCTCGGG CGGTCGAAAC GGGCCTGCCG
GTGATCTACC TCAACATGGT GGGCGGTCAG GACGATCAGG TCTTTGATGG GGGCAGCTTT
GCGCTCAACC CGCATGGGGC TTTGGCGGTG CAGATGCCGG TCTTTGACGA ATGCATCGCA
CAGCTTGATC TCGAGCGCAC CGCCGATGGC TGGCGCATCA AAGAGGGCGA AAAGGCGCAT
CTACCCGATG CGTGGGAGCA GGACTATCGC ACCATGGTGC TGTCCTTGCG CGATTACATG
GGCAAGACCG GCTTCAAAAA AGCGCTGCTG GGCCTCTCGG GGGGGGTGGA TTCCGCCATC
GTTGCGGCCA TCGCCGTCGA TGCCTTGGGC GCGGAAAACG TGCGCTGCGT CATGCTGCCG
TCGGAATATA CCTCCAAGGA ATCACTCGAG GATGCAGAGG CCGTCGCCAA GGCGCTGGGC
GTCCACTACG ACTATGTGCC GATCTCTGAG GGCCGCGAAG CAATCACCAA CACGCTTGCG
CCGCTCTTTG CGGGACGGGA CGCGGATCTC ACCGAGGAAA ACATCCAGTC CCGCCTGCGC
GGGCTTCTGC TAATGGCCAT GTCGAACAAA TTTGGCGAGA TGCTCCTGAC CACCGGCAAT
AAATCCGAGG TCGCGGTGGG CTATGCCACC ATCTACGGCG ATATGAACGG CGGTTATAAC
CCGATCAAGG ATCTCTACAA GACGCGTGTG TTTGAAACCT GCCGCTGGCG CAATGCCAAT
CACCGCGACT GGATGATGGG GCCAGAGGGC GAGGTGATCC GCCCCAATGT GATCGACAAG
CCCCCCTCAG CCGAGCTGCG CGAGGACCAG AAGGACAGCG ATTCCCTGCC GGACTATCCC
GAGCTCGATG CGATTCTCGA CATTCTCGTG GATCAGGAAG GATCAATCGC CGATTGCGTC
GCCGCAGGCT TTGACCGCGA CGTGGCAAAA CGGGTCGAAC ACCTGCTTTA TATCAGCGAA
TACAAGCGCT TTCAGTCCGC TCCCGGCGCG CGGCTGACCA AGCGGGCCTT CTGGCTTGAT
CGGCGTTATC CGATTGTCAA TCGCTGGCGT GATCCCAGCT GA
 
Protein sequence
MMADSFRITL AQLNPTVGDL AGNAAKALSA WQEGRAADAD LVALPEMFIT GYNTQDLVMK 
PAFHQAAIAE VEALAKATAE GPALAIGSPW VEDGKLYNAY LILKGGKIAS KCLKHHLPNE
TVFDEVRIFD AGPLGGPYSV GNTRIGSPIC EDGWHEDVAE TLEETGAEFL LIPNGSPYYR
GKMETRINHM VARAVETGLP VIYLNMVGGQ DDQVFDGGSF ALNPHGALAV QMPVFDECIA
QLDLERTADG WRIKEGEKAH LPDAWEQDYR TMVLSLRDYM GKTGFKKALL GLSGGVDSAI
VAAIAVDALG AENVRCVMLP SEYTSKESLE DAEAVAKALG VHYDYVPISE GREAITNTLA
PLFAGRDADL TEENIQSRLR GLLLMAMSNK FGEMLLTTGN KSEVAVGYAT IYGDMNGGYN
PIKDLYKTRV FETCRWRNAN HRDWMMGPEG EVIRPNVIDK PPSAELREDQ KDSDSLPDYP
ELDAILDILV DQEGSIADCV AAGFDRDVAK RVEHLLYISE YKRFQSAPGA RLTKRAFWLD
RRYPIVNRWR DPS