Gene Saro_1925 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_1925 
SymbolispDF 
ID3917148 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp2036643 
End bp2037803 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content65% 
IMG OID640444671 
Productbifunctional 2-C-methyl-D-erythritol 4-phosphate cytidylyltransferase/2-C-methyl-D-erythritol 2,4-cyclodiphosphate synthase protein 
Protein accessionYP_497199 
Protein GI87199942 
COG category[I] Lipid transport and metabolism 
COG ID[COG0245] 2C-methyl-D-erythritol 2,4-cyclodiphosphate synthase
[COG1211] 4-diphosphocytidyl-2-methyl-D-erithritol synthase 
TIGRFAM ID[TIGR00151] 2C-methyl-D-erythritol 2,4-cyclodiphosphate synthase
[TIGR00453] 2-C-methyl-D-erythritol 4-phosphate cytidylyltransferase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.0865052 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACAGTG TCCCGTCCCT TCCCGGCCAG TCGGTTGCCG CAGTTGTCGT TGCCGGTGGC 
AAGGGGCTGC GCACGGGCGG GCCCGTGCCA AAGCAGTTCG TGATCTGGCG CGGAAAGCCG
CTTTTGCGCC ATTGCGTGGA GGCACTCGAG GCCGCCGGAA TTGCACCGAT CGTCGTCGCC
ATTCCCGCAG GCTGGGACGA AGCGGCGACG CAGGCGTTGG CGGGAATCTC CATGGTCCGC
CTCGTTCACG GCGGTGCGAC ACGGCAGGAA TCTGTGAAGG CCGCGCTCGA AGTGCTGGAA
GGCGATGCGC CCGCTCGCGT GCTCATCCAT GATGCCGCAC GGCCAGACCT GCCGGGTTCC
GTGATCGAAA GGCTCCTCAC CGCGCTGGAC AAGCGTACCG GGGCCATTCC GGTGCTGCCC
GTTGTGGACA GCATGGTGCG CGGATCCGGA GACGCGATGG GCGAAACGGT TGCCCGTGAA
GACCTGTATC GCGTCCAGAC TCCGCAAGCG TTCCACTATC CGGCAATCCT TGCCGCCCAT
AGGGCCTGGC AGGGTGAGGC TCTTGCCGGC GATGACGCGC AAGTGGCCAT GCGAGCAGCG
CACGAGATCG CGCTCGTCGA GGGCGATGAA GCATTGCGAA AGGTGACGTT CGCGTCCGAT
CTCGAGGAGC AGAGCATGAG CGTCATTCCC CGCACCGGAA TGGGCTTCGA CGTCCATAGG
CTGGTGGAAG GCGAGGAACT TTGGCTTTGC GGGGTGAATA TCCCGCACGG AAAAGGTCTT
TCAGGACATT CGGATGCGGA CGTCGCGATC CACGCACTTG TCGACGCATT GCTCGGCGCG
ATTGCGGCGG GGGATATCGG CGATCATTTT CCGCCGTCCG ATCCGCAGTG GAAGGGGGCC
TCGTCGGACC GTTTTCTCGC GCACGCGGGC ACCCTGGTGA CCGAAGCGGG TTACCGGATA
GCGAACGTCG ACGTGACGAT TATCTGCGAA GCCCCGAAGA TCGGACCGCA CAAGGCGGCC
ATGCGCGAGA CGCTTGCCCG GATTCTCGGG ATTGACTCCG CGCTGGTCTC GGTCAAGGCG
ACGACGACGG AACGCCTTGG ACTGACTGGT CGGGGCGAAG GCATAGCGGC GCAGGCCGTG
GCAACAGTTG TCTCGGGCTG A
 
Protein sequence
MNSVPSLPGQ SVAAVVVAGG KGLRTGGPVP KQFVIWRGKP LLRHCVEALE AAGIAPIVVA 
IPAGWDEAAT QALAGISMVR LVHGGATRQE SVKAALEVLE GDAPARVLIH DAARPDLPGS
VIERLLTALD KRTGAIPVLP VVDSMVRGSG DAMGETVARE DLYRVQTPQA FHYPAILAAH
RAWQGEALAG DDAQVAMRAA HEIALVEGDE ALRKVTFASD LEEQSMSVIP RTGMGFDVHR
LVEGEELWLC GVNIPHGKGL SGHSDADVAI HALVDALLGA IAAGDIGDHF PPSDPQWKGA
SSDRFLAHAG TLVTEAGYRI ANVDVTIICE APKIGPHKAA MRETLARILG IDSALVSVKA
TTTERLGLTG RGEGIAAQAV ATVVSG