Gene Saro_1696 details


Gene Information

Locus tag: Saro_1696
Symbol: fumC
ID: 3916271
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 1782016
End bp: 1783392
Gene Length: 1377 bp
Protein Length: 458 aa
Translation table: 11
GC content: 66%
IMG OID: 640444437
Product: fumarate hydratase
Protein accession: YP_496970
Protein GI: 87199713
COG category: [C] Energy production and conversion
COG ID: [COG0114] Fumarase
TIGRFAM ID: [TIGR00979] fumarate hydratase, class II
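
The start, end, gene length, and protein length listed above are mutually consistent, which a short check can confirm. The following is a minimal Python sketch; the variable names are illustrative and not part of the record, and it assumes 1-based inclusive coordinates and that the final codon is a stop codon not counted in the protein length.

```python
# Values copied from the record above.
start_bp = 1782016
end_bp = 1783392

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # prints: 1377 458
```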


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.397787
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCACCGGA TCGAAACCGA CAGCCTGGGC GAAGTGCGCG TGCCCGCAGA TGCCTATTGG 
GGCGCCCAGA CGCAGCGAAG CATCGAGAAC TTTCCGTTCG GCGAGATGGA GCGGATGCCG
ATTGGCATCA TTCGCGCACT GGCGCTGGTA AAACGGGCAG CGGCGGGGGT GAACCGGGCA
CACGGGCTAG ACCCGACGCT GGCCGACGCC ATCGAACGGG CCGCCGCCGA AGTGGTCGAC
GGCAAGCTGG ACGACCAGTT CCCGCTGGTC ATCTGGCAGA CCGGCAGCGG CACCCAGTCC
AACATGAACG CCAACGAAGT CATCGCAGGG CGCGCCAACG AGATCCTTAC CGGCAAGCGC
GGAGGCAAGA GCCCGGTCCA TCCCAATGAC CACGTAAACA TGGGCCAGTC CTCGAACGAC
ACCTTCCCTA CCGCGCTGCA CGTCGCGGCC GTCGAGGCGC TGAACCGAGA CCTGCTGCCC
GCGCTGGGAC GGCTGCGGGG CGCGCTGGCG GGCGCGGCGC TCGCGTGGAA GGATATCGTC
AAGATCGGGC GAACCCACTT GCAGGATGCG ACCCCGCTGA CACTGGGGCA GGAGTTTTCC
GGCTATGCCG AGCAGTTGCG CGGCATCGAA GCGCACCTGA AGGCCGCCGA GACGCACCTT
CTGCGACTGG CGCAAGGCGG GACTGCGGTG GGGACCGGGC TCAATGCGCC GGAAGGCTTT
GGCGACGCCA TGGCGGCAGA AATCGCCGCA TTGACCGGCA GGCCCTTCGT TTCCGCGCCG
AACAAGTTCG AAGCGCTCGC CAGCAACGAC GCTCTTGTCC AGTTTTCGGG GACGCTTTCG
ACGCTGGCGG TGGCGCTGAC CAAGATTGCC AACGACATCC GCCTGCTGGG TTCCGGCCCG
CGCTCAGGCC TTGGCGAGCT GGAATTGCCG GCCAACGAAC CGGGCAGTTC GATCATGCCC
GGCAAGGTCA ACCCGACGCA GGCCGAAATG CTCACGATGG TGGCGGCGCA GGTGATCGGC
AATCATCAGG CCGTGACCGT GGGAGGGATG CAGGGGCATC TGGAACTCAA CGTGTTCAAG
CCGATGATCG GCGCCGCCGT GCTGCGATCG GTCCACCTGC TGTCGACCGG CATGACAAGC
TTTGCCACCC GCTGCGTCGA AGGGATCGAG CCGAACCGGC GGCGGATCGC GGACCTTGTC
GAACGCTCGC TCATGCTTGT GACCGCGCTG GCGCCGGAGA TCGGGTACGA CAACGCGGCG
AAGATCGCCA AGCACGCGCA CGAGCATGAC CTGAGCCTGC GACAGGCGGC GCTGGCGCTG
GGACTGGTTG ACGAGGCGAC GTTCGACCGG CTTATCCGAC CGGGGGACAT GGTCTAG
 
Protein sequence
MHRIETDSLG EVRVPADAYW GAQTQRSIEN FPFGEMERMP IGIIRALALV KRAAAGVNRA 
HGLDPTLADA IERAAAEVVD GKLDDQFPLV IWQTGSGTQS NMNANEVIAG RANEILTGKR
GGKSPVHPND HVNMGQSSND TFPTALHVAA VEALNRDLLP ALGRLRGALA GAALAWKDIV
KIGRTHLQDA TPLTLGQEFS GYAEQLRGIE AHLKAAETHL LRLAQGGTAV GTGLNAPEGF
GDAMAAEIAA LTGRPFVSAP NKFEALASND ALVQFSGTLS TLAVALTKIA NDIRLLGSGP
RSGLGELELP ANEPGSSIMP GKVNPTQAEM LTMVAAQVIG NHQAVTVGGM QGHLELNVFK
PMIGAAVLRS VHLLSTGMTS FATRCVEGIE PNRRRIADLV ERSLMLVTAL APEIGYDNAA
KIAKHAHEHD LSLRQAALAL GLVDEATFDR LIRPGDMV
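
The sequences above are displayed in space-separated blocks of ten characters. A minimal Python sketch of joining such blocks into one contiguous string and computing a GC fraction follows. The helper names `join_blocks` and `gc_fraction` are hypothetical, and the sample uses only the first and last lines of the gene sequence rather than the full record, so its GC value is not expected to match the 66% reported for the whole gene.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace so the blocks form one contiguous sequence."""
    return "".join(text.split())


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in seq (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    seq = seq.upper()
    return (seq.count("G") + seq.count("C")) / len(seq)


# Sample only: first and last lines of the gene sequence shown above.
sample = """
ATGCACCGGA TCGAAACCGA CAGCCTGGGC GAAGTGCGCG TGCCCGCAGA TGCCTATTGG
GGACTGGTTG ACGAGGCGAC GTTCGACCGG CTTATCCGAC CGGGGGACAT GGTCTAG
"""

seq = join_blocks(sample)
print(len(seq), round(gc_fraction(seq), 2))
```

Pasting the full gene sequence into `sample` would allow the result to be compared against the GC content field in the record.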