Gene Nmul_A2471 details

Gene Information

Locus tag: Nmul_A2471
Symbol: fumC
ID: 3784820
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 2824597
End bp: 2825985
Gene Length: 1389 bp
Protein Length: 462 aa
Translation table: 11
GC content: 57%
IMG OID: 637812562
Product: fumarate hydratase
Protein accession: YP_413152
Protein GI: 82703586
COG category: [C] Energy production and conversion
COG ID: [COG0114] Fumarase
TIGRFAM ID: [TIGR00979] fumarate hydratase, class II
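
The coordinates and lengths listed above are mutually consistent, which a little arithmetic confirms. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; the record states neither convention explicitly. The variable names are illustrative only.

```python
# Values copied from the gene record above
start_bp = 2824597
end_bp = 2825985

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted in the protein length
protein_length = gene_length // 3 - 1

print(gene_length)     # 1389, matching "Gene Length"
print(protein_length)  # 462, matching "Protein Length"
```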


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATTACC GTGATGAACG GGATACCATG GGCTCAATAC AGGTGCCCGC GCACGCGCTG 
TGGGGCGCGC AGACCCAGCG CTCCTTGCAG AATTTCAGAA TTTCAGGCGA ACGCATGCCG
CCTGCGCTGC TTCACGCGCT GGCACTCGTG AAGCGGGCGG CAGCTGCCAT AAACCGTGAT
CTGGGGGTTC TGGATGACGG GCGCGCTACC GCCATCATCC AGGCTGCGGA TGAGGTGATC
GCAGGCGATC ATGATGCAGA GTTTCCGCTG GTCGTGTGGC AGACCGGCTC GGGCACCCAG
ACCAACATGA ATATGAACGA AGTGCTGGCA AATCGCGCTT CTGAACTGCT GGGAGGCGGA
AGAGGAGGGG ATTGCAGGAT CCATCCCAAC GACGATGTAA ACAAGGGGCA GTCATCCAAT
GACGTGTTTC CCTCTGCCAT GCATGTGGCG GCCGCCTGCT CCCTCAATCA TCAGTTGAAG
CCTGGCATTG CCGCGTTGCG GCATACGCTG GCCGCCAAAG CGGAGGCATT TTCAGGCATT
GTCAAAATCG GCCGCACCCA TTTGCAGGAT GCTACTCCCC TCACCTTGGG GCAGGAATTC
TCGGGGTATG TTTCCCAACT GGACCACGGG TTGGGGCATG TCGACGCCGC GCTTCCGCAT
GTATATGAAC TGGCTCTGGG AGGGACGGCG GTGGGTACCG GATTGAATGC CCACCCTGAG
TTTGCGGTGC GGATCGCAGC GGAATTGGCA AGATTGACGG GCCTGCCTTT TGTGACAGCT
CCCAACAAGT TTGAAGCCCT CGCTGCCAAC GATGCGCTCG TCCATGCGCA CGGTGCGCTC
AAGACCCTGG CAGCCTCGCT CATGAAGATC GCCAATGATA TCCGCTGGCT TGCGTCGGGC
CCGCGCTGCG GAATCGGCGA ACTGAAAATT CCTGAAAACG AGCCGGGCAG TTCCATCATG
CCCGGCAAAG TCAATCCGAC ACAGTCAGAG GCGCTTACCA TGCTGTGCTG CCAGGTAATG
GGCAATGATG TCGCCATCAA TATGGGTGGC GCAATGGGGA ATTTTGAGCT CAACGTCATG
AAACCGCTTA TCATTCACAA TTTCCTGCAA AGCGTGCGCC TGCTCGCGGA TGGTATGATA
AGCTTCAACG AGCATTGCGC GATCGGAATC ACCGCAAACA TCGATCGCAT AGATGAATTG
CTGCGCAAAT CGCTGATGCT GGTGACTGCA CTCGCGCCCC ACATCGGCTA CGACAAGGCG
TCTGAAATCG CGAAGAAAGC GCATCGCGAA GGCACTACAC TGGAACAGGC AGCCGTTGCA
ACCGGCTATA TTACCGTCGA TCAATTTCAG GATTGGGTAA GACCGGAAGA CATGATTCAC
CCGCAATGA
 
Protein sequence
MNYRDERDTM GSIQVPAHAL WGAQTQRSLQ NFRISGERMP PALLHALALV KRAAAAINRD 
LGVLDDGRAT AIIQAADEVI AGDHDAEFPL VVWQTGSGTQ TNMNMNEVLA NRASELLGGG
RGGDCRIHPN DDVNKGQSSN DVFPSAMHVA AACSLNHQLK PGIAALRHTL AAKAEAFSGI
VKIGRTHLQD ATPLTLGQEF SGYVSQLDHG LGHVDAALPH VYELALGGTA VGTGLNAHPE
FAVRIAAELA RLTGLPFVTA PNKFEALAAN DALVHAHGAL KTLAASLMKI ANDIRWLASG
PRCGIGELKI PENEPGSSIM PGKVNPTQSE ALTMLCCQVM GNDVAINMGG AMGNFELNVM
KPLIIHNFLQ SVRLLADGMI SFNEHCAIGI TANIDRIDEL LRKSLMLVTA LAPHIGYDKA
SEIAKKAHRE GTTLEQAAVA TGYITVDQFQ DWVRPEDMIH PQ
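
Both sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before further use. The following is a minimal sketch of one way to do that and to compute a GC fraction from the result. The helper names `join_blocks` and `gc_fraction` are hypothetical and do not come from any database tool.

```python
def join_blocks(listing: str) -> str:
    """Remove all whitespace (spaces and line breaks) from a block-formatted sequence."""
    return "".join(listing.split())


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a DNA string (0.0 if empty)."""
    dna = dna.upper()
    if not dna:
        return 0.0
    return (dna.count("G") + dna.count("C")) / len(dna)


# Illustrative input: the first two blocks and the final short block
# of the gene sequence listing, not the full gene.
example = "ATGAATTACC GTGATGAACG \nCCGCAATGA"
seq = join_blocks(example)
print(seq)
print(gc_fraction(seq))
```

If the listing is complete, joining the full gene sequence should give a string of 1389 bases, and joining the protein sequence should give 462 residues, in line with the lengths reported in the Gene Information section.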