Gene Nmul_A1200 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A1200 
Symbol 
ID3784311 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp1386932 
End bp1388392 
Gene Length1461 bp 
Protein Length486 aa 
Translation table11 
GC content56% 
IMG OID637811285 
Productinosine-5'-monophosphate dehydrogenase 
Protein accessionYP_411895 
Protein GI82702329 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0516] IMP dehydrogenase/GMP reductase
[COG0517] FOG: CBS domain 
TIGRFAM ID[TIGR01302] inosine-5'-monophosphate dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.608786 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGACTGG TTGAGAAAGC TTTAACCTTT GACGATGTCC TGCTGCTTCC CGCCCACTCC 
GTTGTTTTAC CGCGCAATGT CAATCTGACA ACGCGACTCA CGCGCGAGAT CTCCCTCAAT
ATTCCTCTTG TATCAGCCGC CATGGATACC GTTACCGAGT CGCGGCTCGC AATCGCACTC
GCCCAGGAAG GGGGAATCGG CATCATACAC AAGAATATGC CGGCGGAATC CCAGGCTGCC
CAGGTCTCCA ATGTCAAGCG CTTCGAAAGC GGCGTGGTAA AAGATCCCAT TACCATTCCG
CCTGACATGA CGGTGCGGGA AGTGCTCAAC CTGATCCACA AATTCAGGAT TTCAGGCTTA
CCCGTGGTGG AAGGTTCGAA AGTGGTGGGG ATTGTGACAA ACCGCGATCT TCGCTTCGAA
ACCAACCTGG ATCAGCCGAT CCGGAATATC ATGACCTTGA AGGAGCGGCT GGTCACGGTG
AACGAAGGCG CCAGTCGTGA AGAAGCAATG GCCCTGCTGC ACAAATACCG GCTGGAGCGT
GTGCTCGTCG TGAATAACGA CTTCGAGTTG CGCGGACTGA TCACCGTGAA AGATATCATC
AAGACATCCG AACACCCAAA CGCCTGCAAG GACGAACAAG GCCGGCTGCG TGTTGGCGCT
GCGATAGGGG TTGGCGAAGG CAGTGAAGAG CGCGCGGAGG CGCTGGTCGA TGCAGGTGTG
GACGTGATTG TCGTGGATAC CGCACATGGA CACTCACAGG GTGTGCTTGA ACGGGTACGG
TGGGTAAAGA AACGGTTCCC TAAAATCCAG GTCATCGGCG GCAATGTAGG TACTGCCGCC
GCTGCCAGGG CTTTGGTGGA TCATGGCGCC GACGCGGTAA AGGTAGGAAT CGGTCCCGGT
TCGATATGCA CCACCCGCAT TGTGGCGGGA GTGGGAATAC CTCAGATTAC GGCCATCAAA
AACGTTTCTG CAGAACTGGC AGGCAGCGGC GTGCCGCTGA TCTCCGACGG CGGCATCCGC
TATTCGGGGG ACATTGCAAA GGCACTGGCC GCCGGGGCGA GTTCAATCAT GCTCGGGGGC
TTGTTCGCAG GCACGGAAGA ATCACCAGGA GAAATAGAAC TGTTCCAGGG GCGCTCATAC
AAGACCTATC GTGGAATGGG TTCACTTTCG GCAATGCAGC AGGGTTCGAG CGACCGCTAT
TTCCAGCAGG CCGAGCAGGA CTCCAGGAAG CTCGTGCCGG AAGGCGTGGA AGGCAGGGTT
CCTTTCAAGG GAAGCGTCAT CGCGGTCATC CATCAGTTGA TTGGCGGCGT ACGCTCGGGC
ATGGGCTATC TGGGCTGCGA AACAATCGAT GACATGCACG CCAAGGCAGA GTTTATCGAG
ATTACTGCCG CAGGCATTCG TGAATCCCAC GTCCACAACG TACAGATTAC CAAAGAGGCC
CCGAACTATC ATATCGATTA G
 
Protein sequence
MRLVEKALTF DDVLLLPAHS VVLPRNVNLT TRLTREISLN IPLVSAAMDT VTESRLAIAL 
AQEGGIGIIH KNMPAESQAA QVSNVKRFES GVVKDPITIP PDMTVREVLN LIHKFRISGL
PVVEGSKVVG IVTNRDLRFE TNLDQPIRNI MTLKERLVTV NEGASREEAM ALLHKYRLER
VLVVNNDFEL RGLITVKDII KTSEHPNACK DEQGRLRVGA AIGVGEGSEE RAEALVDAGV
DVIVVDTAHG HSQGVLERVR WVKKRFPKIQ VIGGNVGTAA AARALVDHGA DAVKVGIGPG
SICTTRIVAG VGIPQITAIK NVSAELAGSG VPLISDGGIR YSGDIAKALA AGASSIMLGG
LFAGTEESPG EIELFQGRSY KTYRGMGSLS AMQQGSSDRY FQQAEQDSRK LVPEGVEGRV
PFKGSVIAVI HQLIGGVRSG MGYLGCETID DMHAKAEFIE ITAAGIRESH VHNVQITKEA
PNYHID