Gene Nmul_A2482 details


Gene Information

Locus tag: Nmul_A2482
Symbol:
ID: 3784831
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 2833232
End bp: 2834506
Gene Length: 1275 bp
Protein Length: 424 aa
Translation table: 11
GC content: 59%
IMG OID: 637812573
Product: dihydroorotase
Protein accession: YP_413163
Protein GI: 82703597
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0044] Dihydroorotase and related cyclic amidohydrolases
TIGRFAM ID: [TIGR00857] dihydroorotase, multifunctional complex type
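
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length counts the stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of amino acids for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon is the stop codon and encodes no amino acid.
    return gene_len_bp // 3 - 1


length = gene_length_bp(2833232, 2834506)
print(length, "bp")
print(protein_length_aa(length), "aa")
```

Under these assumptions the values agree with the Gene Length and Protein Length fields of this record.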


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACATCG CCATCCGCAA TGGTCGCGTC ATCGACCCGA AAAACAGCTT CGATCGCGTA 
ACGGACATTT ACATCCAGTC AGGCAAAATC GCCTCTCTCG GAGCCGTCCC GCCAGGCTTT
GAGGCACGCC GGGAAATAAA CGCGCAGGGA TTGATCGTGT GTCCCGGGCT TGTGGATCTG
TCGGCCCGGC TACGGGAACC CGGCCTGGAA TACAAGGCGA CGCTTGAATC GGAAATGGGA
GCCGCGGTGG CGGGGGGTGT TACAAGCCTG GCATGTCCCC CCGATACCGA TCCTGTGCTG
GACGAACCCG GGCTGGTGGA AATGCTGAAG TATCGCGCAA GGAGCCGCAA TCAGACGCGT
GTGTATCCCA TTGGAGCGCT TACCCGTGGA TTGAGAGGGG AATGGCTGAC GGAGATGGCT
GAGCTGCACA GCGCGGGATG CGTTGCATTC GGCCAGTCGG ACAGGCCGCT TCCCAACAAC
CGGGTGCTCA TGCAGGCAAT GCAGTATGCC TCCACCTTCG GGTTTTGCCT GTGGCTGCGT
CCCCAGGATG TAAATCTCGC TGACGGCGGA GTTGCCCACG ATGGGGAAGT GGCAACGCGC
CTTGGGTTGG CCCCCATTCC CGTGTGCGCC GAAACTGTCG CCCTGTCCCA CATCATCCTG
ATGGCAAAAG AAACGGGCGC CCGGGTGCAC TTGTGCCGTA TCTCCAGCGC GGAAGGTGTG
ACCATGACAC GCGCCGCCCG CAAGCAGGGA TTATCTATTA CCTGCGACGT TGCGGCCAAT
CACGTCCACC TGTCTGAAAT GGATATCGGT TTCTTCGATT CCAATTGTCA TCTGGTGCCA
CCGTTGAGAA GTCTGGGAGA CCGCGACGCC TTGCGCGCAG GACTGCTGGA TGGCACTATA
GATGCCATAT GCTCCGACCA CGCTCCCGTG GACGAGGATG CAAAGCTGCT GCCCTTTGCC
GAGGCCGAGG CCGGCGCAAC CGGCCTCGAA TTACTCCTGC CGCTTACACT GAAATGGGCG
GCGGAAACGA AGCTGCCCCT GGTAGTCGCA CTATCGAAGA TCACGAGGGA ACCTGCCCGC
ATCCTTGGGG TGGAAGCCGG CCATCTCACG CCCGGGGGCA ATGCCGATCT GTGCATTTTC
GACCCTGATC ATTACTGGAC GATTGAAGCG CCGACACTCA AAAGCCAGGG CAAGAACACG
CCATTCCTGG GCTGGGAACT GCAAGGCAAG GTAAAATATA CGCTGATCAA CGGAAACGTT
GTTTACGTGG ACTAG
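
A GC percentage such as the one reported for this gene can be recomputed from the sequence text. This is a minimal sketch, assuming the sequence is supplied as plain text in the 10-base blocks shown above; `gc_content` is a hypothetical helper, and the way the database rounds its reported value is not known.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases, ignoring whitespace between blocks."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Example on the final line of the gene sequence only (not the whole gene).
last_line = "GTTTACGTGG ACTAG"
print(f"{gc_content(last_line):.1f}% GC")
```

Passing the full gene sequence instead of a single line gives the gene-wide value.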
 
Protein sequence
MNIAIRNGRV IDPKNSFDRV TDIYIQSGKI ASLGAVPPGF EARREINAQG LIVCPGLVDL 
SARLREPGLE YKATLESEMG AAVAGGVTSL ACPPDTDPVL DEPGLVEMLK YRARSRNQTR
VYPIGALTRG LRGEWLTEMA ELHSAGCVAF GQSDRPLPNN RVLMQAMQYA STFGFCLWLR
PQDVNLADGG VAHDGEVATR LGLAPIPVCA ETVALSHIIL MAKETGARVH LCRISSAEGV
TMTRAARKQG LSITCDVAAN HVHLSEMDIG FFDSNCHLVP PLRSLGDRDA LRAGLLDGTI
DAICSDHAPV DEDAKLLPFA EAEAGATGLE LLLPLTLKWA AETKLPLVVA LSKITREPAR
ILGVEAGHLT PGGNADLCIF DPDHYWTIEA PTLKSQGKNT PFLGWELQGK VKYTLINGNV
VYVD
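
The protein sequence corresponds to the gene sequence translated with translation table 11. The sketch below shows that mapping, assuming the codon assignments of NCBI table 11 (identical to the standard code for internal codons). It does not handle alternative start codons, which is harmless here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG ("*" marks a stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


first_line = "ATGAACATCG CCATCCGCAA TGGTCGCGTC ATCGACCCGA AAAACAGCTT CGATCGCGTA"
last_line = "GTTTACGTGG ACTAG"
print(translate(first_line))  # MNIAIRNGRVIDPKNSFDRV
print(translate(last_line))   # VYVD
```

The two outputs match the first 20 and the last 4 residues of the protein sequence listed above.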