Gene Nwi_2008 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNwi_2008 
Symbol 
ID3674196 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrobacter winogradskyi Nb-255 
KingdomBacteria 
Replicon accessionNC_007406 
Strand
Start bp2195887 
End bp2197188 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content67% 
IMG OID637713572 
Productdihydroorotase 
Protein accessionYP_318619 
Protein GI75676198 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0044] Dihydroorotase and related cyclic amidohydrolases 
TIGRFAM ID[TIGR00857] dihydroorotase, multifunctional complex type 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.471135 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGAGCG GCCATCGCCC GATCCTGCTC GCCAATGCCC GCATTGTCGA TCCCGCGCGC 
GATCTCGACG GCCCCGGCGA CGTGCTCATC GCCGATGGGG TTATTCGCGA TGCGCGGCAC
GGCATCGGCG CGGCGGGCGC GCCCGAAGGC ACCGACATCA TCAACTGCGC CGGCATGATC
GTCACGCCGG GCCTGATCGA CATCCGCGCC TTCGTCGGCG AACCGGGCGC GGGCCATCGC
GAAACCTTCG CATCGGCAAG CTGCGCCGCC GCCGCCGGCG GGATCACCAC CATTGTCTGC
CAGCCTGACA CCTCGCCGGT GATCGACAAT TCTGCCACCG TCGATTTCGT GCTGCGCCGC
GCGCGGGATA CCGCGATCGT CAACATTCAT CCGATGGCGG CGCTGACCAA GGGTCTTGCC
GGCAAGGAGA TGACCGAGTT CGGCCTCCTG AAGGAAGCGG GCGCGGTCGC CTTCACCGAC
GGCGCCCGCA GCGTGATGAA CGCGCAGGTG ATGCGCCGCG CCCTCACCTA TGCCCGCGAC
TTCGACGCGC TGATCGTCCA TCATACCGAG GATGCTCATC TCGTCGGCGA CGGCGTCATG
AACGAGGGCG AACTGTCTTC CCGCCTCGGA TTGACGGGTG TTCCGGCGAC CGCGGAAGCG
GTGATGCTGG AGCGCGACAT GCGGCTCGTC GGGCTGACCG GCGGCCGTTA CCACGCGGCG
TCCCTGACCT GCGCGGAATC GCTCGATATC CTCATGCGAG CGCGCGACGC CGGGCTCCCG
GTCAGCGCAT CCGCATCGAT CAATCATCTG ACGCTGAATG AGAACGATAT CGGTCCGTAC
CGGACGTTTC TGAAATTGTC GCCGCCGCTG CGCGGCGAGG ACGACCGACG CGCGCTGGTC
GCAGCGCTGG CGTCGGGCCT GATCGATGTG GTGATGTCCG ACCATAACCC GCAGGACGTG
GAAACCAAGC GGCTGCCGTT CGCGGAAGCC GCCGCAGGCG CGATCGGGCT GGAGACCATG
CTGACGGCCG CGCTGCGGCT CGTGCACAAC GCGGAACTCG ACTTCAAGAC GCTGATCCGG
GCGATGTCGA CGCGGCCCGC GGAGCTGCTG GGTTTGCCGG GCGGATCGCT GCGACCGGGC
GCGCCCGCCG ATGTCATCGT GATCGATCCT GACGTGCCAT GGATTCTCGA TCCCGCCGAT
CTCAAATCGC AATGCAAGAA TACGCCATTC GACGAATCCC GGTTCTCGGG TCGGGTTGTT
CGGACGATCG TCGGCGGACG CACGGTTTTC GAGCACGCCT GA
 
Protein sequence
MLSGHRPILL ANARIVDPAR DLDGPGDVLI ADGVIRDARH GIGAAGAPEG TDIINCAGMI 
VTPGLIDIRA FVGEPGAGHR ETFASASCAA AAGGITTIVC QPDTSPVIDN SATVDFVLRR
ARDTAIVNIH PMAALTKGLA GKEMTEFGLL KEAGAVAFTD GARSVMNAQV MRRALTYARD
FDALIVHHTE DAHLVGDGVM NEGELSSRLG LTGVPATAEA VMLERDMRLV GLTGGRYHAA
SLTCAESLDI LMRARDAGLP VSASASINHL TLNENDIGPY RTFLKLSPPL RGEDDRRALV
AALASGLIDV VMSDHNPQDV ETKRLPFAEA AAGAIGLETM LTAALRLVHN AELDFKTLIR
AMSTRPAELL GLPGGSLRPG APADVIVIDP DVPWILDPAD LKSQCKNTPF DESRFSGRVV
RTIVGGRTVF EHA