Gene Noc_2880 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_2880 
SymbolureC 
ID3705593 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp3259662 
End bp3261362 
Gene Length1701 bp 
Protein Length566 aa 
Translation table11 
GC content54% 
IMG OID637739356 
Producturease subunit alpha 
Protein accessionYP_344856 
Protein GI77166331 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0804] Urea amidohydrolase (urease) alpha subunit 
TIGRFAM ID[TIGR01792] urease, alpha subunit 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAATTT CTCGGCACGC TTATGCGGAT ACCTATGGCC CTACTACAGG AGACCGGGTG 
CGTTTAGCCG ACACGGATCT GTGGATCAAA GTGGAACGGG ATTTAACGAC CTATGGAGAT
GAGGTTAAAT TTGGTGGTGG CAAGGTCATC CGTGATGGGA TGGGCCAAAG CCAGCTTTCC
AGGGATCAGG TCATGGATTT GGTGATTACC AATGCTCTCA TTGTAGACCA TTGGGGAATC
ATCAAGGCTG ATGTGGGCAT CAAGGATGGT CGTATCGCGG CGATAGGCAA AGCCGGGAAC
CCTGACACCC AACCTAACGT AGATATTAAT ATTGGGGCCG CGACTGAAGT TATTGCTGGG
GAAGGCAAGA TTCTCACCGC AGGAGGTATC GATAGCCATG TTCATTTTAT CTGTCCCCAG
TTAGTGGAAG AAGCGCTAAC CTCGGGGGTG ACTACCCTGA TTGGCGGCGG CACAGGCCCA
GCAACTGGCA CCAATGCGAC TACCTGCACG CCGGGGTCAT GGAATATTGG ACGGATGCTC
CAGGCGGCGG ATGCTTTCCC CATAAATATG GGTTTTTTAG GTAAGGGCAA TGCCAGTTTA
CCCCAATCCC TTGAGGAGCA GGTACGGGCG GGGGTACTCG GTCTTAAGTT GCACGAAGAC
TGGGGTACCA CGCCAGCCGC CATTGATAAT TGTTTGAGCG TGGCGGAGCG TTTCGATGTG
CAGGTGACTA TTCATACGGA TACCCTCAAT GAATCCGGTT TCGTTGAGGA CACGATTGCC
GCCTTCAAGG ACCGCACTAT TCATACCTAT CATACGGAAG GTGCCGGGGG TGGCCACGCG
CCGGATATCA TCAAGGCTTG CGGCGAGGCG AACGTACTGC CTAGTTCCAC TAACCCGACC
ACTCCCTTCA CGGCAAATAC CATAGATGAG CACCTAGACA TGTTGATGGT TTGCCATCAC
CTGGACCCTT CCATTCCGGA AGATGTGGCC TTTGCCGAAA GCCGTATTCG GCGGGAGACG
ATTTCCGCCG AGGATGCCCT ACATGATATG GGAGCGTTGA GTATGCATGG TTCCGACTCC
CAGGCCATGG GCCGGGTGGG GGAGGTGATC TTGCGCACTT GGCAGAGCGC ATCAGTTATG
AAACGCGACC GGGGTACCTT ACCGGAGGAT AAGGGGGATC ATGATAATTT TCGCATTAAG
CGCTATATTG CCAAGTACAC GATTAACCCG GCGATCACCC ATGGAATAGC CCACGAGATC
GGCTCGATAG AAGTGGGGAA ACTGGCGGAT TTGGTGCTGT GGAAACCAGC CTTCTTTGGG
GTTAAGCCAA GTTTGGTGCT AAAAGGAGGT GTGATTGTCA CTGCTCCCAT GGGGGATCCC
AACGCCTCTA TTCCCACCCC GCAGCCGGTG CATTACCGTC CCATGTTCGG CAGTTTTGGC
GGCAGCCGCA CCGCGAGCTG CGTCAGCTTT GTTTCCCAAG CAGGGTTAGA TGAAGGCATC
GGCGAGAAGC TGGGCCTCCA GAAGCGTCTT GTGGCGGTGA AAAATATTCG CGGCCTGCGC
AAAAGCGATC TGATCCATAA CAACGCGCTG CCGCGCATAG AAGTGGACCC TCAGAACTAT
CAGGTGCGTG CCGATGGCCA GCTTTTATGG TTCGAGCCAT CTAAGGTTCT GCCCATGGCC
CAACGTTATT TTCTATTTTA G
 
Protein sequence
MKISRHAYAD TYGPTTGDRV RLADTDLWIK VERDLTTYGD EVKFGGGKVI RDGMGQSQLS 
RDQVMDLVIT NALIVDHWGI IKADVGIKDG RIAAIGKAGN PDTQPNVDIN IGAATEVIAG
EGKILTAGGI DSHVHFICPQ LVEEALTSGV TTLIGGGTGP ATGTNATTCT PGSWNIGRML
QAADAFPINM GFLGKGNASL PQSLEEQVRA GVLGLKLHED WGTTPAAIDN CLSVAERFDV
QVTIHTDTLN ESGFVEDTIA AFKDRTIHTY HTEGAGGGHA PDIIKACGEA NVLPSSTNPT
TPFTANTIDE HLDMLMVCHH LDPSIPEDVA FAESRIRRET ISAEDALHDM GALSMHGSDS
QAMGRVGEVI LRTWQSASVM KRDRGTLPED KGDHDNFRIK RYIAKYTINP AITHGIAHEI
GSIEVGKLAD LVLWKPAFFG VKPSLVLKGG VIVTAPMGDP NASIPTPQPV HYRPMFGSFG
GSRTASCVSF VSQAGLDEGI GEKLGLQKRL VAVKNIRGLR KSDLIHNNAL PRIEVDPQNY
QVRADGQLLW FEPSKVLPMA QRYFLF