Gene Noc_0174 details


Gene Information

Locus tag: Noc_0174
Symbol:
ID: 3706207
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 193581
End bp: 194666
Gene Length: 1086 bp
Protein Length: 361 aa
Translation table: 11
GC content: 52%
IMG OID: 637736691
Product: chorismate mutase
Protein accession: YP_342237
Protein GI: 77163712
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0077] Prephenate dehydratase
        [COG1605] Chorismate mutase
TIGRFAM ID: [TIGR01807] chorismate mutase domain of proteobacterial P-protein, clade 2
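
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene span includes the stop codon, which is not counted in the protein length. The function names gene_length and protein_length are hypothetical helpers introduced for illustration, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the span ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(193581, 194666)
print(length, protein_length(length))  # prints: 1086 361
```

Under these assumptions the computed values agree with the reported gene length (1086 bp) and protein length (361 aa).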


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGACGATA GCCACCAATT ACAGGAAATT CGGGCGCGTA TAGACGCTTT GGATGAACAG 
CTTCAATGTC TCATCAATGA ACGCGCCGAG CTTGCCCGCC AAACGGCGCA GATAAAACAA
GCAGCTGGCT TGGGGGAAAA TTGTTTTCGC CCGGAGCGGG AAGCTGAAAT TTTGCGACGG
GTTATCGCAC GCAACCAGGG ACCACTCAGT GGGCAAGAAA TGGCCCGCTT ATTTCGGGAG
ATTATGTCGG CCTGTCTGGC CCTTGAAACG CCCCTGGTGA TTGCTTATTT AGGTCCAGAG
GGAACCTTTA CCGAGGCTGC AGCGCTTAAG CATTTTGGCC ATTCAGTGAA AACCCAACCG
CTTATGGCCA TTGATGAGGT TTTCCGTGAA GTGGAGGCAG GTACAGCCTA CTATGGGGTC
GTCCCGGTAG AAAACTCCAC CGAAGGAGCC GTGACCCACA CCTTAGATCG GTTTTTAGTC
TCGCCCTTAC AGATTTGTGG TGAAGTGGAG TTGCGCATCC ACCATCATTT GCTTAGCAGA
AACCAAACCA TTGCCGAAGT AAACCGGTTA TATGCCCATC AGCAAACATT GGCACAATGC
CGAGAGTGGT TAGATGCTCA CCTGGCAGGA TGTGAGCGCA TTCCAGTAAG CAGCAATGGG
GAAGCGGCGC GGCGGGCTGG GGATGAATCC GATTGTGCCG CTATTGCGAG TGACCGGGCC
CGTGAAATTT ATGGGCTTCA CGCTTTAGCG ACCAATATTG AGGATGAGCC TGGCAATACC
ACCCGTTTCC TTGTAATTGG CTCCCAAGCC GTGGTTGCTA GCGGGAATGA CAAAACGTCG
TTGCTAGTCT CAGGTCCGAA TCGCTCCGGC TTGCTGTATG ATCTGCTGTC TCCCTTGGCA
GAGTATGGCA TTAGCATGAC CCGGTTGGAG TCCCGTCCCT CACGGCGCCA ACTTTGGGAA
TATGTGTTTT TTATTGATGT TGAAGGACAT ATAGACGATT CTAATCTAAC TACCGCGCTG
GCTACTCTCA AAGAGCGGGC CTCCTTTCTC AAATTATTAG GCTCTTATCC ACGGGCGGTA
ATATAA
 
Protein sequence
MDDSHQLQEI RARIDALDEQ LQCLINERAE LARQTAQIKQ AAGLGENCFR PEREAEILRR 
VIARNQGPLS GQEMARLFRE IMSACLALET PLVIAYLGPE GTFTEAAALK HFGHSVKTQP
LMAIDEVFRE VEAGTAYYGV VPVENSTEGA VTHTLDRFLV SPLQICGEVE LRIHHHLLSR
NQTIAEVNRL YAHQQTLAQC REWLDAHLAG CERIPVSSNG EAARRAGDES DCAAIASDRA
REIYGLHALA TNIEDEPGNT TRFLVIGSQA VVASGNDKTS LLVSGPNRSG LLYDLLSPLA
EYGISMTRLE SRPSRRQLWE YVFFIDVEGH IDDSNLTTAL ATLKERASFL KLLGSYPRAV
I
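
The sequences above are printed in blocks of 10 characters, so they need to be joined before any computation. The following is a minimal sketch of how the gene sequence could be cleaned, translated, and checked for GC content. The helper names clean_sequence, translate, and gc_percent are hypothetical. The codon table is the standard one written in NCBI's TCAG ordering; translation table 11 shares these codon assignments and differs in its permitted start codons, which this sketch does not handle (an assumption that is harmless here because the gene starts with ATG).

```python
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
# Map each codon to its one-letter amino acid ('*' marks a stop codon).
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence above.
fragment = clean_sequence("ATGGACGATA GCCACCAATT ACAGGAAATT")
print(translate(fragment))  # prints: MDDSHQLQEI
```

Passing the full gene sequence through clean_sequence and then translate should reproduce the protein sequence listed above, and gc_percent on the same string can be compared with the reported GC content.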