Gene Noc_1185 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_1185 
Symbol 
ID3706759 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp1292819 
End bp1293874 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content50% 
IMG OID637737688 
Producthypothetical protein 
Protein accessionYP_343217 
Protein GI77164692 
COG category[S] Function unknown 
COG ID[COG3249] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGCAAG TTGCTTTGGC GATAGTGATA TTTTATGTGC TGCCATTAGA GGTTCAAGCG 
ATAGGAGCGG TGGATCTTTA TGAAGCCCAG GTGCCTGTAA GTAATCAAAC CCCCGAGGAG
CAGGCGAGGG CGGTGAAAGA GGCTTTTCAA AAGGTGTTAT TGAAAGTGAT GGGAAACCGG
CGTACCTTGG CTCGCGCACC CCTTGCCTTG TTGCTTGAGA AGTCCTCCAG TTTGGTTCAG
AAATTTCGCT ATAATGCTTC TGATGAGGAA AACGGTGCGG CGACCTTTTG GGTTCGTTTT
GACCCACTGG GTGTAGAGCA ATTGTTACGG CAGAAAGCGC TGCCCGTATG GGGCCAGGTC
CGTCCTATTT TATTGTTATG GGTTGCGATT GAAGAAGGCA GGCACCGCTA TTTGGTAGAT
GCAGACGGCA ATTTGCCTGC TGCCGAAATC CTGGAAGAGC AAGCAGGAGT ACGGGGGATG
CCCGTGATTT TACCTCTTTG GGATTTAGAG GATCGATCCC AGCTGTCTTT CAGCGATATT
TGGGGTAATT TCCCTGAGCC TATACTGGCT GCTTCCAAGC GCTATCCCGC TTCGGTACAG
TTAGTGGGGC GTTTGTCGCG CCAGAGTGAG GATGATTGGC AGGCCCGTTG GACTCTTTAC
GGCGTTGATA AAGCGCGAGA CTGGCGGGTT AATGGCGAGT TTGAACAGGT GTTGCGTGCT
GGTATCGATA AATCAGTAGA CACGATTGCT GCCCAGATAG TGCCGGCTAC CGGGAATAAT
TCACTATCCT CGGTGCAGGT TAGGGTGACC GGAGTAACCT CGTTTATGGA TTATGCTCGT
CTCTTCTCTT ATCTCAGCAG CCTTAGTCAA GTGATCACCA TGGAGCCAGT ACAATTATCC
CGGGCAGAAG CAAAATTTAG GCTTGAATTG CGGGGAAAGG CCGAGGGGTT AGCCACTTCC
ATCCGCTTTG GGAGGGTTTT GGTGCGAGCA ACGGAAGGTA TGGGCACAAA CATAGAGCCG
AATCACATGG AATTGAATTA TCGGCTATTA CCTTAA
 
Protein sequence
MKQVALAIVI FYVLPLEVQA IGAVDLYEAQ VPVSNQTPEE QARAVKEAFQ KVLLKVMGNR 
RTLARAPLAL LLEKSSSLVQ KFRYNASDEE NGAATFWVRF DPLGVEQLLR QKALPVWGQV
RPILLLWVAI EEGRHRYLVD ADGNLPAAEI LEEQAGVRGM PVILPLWDLE DRSQLSFSDI
WGNFPEPILA ASKRYPASVQ LVGRLSRQSE DDWQARWTLY GVDKARDWRV NGEFEQVLRA
GIDKSVDTIA AQIVPATGNN SLSSVQVRVT GVTSFMDYAR LFSYLSSLSQ VITMEPVQLS
RAEAKFRLEL RGKAEGLATS IRFGRVLVRA TEGMGTNIEP NHMELNYRLL P