Gene Noc_1357 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_1357 
Symbol 
ID3706121 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp1503452 
End bp1504654 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content46% 
IMG OID637737852 
Producthypothetical protein 
Protein accessionYP_343381 
Protein GI77164856 
COG category[S] Function unknown 
COG ID[COG3016] Uncharacterized iron-regulated protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.690238 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAAAATC GAGTAATCAC CCCCTTTTTC CTTCTATTAT TGTTTATCGT TTTCTTCTCT 
GTATGGGTGC GGCCTTTGAC GGCGGCAACC GTTGTAAGTG GACAAGCACA TGGAAAGGCT
AACGAGCTGG AAGCGCAACA GGCCCTTGAA ATGGATGCCT CCACCAAGGC GGTTAATTTG
AGGCAGTTGC TAGATTTGGA GGCAATTATT CCTAAGTTAG CGCGCCACCA GGTTATCTTT
GTTGGGGAAC AGCATCCCCG TTTTGATCAC CATTTGAATC AGCTTGCCAT AATTCGTGGC
CTTCACGGGA TACACCCAAA ATTAGTGATC GGCGTTGAAT TTTTTCAACA ACCGTTTCAG
CAGTATCTGG AGCAATTTGT CGCGGATCAA CTCACCGTAG AAGAATTTTT AAAAAAAACG
GAATATTATG ATCGTTGGCG CTACGACTTC CGGCTGTATG CACCGATACT TGAATTTGCC
CGGAAAAATA GTATTCCTAT ATTAGCCCTC AATGTGCCTA CCGAACTTAT ACAGAAAGTG
GGCCGGGAGG GTTTGGAGGG GCTTTCTGAG AAGGAAAGAG CTCAACTTCC CTCCGAAATT
GACCGTTCCA GTGTGGCCTA CCGCGAGCGA TTGCAAGAGG TGTTCGAAAA CCATCCTCAG
CATTTTGGAA AGTTCGAGAC TTTTTATGAG GCTCAATTAG TATGGGATGA GGCAATGGCG
GAGAGCGCTA GCCGCTATCT TAAGGATCAC TCTGATTCCC ATATGATTGT TCTGGCCGGC
AATGGCCATT TGGCTTATGG CGTAGGTATT CCAGAGCGCC TTAATCGGCG TCTTGATACG
ACGGCTAGCG TGGCTATTGT GTTGAACGAT TGGGAGGGGC TTGTTGAGCC AGACATAGCG
GATTATTTAT TACTCTCTGA GAAAAAAGAA CTGCCAAAGG CCGGCTTTTT AGGGGTTATG
CTCAAGCAAT CTAGCGGAAA GCTTGAAGTC AATGCTTTTT CCGAGATCAG CGCGGCCAAA
ACTGCTGGAA TTGAGGAAAA GGACGAGCTC CTTTCTTTGA ATGGGCGCCT TGTTTCTGAT
ATGTCGGATG TAAAGGAAGT AATGTGGGAT AAAAAACCCG GTGAGGAAGT TCTCGTTAAG
GTGCGCCGTG GGGCTTTTAT GGGTAAGGAT GAAGAATTGG AATTTGAGAT AAAATTAAAA
TAA
 
Protein sequence
MQNRVITPFF LLLLFIVFFS VWVRPLTAAT VVSGQAHGKA NELEAQQALE MDASTKAVNL 
RQLLDLEAII PKLARHQVIF VGEQHPRFDH HLNQLAIIRG LHGIHPKLVI GVEFFQQPFQ
QYLEQFVADQ LTVEEFLKKT EYYDRWRYDF RLYAPILEFA RKNSIPILAL NVPTELIQKV
GREGLEGLSE KERAQLPSEI DRSSVAYRER LQEVFENHPQ HFGKFETFYE AQLVWDEAMA
ESASRYLKDH SDSHMIVLAG NGHLAYGVGI PERLNRRLDT TASVAIVLND WEGLVEPDIA
DYLLLSEKKE LPKAGFLGVM LKQSSGKLEV NAFSEISAAK TAGIEEKDEL LSLNGRLVSD
MSDVKEVMWD KKPGEEVLVK VRRGAFMGKD EELEFEIKLK