Gene Noc_1940 details


Gene Information

Locus tag: Noc_1940
Symbol:
ID: 3705477
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 2219606
End bp: 2220733
Gene Length: 1128 bp
Protein Length: 375 aa
Translation table: 11
GC content: 55%
IMG OID: 637738416
Product: hypothetical protein
Protein accession: YP_343932
Protein GI: 77165407
COG category: [H] Coenzyme transport and metabolism; [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0189] Glutathione synthase/Ribosomal protein S6 modification enzyme (glutaminyl transferase)
TIGRFAM ID:
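
The coordinates and lengths listed above agree with one another, which can be checked with a short sketch. It assumes that the start and end coordinates are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming its final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2219606, 2220733)
print(length, protein_length(length))  # prints: 1128 375
```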


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTTTGGA GACTGCGCCA GCGCGTTAGC CGGTTTTGCA AAGATTTTGC CGCGTTGCCG 
GGGGTACTGT GGGAACATGG CAGGGCGGCG CGAAAGGCCT CTGGGAAGTC ACTACTGGGT
CAGGCTTCCG AGATGATTGC CCTGCGCTGG GGCAAGGGCC GCTTGGCTCC TGATGAATAC
TATCAGTATT GTTTATTTGA TGATCGGCGT TTCAGCCCCG AGCAGAAGCG GACTTTCCTT
GGTCGGCACA TGCAGTACGA TCTATGGGAG CTGTTCGATT CCTGGTCATG GCATGCCATT
GCGAACGACA AGCTGGTGGC ATGCAGCCTA TTTGAAGCCT TGCAATTGCC GTCGCCCAAA
TTGTACGGGT TCTTTCATCC GATACGCCGC CATGGCGCCT TGCCTATAGT GCGAAACGGG
GCGCAGCTCG GGCAGTTTCT TCGTGAACAG GCGCCATTCC CTTTGGTTGC CAAGCCTGTG
CTTGGAATGT GGGGTAAAAA TGTATACGCC ATCGAGCGCC TTGAGCATGA AAGCGATGAA
CTGGTGCTGG TTAATGGGAA GCGTATGGCT ATAGCGGATT TTGTGGCTGC TCTTGAGCCC
CTGGTGAAAC AAGGGTGGCT CTTTCAGGAG CTTTTGAAGC CACATCCCAT GCTATTGGAA
CTATGTGGTA ACCGCATTTG CAGTGTCCGC GTAGTGACCC TGCTGGACCC GGCGCCTATC
ATAATTAGCA CTCTCTGGAA AGTCGCTGTG GGCAACGCAA TGGCTGATAA TTATTGGGAG
CCTGGAAATT TAGTAGGGCC TATCGACCCT GAGACGGGAG TCGTGGGGCA GATGTTTACG
GGTTTGGGGT TACAACGCCG CAATGTTTCC GAGCACCCGG ATACGGGGGA GAAGCTGGTA
GGGATTACTT TGCCCAACTG GGAGCAGACG CTGGAACTTT GCCGGGAGGG CACGGCGTCA
TTGCCAGGCC TAAAAATGCA GGCGTGGGAT ATTGCGTTGA CCGATCGGGG GCCGGTGATG
CTCGAAGTCA ATATCATCGG CGGGGTACGC TTGCCCCAGC TGGTAGTTGA TGCAGGCATG
AATCGAGGTC CATTGAGAGA GATGCTGCGC AAACATAGAT ATATGTAG
 
Protein sequence
MFWRLRQRVS RFCKDFAALP GVLWEHGRAA RKASGKSLLG QASEMIALRW GKGRLAPDEY 
YQYCLFDDRR FSPEQKRTFL GRHMQYDLWE LFDSWSWHAI ANDKLVACSL FEALQLPSPK
LYGFFHPIRR HGALPIVRNG AQLGQFLREQ APFPLVAKPV LGMWGKNVYA IERLEHESDE
LVLVNGKRMA IADFVAALEP LVKQGWLFQE LLKPHPMLLE LCGNRICSVR VVTLLDPAPI
IISTLWKVAV GNAMADNYWE PGNLVGPIDP ETGVVGQMFT GLGLQRRNVS EHPDTGEKLV
GITLPNWEQT LELCREGTAS LPGLKMQAWD IALTDRGPVM LEVNIIGGVR LPQLVVDAGM
NRGPLREMLR KHRYM
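
The relationship between the two sequences can be illustrated with a minimal sketch that translates the gene sequence and computes its GC fraction. It is an assumption-laden example, not the database's own method: the helper names (clean, translate, gc_fraction) are hypothetical, and the codon table is the standard one. Translation table 11 assigns the same amino acids to codons as the standard table and differs only in which codons may act as start codons, so this sketch does not handle alternative start codons. That limitation does not matter for this gene, which begins with ATG.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered by first, second, third base in TCAG order.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases and last 18 bases of the gene sequence shown above.
print(translate("ATGTTTTGGA GACTGCGCCA GCGCGTTAGC"))  # prints: MFWRLRQRVS
print(translate("AAACATAGAT ATATGTAG"))               # prints: KHRYM
```

Passing the full gene sequence to translate should reproduce the protein sequence listed above, and gc_fraction on the same input gives the value that the record reports rounded as GC content.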