Gene Noc_1984 details

Gene Information

Locus tag: Noc_1984
Symbol:
ID: 3704868
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 2279740
End bp: 2281341
Gene Length: 1602 bp
Protein Length: 533 aa
Translation table: 11
GC content: 44%
IMG OID: 637738460
Product: lipopolysaccharide biosynthesis
Protein accession: YP_343976
Protein GI: 77165451
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis
TIGRFAM ID: [TIGR03007] polysaccharide chain length determinant protein, PEP-CTERM locus subfamily
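
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 2279740
end_bp = 2281341

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with a stop codon that is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1602
print(protein_length)  # 533
```

Both results agree with the listed Gene Length (1602 bp) and Protein Length (533 aa).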


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCATGAGC TGTGGTTACA GCTATATAGT TATTGGTGCG GGATTTGGCG TTACCGTTGG 
TATGGCGTTC TAATGGCATG GGTGGTGGTG ATTGTGGGTT GGACATATGT GATCCAGATG
CCTGATAGAT ATGAATCCTC AGCACGAGTT TATGTCGATA CTGATTCGCT ACTGCGCCCC
CTTTTGAAAG GATTGGCTAT TCAGCCTAAT GTAGACCAAA GGCTTAGAAT AATGACTCAG
ACTTTATTAA GCCGGCCTAA TTTAGGAAAG GTACTCCGTC AAACCGATAT GGATTTGTCC
GTGACCACGC CGGAGCAGGA AATCAAACTC CTTAACCAGC TTGAAAAAAA TATCCATATA
AAAGGGGCGA GAAGAGATAA TCTATATACA ATTGCATATG AAAATACTGA TCCTCAACTT
GCACAGCGGG TGGTGCAGGC CATATTGAAT ATTTTTGTGG AAAGCACTAT GGGTGCCTCC
CGTAAAGATA GTAATACTGC TCAACAGTTC ATTGGCCAAC AAATTAAAGA ATATGAAAAA
CTGCTTCGTA CTGCTGAACA AAGGTTAATG GACTTTAAGC GGGAGCATGT AGGCATGATG
CCCAATGAAA AGGGCGATTA TTATCAGCGT TTGCAGACTG CGGTAGAGAA TTTACGTACG
GCGCGCACTG AGCTTAATAT GGCGATAGGG CGCCGGGATG TACTTAAACG TCAATTGAGA
GGAGAGGAAC CGGTTTTTGG ATTTGGAACA GGTGGAACAG GAACCTCGTC GAAAGATAGT
AGTCCTGCGG GTATGCGTAT TCAATCCTTG CAAGCGGAAC TAGATGAGGT GTTGCTCAAG
TACACGGATA AACATCCAAA GGTTTCGGCG ATTAAAGAAA CTATTGCGAT GTTACAAAAA
AGAGAAGAGC AGCAATTTTC TTTGCCGCAG AAGCAGCAAG GTGAAGGAGA AGAAGCTAAT
GAAGCGGGAG AATATGCAGT GGGGGGTAAC TTTTATTATC AGCAAATGCA GATTTCTCTG
GCAGAAGCGG AAGCCAATAT TGCTTCTCAA GAAGCGGAAG TTAGCGCCTT AGAGAAAGAT
GTAGAACGGT TGCATGAGCT CGTCGATACT ATTCCTAAGG TAGAAGCGGA ATTAGCTCAG
CTAAATCGTG ACTATGGGGT TTATAAAAGC AACTATCAGC AACTATTAAC CCGCCTAGAA
TCAGCGAAAA TGGGTGAAAG GGTGGAAGAG TCGCCAGATA ATGTTAAGTT TAAAATTGTG
GAGCCGCCTA TACAGCCGCT CCTCCCTTCT GGTCCTGACC GCCCCTTGTT ATTAACGTTA
GTTCTGGTGG TTGCGGGAGG TGCCGGAGGG GCTTTGGCAT TTTTTCTTTC CCAATTAAGG
CCCGTTTTTT ATACCCGCCG GGATTTAGAA GAAGCGACGG GGCTTCCCGT CCTAGGCCCG
GTATCAATGA TATTATCAGG GCGTATTTTA TGGAAGCATC GGCTTAATCT GGCGTTTCTT
CTTACTTTTC TAGGTCTTCT CATCGCTGGA TATGGATTAT TGGTATCCAA TTATCTCTTT
GGTATCAAAA TGTTCGACAC AATTAAGCAT TCGCTTTTTT AG
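
The gene sequence is printed in blocks of 10 bases, so it needs to be cleaned before any computation. As a rough sketch (the helper names `clean_sequence` and `gc_fraction` are hypothetical, not part of the record), the GC content field could be recomputed like this and compared with the listed 44%:

```python
def clean_sequence(raw: str) -> str:
    """Strip the spaces and line breaks used for display and uppercase the bases."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First display row of the gene sequence, copied as printed.
first_row = "ATGCATGAGC TGTGGTTACA GCTATATAGT TATTGGTGCG GGATTTGGCG TTACCGTTGG"
print(len(clean_sequence(first_row)))  # 60 bases per full row
print(f"{gc_fraction(first_row):.0%}")  # GC content of this row only
```

Passing the full sequence block instead of `first_row` gives the value for the whole gene.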
 
Protein sequence
MHELWLQLYS YWCGIWRYRW YGVLMAWVVV IVGWTYVIQM PDRYESSARV YVDTDSLLRP 
LLKGLAIQPN VDQRLRIMTQ TLLSRPNLGK VLRQTDMDLS VTTPEQEIKL LNQLEKNIHI
KGARRDNLYT IAYENTDPQL AQRVVQAILN IFVESTMGAS RKDSNTAQQF IGQQIKEYEK
LLRTAEQRLM DFKREHVGMM PNEKGDYYQR LQTAVENLRT ARTELNMAIG RRDVLKRQLR
GEEPVFGFGT GGTGTSSKDS SPAGMRIQSL QAELDEVLLK YTDKHPKVSA IKETIAMLQK
REEQQFSLPQ KQQGEGEEAN EAGEYAVGGN FYYQQMQISL AEAEANIASQ EAEVSALEKD
VERLHELVDT IPKVEAELAQ LNRDYGVYKS NYQQLLTRLE SAKMGERVEE SPDNVKFKIV
EPPIQPLLPS GPDRPLLLTL VLVVAGGAGG ALAFFLSQLR PVFYTRRDLE EATGLPVLGP
VSMILSGRIL WKHRLNLAFL LTFLGLLIAG YGLLVSNYLF GIKMFDTIKH SLF
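
The protein sequence corresponds to a translation of the gene sequence under translation table 11. The sketch below is one way to reproduce that mapping without external libraries; it assumes the codon assignments of table 11 match the standard genetic code (the two differ in permitted start codons, which does not matter here because the gene starts with ATG). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a nucleotide sequence, stopping at the first stop codon.

    Display spaces and line breaks are removed first. Ambiguous bases
    such as N are not handled and raise KeyError.
    """
    seq = "".join(raw.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First display row of the gene sequence, copied as printed.
first_row = "ATGCATGAGC TGTGGTTACA GCTATATAGT TATTGGTGCG GGATTTGGCG TTACCGTTGG"
print(translate(first_row))  # MHELWLQLYSYWCGIWRYRW
```

The output matches the first 20 residues of the protein sequence above.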