Gene Noc_2369 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_2369 
Symbol 
ID3704809 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp2716002 
End bp2717945 
Gene Length1944 bp 
Protein Length647 aa 
Translation table11 
GC content55% 
IMG OID637738852 
Producthypothetical protein 
Protein accessionYP_344357 
Protein GI77165832 
COG category[N] Cell motility 
COG ID[COG1256] Flagellar hook-associated protein 
TIGRFAM ID[TIGR02492] flagellar hook-associated protein FlgK 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCATCAA GCGATCTTTT AGGAATAGGT GCTTCCGGGC TTTTAGCCGC GCAGCGGGCC 
CTGGCCACAA CTAGTCATAA TATTGCCAAT GTGAATACGC CGGGTTTTTC TCGGCAACGG
ACGGAACTGG CTGCCCGCTT GCCCGAGTTT ACCGGGCAGG GATTTATCGG TACCGGGGTG
GATGTCACCA CAGTGCGCCG CGCCTACGAT TCTTTTCTGA CCGAGCAGGC GCGGTACTCC
ACTTCGGAGT ATGCTCAATC CAAGGTTTTC CATGATCTAG CAGCCCAGGT AGATAATCTT
TTTGCGGATT CTGACACGGG GCTCTCTGCT TCTTCGCAGC GATTTTTTAA TGCCACGCAG
GAGTTGGCAA ACGATCCCTC CAGTTCGGCG GCACGCCAGG TGCTGCTTGC CGAGGGTGGG
GCCTTTGCGG CCCGTGTTCA TTCTCTCGGT GACCGGTTGG GTGAACTGGA CGAGGATGTC
AATACCCGGT TGCGCGATAC TGTTGCGGAA GTGAATACGC TCTCTTCCTC CATTGCCGGG
CTCAATCAGC AAATCCGCGA CCTGCGTGGG CAAAATAATA ACCAGCCGCC TAACGATTTA
CTGGATCAAC GGGATCAGTT AATTCAAGAT CTTTCCCAAA AAGCAGCGGT CACCGTGCTT
CCCCAGGACG ATGGCAGCCT CAACGTTTTT ATTGGTAAAG GGCAATCCCT GGTAACAGGA
GACCACAGCC ATTCCCTAAC CACTTTGGCC AACCCCTATG AGGCATCCCG CCTGGAAGTG
GGTTATGCTG CCAAGGGCGG GGTTGCCCCC ATTTCTGACT CTATTCAGGG CGGTGAATTA
GGGGCTCTGT TGAGCTTCCG CGATGAAGTT TTGTCGTCTT CCCGCAATGC TTTGGGGCAA
CTAGCGGTGG GAGTCGCCCA ATCCTTCAAC GAGCAGCACC GCCTGGGGGT GGACCTTCAG
GGAGAACTCG GTGGCGATTT TTTTGCTGCC ATCGATTCCA ATACGGCGGT TTCGTTGCCA
CGCGCTGATA ATACTGGCGA TGGCGTTATC GAAATCGCTA TTAACGATGC CAGCAAGCTG
ACCGATAGCG ATTATCGCTT GGACCGGAAT GGGGCGGGGT TTACCCTGAC CCGGCTGTCG
GATAATCAAG CTTTCTCCTT AAGTACCTTT CCCGGCAGCG CGGAAACTGT CGATGGGCTG
ACCTTAAATT TAACCTCTGG CTCCATTAAT GGGGGCGATA GCTATCTGAT TCAGCCTACC
CGGGCGGCGG CGCAACAGTT TGGGGTGATG CTTACCGATT CTGCCCGCAT TGCCGCCGCA
GGACCCATCC GTACCGAGGC AAACCTTGGC AACAGAGGCA CGGGGCAAGT TTCAGCCGCT
GCGGTAACCG CTACTGCCGG CCTTCCCTTG CCGTCTAATG GAGAGGTGAC CTTGACCTAT
GACGCCGCAG CGCGGCAGTT CAATGTAAGC GGCGGTCCAG GAGGGACCCT GGATTTCGAT
CCGGCTACCG AAAGTAATGG CAAGGAATTT CATCTCCCCA GCGTGGGGGG ACTGAATTTT
ACGGTTTCCG GAGTTCCCGC GGATGGGGAT ACCTTTATGC TCCAGAATAA TACAGGGGGC
GTTGGGGATA ACCGTAACGC CCTCAGCCTT GCTGGATTAC AAACCAAGCC TGTTTTTCAG
GACGGTACAA CGACCTACCA GGAGCAGTAT GGTCGCTTGG TGGCGGATGT GGGCGCCCGC
ACCCGCCAAG CGGAAGCAAA CCAGGATACC CATAAAACCT TACTTGATCA AGCTGTTGCA
GCAAGAGAAG GGGTATCGGG AGTGAACCTG GAAGAGGAAG CGGCGAATTT AATCCGCTTC
CAGCAGGCTT TTCAAGCCGC CGCCCGGGTG ATCTCAACCG CCGATACCAT GTTTCAGACT
TTATTAGGCG CGGTAGGTAG ATAA
 
Protein sequence
MASSDLLGIG ASGLLAAQRA LATTSHNIAN VNTPGFSRQR TELAARLPEF TGQGFIGTGV 
DVTTVRRAYD SFLTEQARYS TSEYAQSKVF HDLAAQVDNL FADSDTGLSA SSQRFFNATQ
ELANDPSSSA ARQVLLAEGG AFAARVHSLG DRLGELDEDV NTRLRDTVAE VNTLSSSIAG
LNQQIRDLRG QNNNQPPNDL LDQRDQLIQD LSQKAAVTVL PQDDGSLNVF IGKGQSLVTG
DHSHSLTTLA NPYEASRLEV GYAAKGGVAP ISDSIQGGEL GALLSFRDEV LSSSRNALGQ
LAVGVAQSFN EQHRLGVDLQ GELGGDFFAA IDSNTAVSLP RADNTGDGVI EIAINDASKL
TDSDYRLDRN GAGFTLTRLS DNQAFSLSTF PGSAETVDGL TLNLTSGSIN GGDSYLIQPT
RAAAQQFGVM LTDSARIAAA GPIRTEANLG NRGTGQVSAA AVTATAGLPL PSNGEVTLTY
DAAARQFNVS GGPGGTLDFD PATESNGKEF HLPSVGGLNF TVSGVPADGD TFMLQNNTGG
VGDNRNALSL AGLQTKPVFQ DGTTTYQEQY GRLVADVGAR TRQAEANQDT HKTLLDQAVA
AREGVSGVNL EEEAANLIRF QQAFQAAARV ISTADTMFQT LLGAVGR