Gene Noc_2159 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_2159 
Symbol 
ID3704833 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp2494696 
End bp2495829 
Gene Length1134 bp 
Protein Length377 aa 
Translation table11 
GC content53% 
IMG OID637738635 
Productflagellar biosynthetic protein FlhB 
Protein accessionYP_344149 
Protein GI77165624 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG1377] Flagellar biosynthesis pathway, component FlhB 
TIGRFAM ID[TIGR00328] flagellar biosynthetic protein FlhB 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGGACT CTAGTGCTCA GGAGCGCACG GAACAACCGA CTCCAAAGCG CCAGGAAGAG 
GCTCGAAAAA AAGGCCAGGT TCCCCGTTCA CGGGAACTTA GCACGACAGT TTTGCTGCTG
AGCTCCGCCC TGGGGCTGGT GTTGATAGGG GAGCCTTTGC TGCAGGGGCT AGCCGATCTA
ATGCGCCAGG GTTTGCAGTT GGAACGCGCC CAGATTTTTG AGCCGAAGGC CGCTATTCTA
CAGTTTCAGC AAGGAGTAGG TGAAGCGGTC AAAATAATAA CGCCTTTTTT GGCTTTAACA
CTTATCGCGG CGTTGGCCGC CCCTCTTTTA ATGGGGGGTT GGAGTTTTAG CGCTCAGTCC
TTGGGTTTTA AGTGGGAGAA ACTAAATCCG GCCAAGGGCA TGAAGCGAAT TTTTGGCCCT
CAAGGCGGAA TGGAACTACT CAAGGCATTG ATTAAATTCT TGCTTCTAAG TGGTGTAGGT
TGTCTACTGT TTTGGCTTTT TAGTCCCGAT TTGATCGCTC TGGGAAGGCA ACCGTTTGTT
CCAGCCGTAT TTCAATTAGC CCACTTGATG GGATGGAGTT TAGTGGGCCT TTCCGCCAGT
CTTGCGCTTA TCGCGGTGAT CGATGCTCCC TTTCAAGGAT GGAATCATAC CCGCCAGCTC
AAGATGACGC GGCAGGAGGT CAAAGAGGAA CATAAAGAAA CTGATGGCAA CCCGGAGCTC
AAGGGGCGGA TTCGTCGTGT TCAACGGGAA ATAGCAAGCC GCCGCATGAT GGCGGCGGTT
CCCCAGGCCG ATGTGGTGGT GGTCAACCCT ACCCATTACG CGGTAGCCCT GAATTACGAG
CAGGATAAAC AGGGTGCCCC GCGAGTAGTT GCTAAGGGGG TTGATCAGGT GGCCATCAAA
ATTCGAACGG TAGCGGCAGG TAATAACGTA CCCGTACTTT CCGCACCGGC TTTGAGCCGT
GCCATTTACC ACAGTACCAA GCTAGACCAA GAAATTCCCG CCGGACTTTA CCGTGCTGTG
GCGCAGGTGC TGGCTTATGT CTTGCAACTG CGCCAGTACC AACGCCGGGG TGGCCCCCGG
CCCCAACCTA TTCCAAATGA ATTTCCAATT CCTGAAGACT TAAGACGGGA TTAA
 
Protein sequence
MADSSAQERT EQPTPKRQEE ARKKGQVPRS RELSTTVLLL SSALGLVLIG EPLLQGLADL 
MRQGLQLERA QIFEPKAAIL QFQQGVGEAV KIITPFLALT LIAALAAPLL MGGWSFSAQS
LGFKWEKLNP AKGMKRIFGP QGGMELLKAL IKFLLLSGVG CLLFWLFSPD LIALGRQPFV
PAVFQLAHLM GWSLVGLSAS LALIAVIDAP FQGWNHTRQL KMTRQEVKEE HKETDGNPEL
KGRIRRVQRE IASRRMMAAV PQADVVVVNP THYAVALNYE QDKQGAPRVV AKGVDQVAIK
IRTVAAGNNV PVLSAPALSR AIYHSTKLDQ EIPAGLYRAV AQVLAYVLQL RQYQRRGGPR
PQPIPNEFPI PEDLRRD