Gene Noc_2367 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_2367 
Symbol 
ID3704807 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp2712931 
End bp2714409 
Gene Length1479 bp 
Protein Length492 aa 
Translation table11 
GC content54% 
IMG OID637738850 
Productflagellin 
Protein accessionYP_344355 
Protein GI77165830 
COG category[N] Cell motility 
COG ID[COG1344] Flagellin and related hook-associated proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.912497 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTCAGA CTATCAATAC CAATATTGCC TCACTTAATG CCCAGCGGAA TTTGAATAGC 
TCCCAAGGTG CGCTGCAGAC TTCACTAGAA CGTCTGTCCA GCGGTTTTCG GATTAACAAC
GCCAAGGATG ATGCTGCCGG GCTGGCGATT ACGGAGCGGA TGACCTCCCA GATTCGTGGC
CTTGCTCAGG CGACCCGTAA TGCGAATGAC GCTATTTCTG TGACCCAAAC CGCTGAGGGG
GGCTTGAAGG AGAGCAGCAA TATCCTGCAA CGGATGCGGG AGCTTGCCGT CCAGTCTGCT
AATGATACCA ACAGTGACTC GGATCGGGCC AATCTGCAAA AGGAGGTATC GCAACTACAG
TCGGAACTCA ACCGACTTGC GGATTCCACT ACTTTTAATG GTAAAAATCT ATTGGATGGT
TCCTTTACGG GACAGAAATT CCAAATTGGC GCCAATGCTA ACGAAAGCAT TGGTTTTTCC
ATTAATAGCG CCCGCGCCAC GACTCTTGGG GAGCAGCTAG GAAAGAGTAT TACCAATGTA
GGTGCGGGTC TAGCCGTAGC GGCGGATACC TCAGGCGGTA ATACCGTAGC TGCCCAGAAT
ATAACCGTCA ATGGTTCCAC GGGCTCTAAG ATGGTAGCGC TTACAGGCAA CGAGAGTGCC
AAGGCGATAG CTGATTTGGT CAATGAACAA TCCGGGAGTA CCGGCGTGAC GGCCTCCGCC
CAGACTTCGG TGACCCTAGA CAACGTGGCG GCCGATGGCA CGGTCTCTTT TACCCTCCAG
TCCAGCGGCG GCGGCTCGGC GGCGGCGATT TCCGCAGGGG TCACCACCAC TGATCTGACC
AATTTGGCCG ATGCCGTCAA TGCTCAAAGC GCCGAGACTG GCGTGACCGC CACGCTCAGC
GAAAACCGGG ACGCCATTAC CCTGGAAAAT GCCGAAGGCG AGGATATTTT GGTCTCGGAT
GCGGATAATA CGGGGGTCGC GGCAGCCGCG GCGGCATTTG ATACAGGGGG ACAGAGTTTA
ATCAAAACCG ATGGGGCCAC CGGTACGGCA AATGATAGTA TTGTGGTGGG TGGCCAAGTA
AGCTTCCAGT CCGATAAAAG CTTTACCACA ACCAGCGATA CCGGCAATAC GGTGGTGGGA
GCGGGTGGCG TGACCTCCGC CTTATCTTCG GTGGCCCAAA TCGATCTCTC CAGCCAGGAC
GGTTCCAACA GCGCCCTGTC CGTTATCGAT AAAGCTCTGG GTTCAATCGC TACCCAGCGG
GCAGATTTAG GTGCCCTGCA AAATCGTTTT GAGTCTACTA TTTCTAATTT ACAGAATGTT
TCCGAGAATA CTTCTGCTGC CCGTTCCCGC ATCCGGGATG CGGATTTTGC TTCCGAGACG
GCTGAAATGA CCCGCAATCA GATTCTCCAG CAGGCAGGTA CCGCTATGCT GGCACAGGCG
AATTCCCTGC CTCAGGGGGT TTTGAGCTTG TTGAGATAG
 
Protein sequence
MAQTINTNIA SLNAQRNLNS SQGALQTSLE RLSSGFRINN AKDDAAGLAI TERMTSQIRG 
LAQATRNAND AISVTQTAEG GLKESSNILQ RMRELAVQSA NDTNSDSDRA NLQKEVSQLQ
SELNRLADST TFNGKNLLDG SFTGQKFQIG ANANESIGFS INSARATTLG EQLGKSITNV
GAGLAVAADT SGGNTVAAQN ITVNGSTGSK MVALTGNESA KAIADLVNEQ SGSTGVTASA
QTSVTLDNVA ADGTVSFTLQ SSGGGSAAAI SAGVTTTDLT NLADAVNAQS AETGVTATLS
ENRDAITLEN AEGEDILVSD ADNTGVAAAA AAFDTGGQSL IKTDGATGTA NDSIVVGGQV
SFQSDKSFTT TSDTGNTVVG AGGVTSALSS VAQIDLSSQD GSNSALSVID KALGSIATQR
ADLGALQNRF ESTISNLQNV SENTSAARSR IRDADFASET AEMTRNQILQ QAGTAMLAQA
NSLPQGVLSL LR