Gene Noc_2368 details


Gene Information

Locus tag: Noc_2368
Symbol:
ID: 3704808
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 2714796
End bp: 2715998
Gene Length: 1203 bp
Protein Length: 400 aa
Translation table: 11
GC content: 52%
IMG OID: 637738851
Product: flagellin-like
Protein accession: YP_344356
Protein GI: 77165831
COG category: [N] Cell motility
COG ID: [COG1344] Flagellin and related hook-associated proteins
TIGRFAM ID: [TIGR02550] flagellar hook-associated protein 3
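
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (an assumption that the reported Gene Length supports) and that the coding sequence ends in a single stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length(2714796, 2715998)
print(length)                  # 1203
print(protein_length(length))  # 400
```

Both results match the Gene Length (1203 bp) and Protein Length (400 aa) fields in this record.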


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGCGTATCT CCACTTCTTT ATTTCAGCAA CAAAGCATTG ATGGGATGCT TCGCCAGCAA 
GCCCAGGTAA GTAAAACCCA GCAGCAAATA GCCAGTGGCG AGCGTATGCA AACGCCGGCC
GATGATCCTA TTGCGGCCGC CCGCTTGCTA GAAGTCCGTG AAGCTTCCGG GAGAACCGCC
CAGTTTCAAA CAAACGCTGA CCGGGCGACA GCCCGTTTGT CCCAAGAGGA AAATGCCCTG
GCGGGAGTTA ATAATGTATT GCAGGGAGTG CGGGAGCTGG CGGTGCAGGC CAATAATGGC
GCTCAAAACA ATGAGAATCG GGCTATCATT GCCCAAGAGG TTCGGCAGCG CCTCAACGAA
CTGGTAGGGC TGGCGAATAG TCAAGATGCC AGTGGCGAAT ATCTTTTTGC TGGCGCTAAG
GGTCGCTCCC AGCCTTTTAT TCAAGAAGGG GGAAGCGTTT CTTATCAGGG GGATCAGGCC
CAGCGTCTGA TCTCTATTGG CCCTTCGGTG CAAGTGGCGG ATAGTCACTC TGGCTCCGAG
GTGTTTTTAG CCATCCGCGA GGGTAATGGC GTTTTTGCCA CCGAAGCGAA CCCTTCAAAT
ACGGGTTCAG GAGTGATTGC GCCCGGTTCG GTCAATGGAG CTTTCATTCC TGACAATTAT
ACCCTGCAAT TTTCCCAGGC GACGCCTGAT GATCCCCTTA CCTACCAAGT GTTGGACTCC
CAGAATACTG TTGTGGCTAA TGGTGGTTTC GCTAGCGGGG AAGAGATTAC TTTTGGCGGT
GCCCAGGTAA GTATTACCGG CATTCCCGCG GATGGAGACA GTTTCACCCT CCATGCAAGT
GCTCACCGGG ATATGTTCAC TATCGCCCAG CATTTTATTG AGGCTCTGGA GCGGCCAATA
AATGATACGG CCAGCCAGGC TCGATTTCAT AATGATATGA ACAGGGCCCT CACCGATCTG
GATCAAGCCA TGGGCAAGAT TTTGGAAGTC CGGACAGAGG TGGGTACTCG CCTTAATGCC
GTCGATAGGG AACGCCAAGT AAATGAAGAG GCTAGCTTGC AATTGGCTAG GGAGCAATCT
TCGCTTAATG ATCTGGATTT GGCTGAAGCT ATTGGGCGCT TGAACCAGCA GTTAACGGGA
CTTGAGGCCG CCCAGCGGAC TTACGCCCGT TTGCAGGGAT TATCCTTGTT TAATTTTCTA
TAA
 
Protein sequence
MRISTSLFQQ QSIDGMLRQQ AQVSKTQQQI ASGERMQTPA DDPIAAARLL EVREASGRTA 
QFQTNADRAT ARLSQEENAL AGVNNVLQGV RELAVQANNG AQNNENRAII AQEVRQRLNE
LVGLANSQDA SGEYLFAGAK GRSQPFIQEG GSVSYQGDQA QRLISIGPSV QVADSHSGSE
VFLAIREGNG VFATEANPSN TGSGVIAPGS VNGAFIPDNY TLQFSQATPD DPLTYQVLDS
QNTVVANGGF ASGEEITFGG AQVSITGIPA DGDSFTLHAS AHRDMFTIAQ HFIEALERPI
NDTASQARFH NDMNRALTDL DQAMGKILEV RTEVGTRLNA VDRERQVNEE ASLQLAREQS
SLNDLDLAEA IGRLNQQLTG LEAAQRTYAR LQGLSLFNFL
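
The protein sequence can be related back to the gene sequence by translating it codon by codon. The sketch below is illustrative only: it builds the codon table by hand rather than using a library, and the names `CODON_TABLE` and `translate` are hypothetical. Translation table 11 assigns the same amino acids to codons as the standard code and differs in which codons may act as start codons. Since this gene starts with ATG, that difference does not matter here, but the sketch does not handle alternative start codons or ambiguous bases such as N.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGCGTATCT CCACTTCTTT ATTTCAGCAA"))  # MRISTSLFQQ
```

The output matches the first ten residues of the protein sequence listed above. Passing the full gene sequence to the same function should give the full protein, with the terminal TAA read as the stop codon.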