Gene Noc_2365 details

Gene Information

Locus tag: Noc_2365
Symbol:
ID: 3704805
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 2710354
End bp: 2712387
Gene Length: 2034 bp
Protein Length: 677 aa
Translation table: 11
GC content: 51%
IMG OID: 637738848
Product: flagellar hook-associated protein 2
Protein accession: YP_344353
Protein GI: 77165828
COG category: [N] Cell motility
COG ID: [COG1345] Flagellar capping protein
TIGRFAM ID:
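
The coordinate and length fields above can be cross-checked against each other. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes a single terminal stop codon. The function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2710354
END_BP = 2712387
GENE_LENGTH_BP = 2034
PROTEIN_LENGTH_AA = 677


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one stop codon (assumes an unspliced CDS)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks agree with the listed values: 2712387 - 2710354 + 1 = 2034 bp, and 2034 / 3 - 1 = 677 aa.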


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATTACTT CGACAGGTAT CGGTTCCGGG CTGGATGTGG AAAGCCTGGT GAACCAATTG 
GTGGCAGCGG AAGGGCGGCC CCAGCAATTG CGCCTCAACC GCCAGGAAGC AAAAATGCAA
GCGACTCTTT CCGCCATGGG CACATTAAAG AGTGAGCTGG CAAGCTTCAA GGATGCGGTT
TCAAAGCTTG ATTCGCCTTC GGCTTTTCAG GCCATCAAGG CCAGCGTTGG CAATAGGGAT
TTATTGACTG CTTCGGCAGC GGCGAAAGCG GCCACTGGCA CCTACAGTGT CGAAGTTCAG
CAATTAGCCC AGGCGCAAAA ACTCGCTTCC AAAGCCTTCG CCGATTCTAG CGCTGCCTTG
GGGACGGGAA CCCTGAGTTT TCGTTTCGGT AGCTATGACA ATAACACCAA TACTTTTACC
GGAAATCCTG ATAGAACCGC CCAAATCGTG ACTATTGATA GCGCCCATAA TAGTTTGAGC
GGCATTCGGG ATGCGGTTAA TGAAGCCGAT ATCGGAGTCC AGGCTAACAT TATCAATGAT
GGAACGGGGG ATCGCCTGGT TTTTACCGCT CAAACCATGG GTCTCGCTAA TAGCTTGGAA
ATTACCGTAA GCGATGAGGA TGGCAATAAT CTGGATGAGA GTGGCCTGTC TCAATTGGCT
TATGATCCGA CCGGAGCAGA TTCAGGGAGC GGGAAAAGTT TAACGGAGAC AGTAGCCGCT
CAAGATGCCC GTCTAGTGGT CGATGGCTTG ACCGTTACTC GTCCCCAAAA CACGATTACC
GGGATGATCG AGGGGGTAAC TTTAGAGCTC AACAGCGCTG AACTTGATTC CCCCACGACA
CTTACGGTGG CGGCTGATGG GCACATAGCC AGCAAATCGG TGAATGGATT TGTAGAGGCT
TTTAATAGTT TGGCGAAGAC CTTAAATTCT CTTTCTTCCT ATAATCCGGA AACTCAAGAA
AAAGGCCCTC TGCTTGGGGA TGCAAGTTTA CGCGGAATAG AAAACCGCCT CCGGCGGGTT
GCTAGCGATA CTGTTACCGG TCTGTCTGGC CCTTATCGAA CCTTAGCCGA TATTGGTATT
ACCACCCAGC GGGATGGCAC CCTGAAACTG GATGAAAGCA AGCTGCAACA GGTGGTTGAA
TCTGATCCCG AAGCGGTTGC CCGTTTATTT ACAGGAGGCG GCAGCAGCTC GGACCCGTTG
GTGCGGTTTG TAGAAGCTAC CGATGAGGCC CAAGCCGGAG AATTTGCGGT CAACGTTACT
CAGCCAGCAA CCCAGGGAAA ATATACGGGG AACGTCTTTT CTGGAAGCGG TTCACCCATC
ACTATTGATG CTGATAATGA TGAATTTACC CTTAAGGTAG ATGGGGGGGA GGCCGTGATG
ATTTCCTTGA CCCAGCAAAC CTATAATGAT GGGATGGCCC TTGCCCAGGA GATCCAAAGC
CAAATTAACG TTAATCCCAC TCTAAGCGGG GCCGGAAGCC AAGTAACCAT TGAATTCATA
AATGATCGCT TTGAAATTCG TTCTAGCCGT TATGGGAGTG GTTCTAACGT TGAAATCCTG
GCACTGGATC CTTCGCCCTC GGAGACCACT ACCCAAACCT TGGGTCTGAC GGTTAAAAGC
GGGAATGCGG GGGAAGATGT GGCTGGAACT ATCGGTGCTC AGGCGGCAAC AGGCTCGGGT
CGCTTTCTAA CTGGAAGTGA TAGGGCAGAA GGTATTCGTC TGGAGATCCT GGGAGAAAGC
TCGGGAGCGC GGGGGAGTAT CAGCTTTTCC CGGGGAGTAG CAGCCCATCT GAATACTTAT
TTGGATCAAG TACTGGATTC GGAAGGATTC TTAGAAAATC GCATTGATGG CTTAAATCAT
CGCATTGGTG ATTTCAGCGA GCAGCGGGAA GATTTGGTGC AGCGTTTGGA TGCTATTGAG
AAGCGTTACC GGGCACAGTT TACGGTACTG GATAGTCTGC TTGGACAACT CCAAACAACG
AGCAGTTTTT TAAGTCAGCA AATCGCTAAT TTACCCGGAA ATAGATCTGC CTAA
 
Protein sequence
MITSTGIGSG LDVESLVNQL VAAEGRPQQL RLNRQEAKMQ ATLSAMGTLK SELASFKDAV 
SKLDSPSAFQ AIKASVGNRD LLTASAAAKA ATGTYSVEVQ QLAQAQKLAS KAFADSSAAL
GTGTLSFRFG SYDNNTNTFT GNPDRTAQIV TIDSAHNSLS GIRDAVNEAD IGVQANIIND
GTGDRLVFTA QTMGLANSLE ITVSDEDGNN LDESGLSQLA YDPTGADSGS GKSLTETVAA
QDARLVVDGL TVTRPQNTIT GMIEGVTLEL NSAELDSPTT LTVAADGHIA SKSVNGFVEA
FNSLAKTLNS LSSYNPETQE KGPLLGDASL RGIENRLRRV ASDTVTGLSG PYRTLADIGI
TTQRDGTLKL DESKLQQVVE SDPEAVARLF TGGGSSSDPL VRFVEATDEA QAGEFAVNVT
QPATQGKYTG NVFSGSGSPI TIDADNDEFT LKVDGGEAVM ISLTQQTYND GMALAQEIQS
QINVNPTLSG AGSQVTIEFI NDRFEIRSSR YGSGSNVEIL ALDPSPSETT TQTLGLTVKS
GNAGEDVAGT IGAQAATGSG RFLTGSDRAE GIRLEILGES SGARGSISFS RGVAAHLNTY
LDQVLDSEGF LENRIDGLNH RIGDFSEQRE DLVQRLDAIE KRYRAQFTVL DSLLGQLQTT
SSFLSQQIAN LPGNRSA
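
The relationship between the gene sequence, the protein sequence and the GC content field can be sketched in Python as follows. This is an illustrative sketch, not the method used to produce the record. It assumes the sequence is pasted in the 10-base blocks shown above and contains only A, C, G and T. Translation table 11 assigns the same amino acids to codons as the standard code and differs in the permitted start codons. Because this gene starts with ATG, a plain codon lookup suffices here. The names `translate` and `gc_fraction` are hypothetical helpers.

```python
# Codon table in the conventional TCAG ordering; the amino acid
# assignments are shared by the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; upper-case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    # First line of the gene sequence in this record.
    first_line = (
        "ATGATTACTT CGACAGGTAT CGGTTCCGGG "
        "CTGGATGTGG AAAGCCTGGT GAACCAATTG"
    )
    print(translate(first_line))
```

Applied to the first line of the gene sequence, `translate` yields MITSTGIGSGLDVESLVNQL, which matches the first 20 residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.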