Gene Nwi_1100 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNwi_1100 
Symbol 
ID3676437 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrobacter winogradskyi Nb-255 
KingdomBacteria 
Replicon accessionNC_007406 
Strand
Start bp1209848 
End bp1211716 
Gene Length1869 bp 
Protein Length622 aa 
Translation table11 
GC content66% 
IMG OID637712650 
Producthypothetical protein 
Protein accessionYP_317714 
Protein GI75675293 
COG category[N] Cell motility 
COG ID[COG1256] Flagellar hook-associated protein 
TIGRFAM ID[TIGR02492] flagellar hook-associated protein FlgK 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.782435 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.13317 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTTTGA GTTCGGCCCT CTCGATCGCA ATGTCCGGCC TGCGCGCCAA CCAGGCCGCG 
CTGTCGATCG TGTCCGGCAA CGTCGCCAAC GCGAACACGC CGGGCTACGT TGCGCAGACG
CTGGTGCAGG ACCAGGTCGT GACCGGCGGC ATGGGTTCCG GCGTCCGCGT TATCGGGGTC
AACCGCACGC TCGACGCCTA TGTGCAGTCG CAACTGCGCA CGGAAAATGC CGGCGGCGCT
TATGCGAGCC ACATCGCCGG GGTGCTCGGC CAGTTGCAGA ACGTCTATGG CACGCCGGGC
AACGACGGCA CGCTGGAGGC GGCCTATAAC CGGTTCACCT CGGCATTGCA GGCGCTGTCG
GCGAGTTCGG GCAATCCGGC GGCGCAGTCG TCGGTGCTGA ATGCCGCGCA GGCGCTGGCG
CACCAGCTCA ACGCGACCAC CAGCGGCATT CAGACGCTGC GATCCAACGC CGAGCAGAGC
CTGTCCGCAT CGGTCTCGCA GGCCAATGCG GCGATGCAGC AGATCGCGCA GCTCAATCAG
CGGCTGCAAA GCATGGGTGC GCAGGATCCG GCGGCCGCGA CGCTGATGGA TCAGCGCGAC
GCCGCGATCG ATCAGCTTTC AGGGTTGATG GACATCCGCG CGGTGACGGA CGGCGCCAAC
CAGACCTCGG TGTTCACCAC GGCCGGCGTC CAGCTCGTCG GCGGCGTTTA TGCGTCGACG
CTGACATTCA ACGCGCAGTC CTCGCTCACC GCGAGTTCGC AATGGAACGC CGATCCCGCG
AAGTCGAGCG TCGGCACCAT CATCTGCCAG CTTCCGAACG GCGCCAGGAT CGACATGATC
GCTTCGCAAG GCATCAACTC CGGCCAGATC GCCGCCGATG CCCAGTTGCG CGACAAGATT
CTGGTGCAGG CGCAGAACCA GGTGGACCAG ATGGCGGCGA CGCTCGCCAG CGCGTTGTCC
GACGTCACCA CGAGCGGCGC GCCGGTGACC GGACCGCCAT CGGGCTTCGA CCTCGATCTG
TCCGGCATTC TGCCGGGCAA CGCATTCCGC GTCACCTACA CCGACTCCGC TAACGTCTCG
TACACCGTCA CCGTGGTGCG GGTGGACGAT CCGTCCGCGC TGCCGATTCC GAACGCGGCC
GCGAATCCGA ACGACCGGGT GATCGGCGTC GATTTCTCGG GCGGGCTCGC CTCGGTCGTA
TCGCAACTGA ATGCGCAGAT CGGGGCGGCG TCGCATCTGA TGTTTTCCGC TTCCGGTTCG
CTGTTACGGG TGGCCGACGA CGGCTCAGGA CAATCCACGG TGAGCGCGGC ATCGGCCACG
ACCACGGTGC AGTCGCTGGC GTCCGGCGAT CCGCAACTGC CGTTCTTCAC CGACCGCGCC
TCGCTTTATA CCGGCGCGAT CACAGGCTCC GCGTCGCAGA TCACGGGGTT GGCGGGGCGC
ATTCAGGTCA ACGCCGCGCT GCTCGCCGAT CCCTCGAAGC TCACGGTCTA CAGCACCTCG
CCGGTAACGC CGGCCGGCGA CACCACGCGC GCCGACTTCA TCTACGACAG GCTGACGTTC
GCGAGCTTCA GCTACGCGCC TAACACCGGA CTTGGCGCTG CGGCAGCGCC ATTCCAGGGC
ACGCTGTCGT CATTCCTGCA GCAGTTCGTC AGCCTGCAGA GCGGCGCGGC GACGACGGCG
CGGCAGGTCG CGCAGGGGCA GAGCATCGTC GTCAACACCT TGCAGGAGAA ATTCGACGAC
AAGTCCGGCG TGAACATGGA TACCGAAATG GCCAATCTGA TCGCTTTGCA GAACTCTTAC
GCGGCGAATG CGCACGTGAT GACCGTGGTG CAGAGCATGA TGCAGACGCT GATGCAGGCG
CAATGGTAA
 
Protein sequence
MSLSSALSIA MSGLRANQAA LSIVSGNVAN ANTPGYVAQT LVQDQVVTGG MGSGVRVIGV 
NRTLDAYVQS QLRTENAGGA YASHIAGVLG QLQNVYGTPG NDGTLEAAYN RFTSALQALS
ASSGNPAAQS SVLNAAQALA HQLNATTSGI QTLRSNAEQS LSASVSQANA AMQQIAQLNQ
RLQSMGAQDP AAATLMDQRD AAIDQLSGLM DIRAVTDGAN QTSVFTTAGV QLVGGVYAST
LTFNAQSSLT ASSQWNADPA KSSVGTIICQ LPNGARIDMI ASQGINSGQI AADAQLRDKI
LVQAQNQVDQ MAATLASALS DVTTSGAPVT GPPSGFDLDL SGILPGNAFR VTYTDSANVS
YTVTVVRVDD PSALPIPNAA ANPNDRVIGV DFSGGLASVV SQLNAQIGAA SHLMFSASGS
LLRVADDGSG QSTVSAASAT TTVQSLASGD PQLPFFTDRA SLYTGAITGS ASQITGLAGR
IQVNAALLAD PSKLTVYSTS PVTPAGDTTR ADFIYDRLTF ASFSYAPNTG LGAAAAPFQG
TLSSFLQQFV SLQSGAATTA RQVAQGQSIV VNTLQEKFDD KSGVNMDTEM ANLIALQNSY
AANAHVMTVV QSMMQTLMQA QW