Gene Nwi_1105 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNwi_1105 
Symbol 
ID3677289 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrobacter winogradskyi Nb-255 
KingdomBacteria 
Replicon accessionNC_007406 
Strand
Start bp1217759 
End bp1219369 
Gene Length1611 bp 
Protein Length536 aa 
Translation table11 
GC content64% 
IMG OID637712655 
Producthypothetical protein 
Protein accessionYP_317719 
Protein GI75675298 
COG category[N] Cell motility 
COG ID[COG1344] Flagellin and related hook-associated proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0838477 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.21931 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGACA TCGTTCTCTC GGCATCCGTT CGCCAGAATT TGCTCTCGCT GCAATCCACC 
GCCGATCTGC TGGCGAGAAC GCAGCACCGC CTGGCCACCG GCAAGAACGT CAACACCGCC
CTCGACAATC CCACCAACTA TTTCACGGCC GCGGCGCTCG ACAATCGCGC CAACGATATC
AGCAACCTGC TCGACGGCAT CGGCAACGGC GTTCAGGTGC TGCAGGCGGC CAACACCGGC
TTGACCTCGC TCCAGAAGCT GGTGGATACG GCGAAGTCGA TCGCCAATCA GGTTCTGCAA
TCCCAGGTCG GGTATTCGCA GAAATCGTCG GTGGAAAGCC TTCCCGTAAC GGGGGCGACC
GCGGTCGATC TCAGAGGCAC CACACCGTTC ACGAATGTCG CCGCGACGCC CGGCGCGGCG
CTGAGGACCG GCACATCCAA CGCCGCGGCG ACCAGCGCCA CCCTGATCAA CGCGCTGACC
GACGGCGCGG TCGCGACGCC GACCGGCCCC GCCGCCGGAG AGAGCTTTGC CGTTAACGGC
AAGACCATCA CCTTTGTTGC TTCCGGCGCG GTGGCCGCCG ACGCTTCCGG GAACATCACG
ATCGGCATTG ACCAGACCCT GGACTCGCTG CTCGGGGTGA TCGACGTGCT GCACGGCAAC
ACCGACACGG CGTCGACGAT CGACGGCAGC GGGCAGATCG TGCTCAACAC CGGCGTTGCT
CACGACCTTA CATTGACGGA TTCAACGCCG GCCGGAGTTC TCGCAAAGCT TGGCCTCACC
GCGGCAACGA CGGCGCGCGG CGGAGGCGCG GGTAACCTCG ACGGCAAGAC GCTGACGATC
GGGCCGACCG GTGACGGCGT CGCGACCCAC ATCACATTCG GTGACGGCGC CAACGGGACC
GTCAGAACCC TCAACGACCT GAATGCGCAA CTCGCTTCAA ACCACCTTCA GGCCACGATC
AACGCCGCCG GCGTTATCAC CATCTCGACC GCCAACGAGG CGGCGGCCGA GACCATCGGT
GCGATCGGCG GGACCGCGGC CGGCGCGGGG TATGCCTTCC ACGGCCTTGT CGCGGGCGCG
CCTGTTCAGG ATCCGGCGGT TCAGGCGTCA CGCGCCGGGC TCGTCAATCA GTACAACAAC
ATCATTCAGC AATTAAACAC GACGGCGGCG GATGCGTCGT TCAACGGCAT CAACCTGTTG
GGTGGTGATA CGCTGAAGCT CATCTTCAAC GAGAGCGGGA AGTCCACGCT CACCATCACC
GGCGCCAGCA TGAATGCGGC TGGTCTTGGA CTGGCGAGTC TTACGGTCGG CGTCGACTTC
CTGGATAACG GCTCGGCCAA CAGCGTGATC GCGAGCCTCG ATAGCGCATC GATCACCCTG
CGCACGCAGG CTTCGACGCT CGGCTCGAAC CTGTCGATCG TGCAAATCCG TCAGGACTTC
TCGAAGAACC TGATCAACGT GCTTCAAACC GGCTCGTCGA ACCTCACGCT TGCCGACACC
AACGAGGAAG CCGCGAACAG CCAGGCGCTG GCGACCCGGC AGTCGATCGC GGTGTCCGCG
CTGGCGCTGG CCAACCAGTC GCAGCAGAGC GTGTTGCAGT TGCTGCGCTG A
 
Protein sequence
MSDIVLSASV RQNLLSLQST ADLLARTQHR LATGKNVNTA LDNPTNYFTA AALDNRANDI 
SNLLDGIGNG VQVLQAANTG LTSLQKLVDT AKSIANQVLQ SQVGYSQKSS VESLPVTGAT
AVDLRGTTPF TNVAATPGAA LRTGTSNAAA TSATLINALT DGAVATPTGP AAGESFAVNG
KTITFVASGA VAADASGNIT IGIDQTLDSL LGVIDVLHGN TDTASTIDGS GQIVLNTGVA
HDLTLTDSTP AGVLAKLGLT AATTARGGGA GNLDGKTLTI GPTGDGVATH ITFGDGANGT
VRTLNDLNAQ LASNHLQATI NAAGVITIST ANEAAAETIG AIGGTAAGAG YAFHGLVAGA
PVQDPAVQAS RAGLVNQYNN IIQQLNTTAA DASFNGINLL GGDTLKLIFN ESGKSTLTIT
GASMNAAGLG LASLTVGVDF LDNGSANSVI ASLDSASITL RTQASTLGSN LSIVQIRQDF
SKNLINVLQT GSSNLTLADT NEEAANSQAL ATRQSIAVSA LALANQSQQS VLQLLR