Gene Nmul_A1339 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A1339 
Symbol 
ID3785065 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp1528393 
End bp1530402 
Gene Length2010 bp 
Protein Length669 aa 
Translation table11 
GC content56% 
IMG OID637811427 
Productflagellar hook-associated 2-like 
Protein accessionYP_412034 
Protein GI82702468 
COG category[N] Cell motility 
COG ID[COG1345] Flagellar capping protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.542838 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGATTA CCGCTGCCGG AGCAGGTTCG AACCTGGACG TAAATGGCAT AGTTAGCCAG 
TTGATGGCGG CCGAACGTAC GCCGCTTGCT TTGCTTCAGA AGCGCGAATC CGACTACCAG
GCAAAACTAT CCGCTTATGG AACACTGAAA GGTGCCTTGT CTGCTTTCCA GACGGCAATG
CAGGGACTGG CTGATCCTGC AAAATATAAT GCAGTGAGCG CGAGTGCGGC AGATAGCTCT
CTGTTGACCG CAACCGGGAA TAGCAATGGC AAGGCGGTGC CGGGTAGTTA TTCGGTGGAA
GTGCAGCAGC TTGCCCAGCA GCAGAAAATC CGTTCCGAAG GATTTGCCAG CACATCAAGC
ACCGTAGGCA GCGGAACTTT GACGATACAG TACGGCACTT ATGATAATGT GCTGAATACC
TTTACTCTCA ACAACGCGAA ACCGGCGCAA ACGATAACGA TCGAACCCTC CAGCAACACG
CTCTCCGGTG TGCGTGACGC TATCAACGCA GCCAATGCCG GGGTGAGTGC AACGATTGTG
AATGACGGCG CCGGCAACAA ACTTGTGCTG ACCGCAAAGG ATCCAGGAGC GGCGAGCAGC
CTGAAGATTA CAGTCAGCGA TGACGATGGG GGCAATCTCG ACACAACCGG ACTCTCCGCT
CTTGCCTTCG ACCCAACGGT CGGAGGGGGC TCCGGCAAGA ATCTCATCCA GGTGCAGGCC
GCGCAGGATG CGAAACTCCG GATCGACGGG ATCGATATCG TCAAGTCCTC GAATACGATT
ACAGACGCAA TTGAAGGCGT CACACTCAGC CTGCTCAAAA CGAATGCAGG CAGTCCGACG
ACGCTGAATG TTTCACCGGA CACCGCTGCT GCGAAGACAG CGGTGGAAGC ATTCGTAAAG
TCATACAACA GCATCAATCA AACGCTTTCC AATCTCAGCG CCTATAACCC TGGAGCAAAG
AAGGGCGCGA TATTGCAGGG GGATTCCGCG GCGTTCTCTA TTCAGCGCGG AATTCGATCG
GCATTAACAG CCATGATGGG TGACAGTCGC GGCTTCACCT CCCTTTCCCA GATCGGTGTT
ACCTTGCAGA AAGACGGCAG CCTGGCGGTG GATTCCGCAA AGCTGCAGGC GTCGATGGAT
ACTGGATTTG AACAGATCGG CAGGCTGTTT ACGATAGGCG GCACCTCCAC CGATAGTCTG
ATTTCGTACG AAGGAGCCAC GGATAAGACA GTGGCTGGAA ATTATGCCGT TACGGTTACA
CAGCTCGCAA CTCGAGGCAG CCTCAGCGGA AGCCAGGCAG CAGGGCTAAC GATAACGGCA
GGAATCAACG ATCAACTCAA TTTTAATGTG GATGGGGTGG CAGCCAGTAT TACGCTGGCC
GACGGAATTT ATGCTTCCGC GGATGCATTG GCGGCCGAAG TGCAAAGCAA GCTGAATGGT
CTCTCTGGCC TGACAACCGA AGGAATTTCA GTATCGGTTT CCCCGTCCGC CGGCGTATTG
AGCATCGTGT CGTCGCGTTA CGGCTCCGCC TCCAGCGTTA TCCTCACGGG AGGTAATGGT
GCGGGCAATC TGCTCGGTGC CAGTCCTGTG GGAGCCACCG GAATAGATGT CGCAGGCTCC
ATCAATGGTG TTTCCGCTAC AGGGTCGGGA CAGATGCTGA CTGCTGCTGG CGGGGGATCC
GCCGAAGGTC TGCGCGTCGC CATCCACGGC GGCGCGCTCG GCACCCGGGG GACAATCGAG
TTCTCGCGAG GTTACGCCGA GCGATTGAGT AAGGTTGCAG AAGAGTTTCT CGCCACTGAG
GGAGTGATTG CCACCCGGGT CGAGGGACTG AACGCCAGTA TCAAGGATCT CGATCGGCGC
CAGGAGGATT TTAGCCGTCG ACTTGAAACC GTTGAGGCGC GTTACCGGGC ACAGTTCTCT
GCGCTTGATG CCATGCTGGG CAGCTTGACC CAGACGAGCC AGTTTCTTCA ACAACAACTG
GCCTCGCTGC CAACTCTCAA CGAAAAATAA
 
Protein sequence
MAITAAGAGS NLDVNGIVSQ LMAAERTPLA LLQKRESDYQ AKLSAYGTLK GALSAFQTAM 
QGLADPAKYN AVSASAADSS LLTATGNSNG KAVPGSYSVE VQQLAQQQKI RSEGFASTSS
TVGSGTLTIQ YGTYDNVLNT FTLNNAKPAQ TITIEPSSNT LSGVRDAINA ANAGVSATIV
NDGAGNKLVL TAKDPGAASS LKITVSDDDG GNLDTTGLSA LAFDPTVGGG SGKNLIQVQA
AQDAKLRIDG IDIVKSSNTI TDAIEGVTLS LLKTNAGSPT TLNVSPDTAA AKTAVEAFVK
SYNSINQTLS NLSAYNPGAK KGAILQGDSA AFSIQRGIRS ALTAMMGDSR GFTSLSQIGV
TLQKDGSLAV DSAKLQASMD TGFEQIGRLF TIGGTSTDSL ISYEGATDKT VAGNYAVTVT
QLATRGSLSG SQAAGLTITA GINDQLNFNV DGVAASITLA DGIYASADAL AAEVQSKLNG
LSGLTTEGIS VSVSPSAGVL SIVSSRYGSA SSVILTGGNG AGNLLGASPV GATGIDVAGS
INGVSATGSG QMLTAAGGGS AEGLRVAIHG GALGTRGTIE FSRGYAERLS KVAEEFLATE
GVIATRVEGL NASIKDLDRR QEDFSRRLET VEARYRAQFS ALDAMLGSLT QTSQFLQQQL
ASLPTLNEK