Gene Nmul_A1057 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A1057 
Symbol 
ID3784877 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp1217239 
End bp1221033 
Gene Length3795 bp 
Protein Length1264 aa 
Translation table11 
GC content54% 
IMG OID637811141 
Producthypothetical protein 
Protein accessionYP_411752 
Protein GI82702186 
COG category[S] Function unknown 
COG ID[COG3164] Predicted membrane protein 
TIGRFAM ID[TIGR02099] conserved hypothetical protein TIGR02099 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTGGGTAC TGATTGCAGT GGCAGCCGTC TTTTCGGTTC TGCTGCTGTC TCTCCGCTAT 
TGGCTGCTGC CGAATATCGA ACAGTACCGT GAAAACCTCG CTTCCGCCAT CAGCCATGCT
TCCGGTCAAT ATGTAACACT GGGCGAGATC AGCGCGAATT GGGACGGGTT CCGTCCCCAC
ATGATGCTGC GTGATGTGCG CGTGCATGAT AACCAGGGTG TCACCTTGCT GCTGAATCGC
CTTGAAGGCA CGCTTTCATG GCGTTCGATA TTGCATGGCG AATTGCTTTT TCGTGAAATC
GCAATCGATC AGCCTGATTT GATCGTACGT CGAGACACTG CCGGTGTCAT TCATGTCGCC
GGATTCGCAC TAAAGCAGGA ATTTACCGAA AGTGAAGATG GCTTTTTCGA CTGGCTCCTC
AACCAGAGGC AAGTCATCAT AAACAATGCC AATATTCTCT GGCAGGATGA CCAGCGCACC
GCGCCGCAAC TGGAACTGCT GGTCAACCTG CGCCTCGAGA ACCGGGGCAG ACATCATCGC
TTCGGCATCC GCGCCATACC CCCCACGCGG CTTGCCGCTC ACCTTGATAT GCGCGGGGAT
TTCAAGGGAG AGTCGTTGGC CAACCCCGGA TTGTGGCGGG GCCGGCTTTT CATGCAGATA
ACCCGTGCCG ATATAGCGGG GTGGCAGGCC TGGCTGCCTT TCCCCGAGGA AATAAAGTTG
AATCGGGGCG TTGGCGCGCT GCGGATCTGG GCAAATATCG ACGGCACAGA CATGAACAAA
GTTACGGCAG ACATGCGCTT GCAGAATGTA AAGGCGCGGC TCGAGTCGAC TCTGCCGGAA
GTGAGCCTCA CCAGGCTGGC AGGCAGGGTA GGATGGCACA AGGTGGAGGA CAACGGCAGC
AACGGCAGTC AATTTTTTGC CCGTCGGCTC GATGCCGCGT TTTATGGCAA ACCGCCTTTG
CCTTTGCTGG ATTTTTCGTT CCAGCAGCTT CACCACGATG CCAGCCAACC GGATAGCAAC
ACATTGAGCG TCCAGAATCT GAGACTGGAT AAACTGGAGG AGTTGGCGAA ATACCTGCCG
ATGAGTGAGC CGCTCCGGGC TAAAATCCGC GCTGTTTCTC CCCGCGGTAA ACTGCATTCG
GTGTTGATAA AATGGACGGG GGAATGGGCG GAGCCTTCTT CTTTCCATGC CACTGGAAGA
TTTACCAAAC TGGGAATGAA AAGGTCCGAT GGCATGCCTG CCTTCACTGG CGTCAGCGGG
AATATTCATA TTACCGAACA GGGAGGCACG CTGAATCTGG ATTCCCAAAA CACGGTGCTG
CAACTACCGG ATCCAACGAT TGAACCGGTA ACACTGAATA CACTTGCTGG ACAGATAAGA
TGGAATCTGA CCAGCAACGG TTCAAAGTTG GTAAAATTCA GCAATATCTC CTTCTCCAGC
GCGTACGCAG CCGGGTCAGC CTATGGAAAT TACCAGACAG CGCCCGCCAG CCCGGGTATC
CTCGACTTGA CAACTCATCT CACGCGCGCC GATATACCTT CTCTGCTGCG TCTCCTGCCG
GCGAAAGGAA AAGGGAGGGA GTATCTTCCT GATTGGCTGG GTGAATCCAT TGTTGCAGGG
AGTATTTCGA ATGGCTGGTT TCACCTAAAG GGAAATCTGG CTCGACCTCC CTTTGTTTCA
AGCAATCCCG GTGTCTTCGA ATTTGCCGCA AAGATATCGG GCATGTTGCT CGATCCCCTC
CCCGGCTGGC CGCGAATAGA GAATCTCGCC GGAACAGTGC GTCTTAACGA TAAACGCATG
GAGATCAATG TTTCCAAAGG AGATATTCTT GGGGCGCGCC TTGGGAAGGC AAGATTGATC
ATTCCTGACA TGACTGCCAC CGAGGCTAGG CTCAAAACCG AACTGGAAGC GACCGGCGCC
ACTCGTCAGT TTCTGGCATT CGCTGCCGCA AAAACACCGG ACACCTACGA TAACTGGCTG
ATGGAGAATA CCCGTATTTT TGGCGACGGG AGATTGCTGC TCAAACTTGA TACTCCACTG
CGCGGCCCAG GGGAAACTAG ACTGTGGGGT CGCTACCAAT TCATGAACAA CCGGATCGCC
CCGAGCTCCT CATATATTCC CGAGCTGGAG CAATTGAATG GCACGTTGAC CTTTACCGAT
TCTGAAATAA GAACGAAAAA CCTGAGCGGC CGTCTTCTCG GCGGCCCCGT GCTGATCAGT
TCCACTGATA TGCCAGGCGG TGGCGTACGT TTTTCCGCTA TCGGAAAGGT CGACTTCGAT
AACCTGAATG CCATTTCGCA ACCCACCGGG TCACGCGATA TCCCTTTCTG GACCAGACAT
ATACACGGCA GCGGCGATTG GCGAGCAGCC GTGCTCGTAG GCAACCGGTC CACAGATGTG
AGCGTTGAAT CATCATTGGA GGGAATTTCC TCGGATCTGC CCGAGCCTTT GTCGAAGGCG
GCGCATGATG CAATACCAGT GAAATTCGAA GGAAAGGCGA CGGGTACGCA GAGCGAGGAA
CTGCATCTGA GCTATGGCGA GCGCATCAAG GCAAAGATTA GCCGCACCCG TGACGATTCC
GGTCATTTTC ATGTGGAACG TGGTTCCATC GTTTTCGGTC CCTCACCCGT TTTTCTCCCT
GAAGAGCCGG GTATCGTGAT AAAGGGTGCA TTGCCCGTAT TGAATCTCGA CCGTTGGCGC
CTCTTGCTCA AGCAATTCGA AATCCAACCC GCCGCCCCTT TCAGTTTGAA TGGCTTAAAC
CTGTATATAG AATCGCTTGG CTTTCTGGGC AGGCAGTTCG ATGATGTCAC ACTGGATGCC
AATAGAAAAG ACGGCCTGTG GTATTGCAGG ATCACAAGCG AGGAGGTCAA TGGAGGCATT
ACCTGGAATC CATCCGGTAC CGGCAAGATA GTGGCGCGAT TGAACAGGCT GATCATTCCT
GCAAACCCTC CTTCCGGCCC AGGCACGGTA TCCAGAAGCA GGCAACAGGA AAAGGATCTG
CCGGCGCTCG ATGTAATCGT GGATGACTTT GTTTTTGGCG AGAAACAGCT GGGGAAGCTG
GAACTGGTTG CAAATCAGGA AGAACGGAAC TGGTATATCG ACAAACTGCA CATTGTCAAC
CCCGACAGTT CGATAAAAAT GCGAGGACTA TGGAAAAACC GGGTTCCGAC TCCACAGACG
CAGGCCACAG TCATGCTGGA AACAGATGAC ATTGGGAAAT TCCTTGAGCG GCTCGCCCTT
CCTGATCGCG TAACCAGTGG AAGCGGTACG CTCGAGGGCA TCCTGTCCTG GCAGGGGGAT
CCCCTATCGA TAGATTACTC CACCTTGTCC GGCAGGTTCA AGCTTGGTGC AAGACGCGGA
CAATTTCCCA GGTTTGAGCC CGGAATCGGC AGGCTCTTCG GCATTTTCAA TCTCCGTTCA
CTACCGCGGA GGATCACGCT GGACTTTCGC GATGTGTTCA GTGAAGGTTT CGGATTTGAC
GACATTTCCG GCAGCATAAA TATCGCGAGT GGCATTGCAT CGACCGATGA GCTTAAAATA
AACGGGCCGG CTGCAAGGGT TACAATGAAT GGACAGATGA ATCTCGAAGC GGAAACGCAA
AAACTTCACA TCAGGGTGAC CCCTTCCTAT GGACTTGCCT CCCCCGTAGT GGGGATGGCA
TCGGTGATTG CAAGCACAGC CATGAAGAAA ACACCTGCTC CATCAAGAGA CTACAACATT
ACCGGCACCT GGGCAGATCC TGTCGTAACC CGGATAGGGC AGCCGGCCCA GGAACTAGCG
GAGCCTCAAC CCTGA
 
Protein sequence
MWVLIAVAAV FSVLLLSLRY WLLPNIEQYR ENLASAISHA SGQYVTLGEI SANWDGFRPH 
MMLRDVRVHD NQGVTLLLNR LEGTLSWRSI LHGELLFREI AIDQPDLIVR RDTAGVIHVA
GFALKQEFTE SEDGFFDWLL NQRQVIINNA NILWQDDQRT APQLELLVNL RLENRGRHHR
FGIRAIPPTR LAAHLDMRGD FKGESLANPG LWRGRLFMQI TRADIAGWQA WLPFPEEIKL
NRGVGALRIW ANIDGTDMNK VTADMRLQNV KARLESTLPE VSLTRLAGRV GWHKVEDNGS
NGSQFFARRL DAAFYGKPPL PLLDFSFQQL HHDASQPDSN TLSVQNLRLD KLEELAKYLP
MSEPLRAKIR AVSPRGKLHS VLIKWTGEWA EPSSFHATGR FTKLGMKRSD GMPAFTGVSG
NIHITEQGGT LNLDSQNTVL QLPDPTIEPV TLNTLAGQIR WNLTSNGSKL VKFSNISFSS
AYAAGSAYGN YQTAPASPGI LDLTTHLTRA DIPSLLRLLP AKGKGREYLP DWLGESIVAG
SISNGWFHLK GNLARPPFVS SNPGVFEFAA KISGMLLDPL PGWPRIENLA GTVRLNDKRM
EINVSKGDIL GARLGKARLI IPDMTATEAR LKTELEATGA TRQFLAFAAA KTPDTYDNWL
MENTRIFGDG RLLLKLDTPL RGPGETRLWG RYQFMNNRIA PSSSYIPELE QLNGTLTFTD
SEIRTKNLSG RLLGGPVLIS STDMPGGGVR FSAIGKVDFD NLNAISQPTG SRDIPFWTRH
IHGSGDWRAA VLVGNRSTDV SVESSLEGIS SDLPEPLSKA AHDAIPVKFE GKATGTQSEE
LHLSYGERIK AKISRTRDDS GHFHVERGSI VFGPSPVFLP EEPGIVIKGA LPVLNLDRWR
LLLKQFEIQP AAPFSLNGLN LYIESLGFLG RQFDDVTLDA NRKDGLWYCR ITSEEVNGGI
TWNPSGTGKI VARLNRLIIP ANPPSGPGTV SRSRQQEKDL PALDVIVDDF VFGEKQLGKL
ELVANQEERN WYIDKLHIVN PDSSIKMRGL WKNRVPTPQT QATVMLETDD IGKFLERLAL
PDRVTSGSGT LEGILSWQGD PLSIDYSTLS GRFKLGARRG QFPRFEPGIG RLFGIFNLRS
LPRRITLDFR DVFSEGFGFD DISGSINIAS GIASTDELKI NGPAARVTMN GQMNLEAETQ
KLHIRVTPSY GLASPVVGMA SVIASTAMKK TPAPSRDYNI TGTWADPVVT RIGQPAQELA
EPQP