Gene Nmul_A1797 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A1797 
Symbol 
ID3786348 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp2052858 
End bp2054714 
Gene Length1857 bp 
Protein Length618 aa 
Translation table11 
GC content56% 
IMG OID637811883 
Productsurface antigen (D15) 
Protein accessionYP_412486 
Protein GI82702920 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0729] Outer membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAAAGCC TGCCTTTCAT TCGCGGGCTT CCAGTATTGT TTATACAGCC CTTCTTGCAT 
TTGCCCGTCT TGAAATTCTT GTTTTCTCTT CTGGCGGATA AAGCGTGTCT TTATCTCGGG
GTGATCATTT TGCTCATGCT GAGTAACGAC ACGGTCGAAG CGGGTCTCTT CGGCGGCCTC
TTCGGAGACG ATGCTCCTCC CCCCTCCATC AGCCTTTCCG CTCCGGAGCC CGTGGCGGAT
CTGCTGAAAA CACATTTCCG TCTGCCGACA GAAGCTCTGG AAGATGAAAC TGCGCGCGCT
ACTTTCATGC GGCGCGCCCA GCGCGAAATT AGCGAACTGC TCGCCACCGA GGGTTATTTC
ATGCCCAGGA TAACATTGCA TCTCTCCACC CCCGGCGAGG TACCGCAACT GGAAGTCGCG
CCGGGACCGC GGACAATGGT TGTCGGGGTG CATATCGAGT TCAAGGGAGA CCTGAGTGTT
GATGAGCCTG GACGGCGCGC ACGGATTGAA AAGCTGCGCT CTGCCTGGTC CCTCAAGGAA
GGCCAGCCCT TTCGCTCCCC TGCCTGGGAA GAGGCCAAAT CGGTATTGCT ATCCAATGTC
GCGGGAGAGG ATTATGTCGC AGCGCAGATC GAGGAAAGCA GGGCGGAGAT AGATCCCGAT
TCTTCGCAGG CGCGGTTGAG GGTGATAGTG AACTCCGGGC CGGCATTCCA CTTTGGCGAG
CTCGACATAA AAGGGCTCAA TCGCTACGAA CCCTCACTCA TAAGCGGCCT TGCGCCATTT
AAACCGGGGG ACCTTTATCG CCGTGATCAA TTACTCTCGT TCCAGACGAA ATTGCAGAAT
CTGCCTCAAT TCAGTTCTGC GGCTGTCAAT ATTCAACCTG ACGAAGTAAC GCATCAGGCG
GCGCCGGTAG AGGTAGTGCT ATCGGAGGCG AAGTCGAAGA GGGTAGGGTT CGGCGCAGGG
TACAGTTCCA ATACGGGTGC GCGTGGCGAG GTCACTTACA TGAATAACGA TTTCCTGAAT
AACGCCTTGA GACTGAACAG TGGATTGCGT ATCGAGCAGA AACGCCAGAG CTTGACGGGC
TCAATCGACA GCGTGGCGGA TGCCTCGGGA ACATGGTTTT CCTTGGGGGC GGCAGCGGAT
AGAACCTTTA TCCAGCAACT GGAAACCATA CGCCAGAAAG TCGGCGTCAG TCGCAACCAG
CTCTTGGACA AGACCGAAAC AAGACTATCA TTGAACTGGC AGCGGGAAAA CCGGGATCCA
AAAGGGGGCC TGGAGCAGAT CAACCAGACC TTGGTGCTGG ACGGTTATCT ACGCTATCGT
TCCGTGGACA ACCCGTTATT CCCCAGGGAT GGCAGCGTTT CCGAATTGCG CATCGGTGGC
GGCAAGCGGG AACTGTTGTC CGATCAGGAC TTCTTGCGGA CTTATGCCAG GCATCAGTTC
TGGTATCCGG TGGGCAAGCG CGACGTGCTA TTTCTGAGGG GCGAGCTGGG GTACACCTTT
GCTCCCTCGC GCTTCGGCAT TCCCCAGGAA TATCTCTTTA GAGCGGGCGG TATTCAATCC
GTTCGCGGAT ACGCTTTTCA GCGTTTAGGC GTGAGGGAAG GCAGCGCGGT GGTCGGGGGC
AGGGTAATGT TCACGGGTTC AATTGAATAT AATCACTGGC TTACACGTAA TTGGGGTGCT
GCCATCTTTA CCGATGTGGG GGATGCGGCC GATACCATAG GCGGGTTGAA CCCGGCTGTC
GGATACGGGG GAGGGATACG CTGGCGCAGT CCTGTAGGGC CATTGGCGGT GGATGTCGCC
CGCGGGCAGC GGGACGGGAA ATTCCGTTTT CATTTTTCGA TTGCCGTGGC GTTCTGA
 
Protein sequence
MQSLPFIRGL PVLFIQPFLH LPVLKFLFSL LADKACLYLG VIILLMLSND TVEAGLFGGL 
FGDDAPPPSI SLSAPEPVAD LLKTHFRLPT EALEDETARA TFMRRAQREI SELLATEGYF
MPRITLHLST PGEVPQLEVA PGPRTMVVGV HIEFKGDLSV DEPGRRARIE KLRSAWSLKE
GQPFRSPAWE EAKSVLLSNV AGEDYVAAQI EESRAEIDPD SSQARLRVIV NSGPAFHFGE
LDIKGLNRYE PSLISGLAPF KPGDLYRRDQ LLSFQTKLQN LPQFSSAAVN IQPDEVTHQA
APVEVVLSEA KSKRVGFGAG YSSNTGARGE VTYMNNDFLN NALRLNSGLR IEQKRQSLTG
SIDSVADASG TWFSLGAAAD RTFIQQLETI RQKVGVSRNQ LLDKTETRLS LNWQRENRDP
KGGLEQINQT LVLDGYLRYR SVDNPLFPRD GSVSELRIGG GKRELLSDQD FLRTYARHQF
WYPVGKRDVL FLRGELGYTF APSRFGIPQE YLFRAGGIQS VRGYAFQRLG VREGSAVVGG
RVMFTGSIEY NHWLTRNWGA AIFTDVGDAA DTIGGLNPAV GYGGGIRWRS PVGPLAVDVA
RGQRDGKFRF HFSIAVAF