Gene Nmul_A0114 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A0114 
Symbol 
ID3786381 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp120420 
End bp121574 
Gene Length1155 bp 
Protein Length384 aa 
Translation table11 
GC content42% 
IMG OID637810184 
Producthypothetical protein 
Protein accessionYP_410815 
Protein GI82701249 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTAAACA TTTTTAACTT TGCTGACTGT AGATCCGCAA TTATTCGCTG TAATCTTGTA 
CTTATTGTAT TTGGGCTTAC GAACGTCGCA GTGGCGCACG ACTTGCCGGG GTTAAAATCG
GAAAACTTAT CGAATATATC GGACTCCACC GGACTAAGAA ATTCGACACC TGCACCCGAT
CAAATTAGCA GCGATAAAGA CGAAAATTCG CAACTGCGCA ACTCCGAACG CGCGACGTTA
ATCGCCATGA AAGATCAGAG TCATGCTATT GTCATTGCTT CTGCTCTAGC TAGAGAGGCC
GGATCTCAAA ATATATATGG TCTAACTGAA GCCGACAAAA AGAAAGTCGA AGCTGTAATT
CTGCGTGCTA AAACTTGGGA CCCAGGCACA ATAGTAAAGA TTTGCTTTTA TGAGGAAGGG
AAAACTGCGA GGCCAGCTAT TGTAGAATTT GCTAAAACTT GGACGAATCA CGGTAACGTT
AAATTTGATT TCGGAGTTGC CCCCGCTTAC AGAACCTGTT CTCCAAATGA AAGTAGTAAT
ATTCGAATTA CATTCCGTAC GAAAGGTTAT TGGTCTTTGA TCGGGACGGA CTCTATGTAT
AGAGTATCTG CTCCTAGTAT GGGTCTTCAG GATTTTGATT CTCGTGATCC CAAGGATCCA
GAATTTGCTA CTACTGTATT ACACGAATTT GGGCATGCGC TAGGCTTTCA TCATGAACAT
CAAACACCAG CTGCAGACTG TGAAGCTCAA ATGGATACGG GAAAGATAAA ACAAATATAC
AGATGGGACG ATAGTGAGAT AAAAGAAAAC TTCCATAGAA TTGAAGTTTC ATCGCTAACT
GGTATGAAGA ATGGGTTTAA AGTGGGAGAT TCCCCTGATG GCAAAGTGGC CTATACCGTA
TATGATCCAA ACTCAATCAT GCATTACGCA TTGCCATACA TAATTTTCAA GCAACCCTTG
CCTAAAACTG GTTCTTGCTA TATCCCACCA AACCGGACGT TATCGAAGAT TGATATTGCT
GGGATGAAGG AAGCGTATCC TAATACCGAT CAAGCGAGCC TTACTGAACA AAATAGAAAG
ATAATTAAGG AATTAATGGT TGATCAGAGG CTCACTACGA TCCAGAGGGC AGCCTTTTCC
GTTTTACAAA AATAA
 
Protein sequence
MLNIFNFADC RSAIIRCNLV LIVFGLTNVA VAHDLPGLKS ENLSNISDST GLRNSTPAPD 
QISSDKDENS QLRNSERATL IAMKDQSHAI VIASALAREA GSQNIYGLTE ADKKKVEAVI
LRAKTWDPGT IVKICFYEEG KTARPAIVEF AKTWTNHGNV KFDFGVAPAY RTCSPNESSN
IRITFRTKGY WSLIGTDSMY RVSAPSMGLQ DFDSRDPKDP EFATTVLHEF GHALGFHHEH
QTPAADCEAQ MDTGKIKQIY RWDDSEIKEN FHRIEVSSLT GMKNGFKVGD SPDGKVAYTV
YDPNSIMHYA LPYIIFKQPL PKTGSCYIPP NRTLSKIDIA GMKEAYPNTD QASLTEQNRK
IIKELMVDQR LTTIQRAAFS VLQK