Gene Nmul_A0337 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A0337 
Symbol 
ID3785962 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp368771 
End bp370045 
Gene Length1275 bp 
Protein Length424 aa 
Translation table11 
GC content55% 
IMG OID637810413 
Producthypothetical protein 
Protein accessionYP_411037 
Protein GI82701471 
COG category[S] Function unknown 
COG ID[COG1565] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTCTGCCT CAAAAGAGGG ACTTAGATAC CGCTGCTATC TGCTGGCCGA ATCAACAGTA 
AAGCGTAGCA TAGCGCCAAT GCAGCCACAG CCGCCATTTC CTTTACCGGG CGAAGCAGCG
CTCGAGCATA GCCGCGTACT GACAAAACTC ATTCATGAGA AAATCAGTGC AGCTGGCGGC
TGGATTTCAT TCGAGCATTA CATGAGGCTG GCGTTGTACG CGCCGGGCAT GGGTTATTAC
AGCGGTGGTC CCGCCAAATT CGGCCAGGAA GGCGATTTCG TCACTGCCCC CGAAATTTCT
CCGCTGTTTG GCAGGACTGT GGCGCGGCAG GCCAGGCAGA TACTCGAGTT GGCGGATGAA
GGCAGCTGTA TTCTCGAATT TGGTGCCGGA ACAGGTAAGC TGGCGCTCGA TCTATTGGTT
GAACTGGAAA AACTCGACTG TCTGCCCCAG CAGTATTTCA TTCTCGAAGT CAGCGCGGAA
CTCCGGCAGC GTCAACGGCA GTTGCTTGAA CAGTTCGCGC CGCATCTCGC TTCACGCGTT
TTTTGGTTGA AGCATTTGCC GGAGCAGTTC AACGGTCTGA TATTGGCCAA TGAGGTGCTC
GACGCCATGC CGGTTCACCT GATCGCGTGG CGTGGCACCA CGGTGTACGA ACGCGGTGTC
TCCTCAGCCG GCCATGAGTT TATCTGGAGT GAGCGCCTTC TTGCAGAAGG AGTACTTTTC
GAAGCGGCGC AGGAACTGGC GGATCGAATT CGTTTGGGGC GTAATGAAGG TGAATACGTC
AGTGAAATCT GCCTGCAGGC ACGCGGTTTC ATTGCGAGTC TCGGCAAGAT GCTGCAACGA
GGAGCGATCC TGCTGATCGA TTACGGTTTC GGCCGGGATG AGTATTATCA CCCTCAGCGC
AGGCAGGGGA CATTGATGTG TCATTACCGT CACCACACCC ATGACAATCC GTTTTATCTC
CCGGGTCTGC AGGATATAAC CAGCCACGTC GATTTCAGCT CCGCTGCCAG TTCGGGGCTC
GAAGCGGGCC TGCAGTTGCT GGGCTATACG ACGCAAGCAC ACTTTCTCAT CAATTGCGGA
ATAACCGAAA TTCTGGCGGA GACACCTGCC GCGAACGCAA AGGATTATTT GCCACTGGCA
AACCAGGTAC AGAAACTGGT GAGCCCGGCT GAGATGGGGG AGTTGTTCAA AGTTATGATC
CTGGGCAAAG GGATTGGCAA TAATCATCCA CCTCCCGTCG GCTTTACGGG CGGCGACAAG
AGTCGCCTGC TATAG
 
Protein sequence
MSASKEGLRY RCYLLAESTV KRSIAPMQPQ PPFPLPGEAA LEHSRVLTKL IHEKISAAGG 
WISFEHYMRL ALYAPGMGYY SGGPAKFGQE GDFVTAPEIS PLFGRTVARQ ARQILELADE
GSCILEFGAG TGKLALDLLV ELEKLDCLPQ QYFILEVSAE LRQRQRQLLE QFAPHLASRV
FWLKHLPEQF NGLILANEVL DAMPVHLIAW RGTTVYERGV SSAGHEFIWS ERLLAEGVLF
EAAQELADRI RLGRNEGEYV SEICLQARGF IASLGKMLQR GAILLIDYGF GRDEYYHPQR
RQGTLMCHYR HHTHDNPFYL PGLQDITSHV DFSSAASSGL EAGLQLLGYT TQAHFLINCG
ITEILAETPA ANAKDYLPLA NQVQKLVSPA EMGELFKVMI LGKGIGNNHP PPVGFTGGDK
SRLL