Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A0337 |
Symbol | |
ID | 3785962 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | + |
Start bp | 368771 |
End bp | 370045 |
Gene Length | 1275 bp |
Protein Length | 424 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 637810413 |
Product | hypothetical protein |
Protein accession | YP_411037 |
Protein GI | 82701471 |
COG category | [S] Function unknown |
COG ID | [COG1565] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGTCTGCCT CAAAAGAGGG ACTTAGATAC CGCTGCTATC TGCTGGCCGA ATCAACAGTA AAGCGTAGCA TAGCGCCAAT GCAGCCACAG CCGCCATTTC CTTTACCGGG CGAAGCAGCG CTCGAGCATA GCCGCGTACT GACAAAACTC ATTCATGAGA AAATCAGTGC AGCTGGCGGC TGGATTTCAT TCGAGCATTA CATGAGGCTG GCGTTGTACG CGCCGGGCAT GGGTTATTAC AGCGGTGGTC CCGCCAAATT CGGCCAGGAA GGCGATTTCG TCACTGCCCC CGAAATTTCT CCGCTGTTTG GCAGGACTGT GGCGCGGCAG GCCAGGCAGA TACTCGAGTT GGCGGATGAA GGCAGCTGTA TTCTCGAATT TGGTGCCGGA ACAGGTAAGC TGGCGCTCGA TCTATTGGTT GAACTGGAAA AACTCGACTG TCTGCCCCAG CAGTATTTCA TTCTCGAAGT CAGCGCGGAA CTCCGGCAGC GTCAACGGCA GTTGCTTGAA CAGTTCGCGC CGCATCTCGC TTCACGCGTT TTTTGGTTGA AGCATTTGCC GGAGCAGTTC AACGGTCTGA TATTGGCCAA TGAGGTGCTC GACGCCATGC CGGTTCACCT GATCGCGTGG CGTGGCACCA CGGTGTACGA ACGCGGTGTC TCCTCAGCCG GCCATGAGTT TATCTGGAGT GAGCGCCTTC TTGCAGAAGG AGTACTTTTC GAAGCGGCGC AGGAACTGGC GGATCGAATT CGTTTGGGGC GTAATGAAGG TGAATACGTC AGTGAAATCT GCCTGCAGGC ACGCGGTTTC ATTGCGAGTC TCGGCAAGAT GCTGCAACGA GGAGCGATCC TGCTGATCGA TTACGGTTTC GGCCGGGATG AGTATTATCA CCCTCAGCGC AGGCAGGGGA CATTGATGTG TCATTACCGT CACCACACCC ATGACAATCC GTTTTATCTC CCGGGTCTGC AGGATATAAC CAGCCACGTC GATTTCAGCT CCGCTGCCAG TTCGGGGCTC GAAGCGGGCC TGCAGTTGCT GGGCTATACG ACGCAAGCAC ACTTTCTCAT CAATTGCGGA ATAACCGAAA TTCTGGCGGA GACACCTGCC GCGAACGCAA AGGATTATTT GCCACTGGCA AACCAGGTAC AGAAACTGGT GAGCCCGGCT GAGATGGGGG AGTTGTTCAA AGTTATGATC CTGGGCAAAG GGATTGGCAA TAATCATCCA CCTCCCGTCG GCTTTACGGG CGGCGACAAG AGTCGCCTGC TATAG
|
Protein sequence | MSASKEGLRY RCYLLAESTV KRSIAPMQPQ PPFPLPGEAA LEHSRVLTKL IHEKISAAGG WISFEHYMRL ALYAPGMGYY SGGPAKFGQE GDFVTAPEIS PLFGRTVARQ ARQILELADE GSCILEFGAG TGKLALDLLV ELEKLDCLPQ QYFILEVSAE LRQRQRQLLE QFAPHLASRV FWLKHLPEQF NGLILANEVL DAMPVHLIAW RGTTVYERGV SSAGHEFIWS ERLLAEGVLF EAAQELADRI RLGRNEGEYV SEICLQARGF IASLGKMLQR GAILLIDYGF GRDEYYHPQR RQGTLMCHYR HHTHDNPFYL PGLQDITSHV DFSSAASSGL EAGLQLLGYT TQAHFLINCG ITEILAETPA ANAKDYLPLA NQVQKLVSPA EMGELFKVMI LGKGIGNNHP PPVGFTGGDK SRLL
|
| |