Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A0872 |
Symbol | |
ID | 3784442 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | - |
Start bp | 990746 |
End bp | 992275 |
Gene Length | 1530 bp |
Protein Length | 509 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637810954 |
Product | hypothetical protein |
Protein accession | YP_411567 |
Protein GI | 82702001 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00000269659 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATCCGA ACGAAAGCAA AAAATGGGAC ATGCACTGGT ACGATTGGCT GGTATTTGCG GTACCGACGA TCTTCATCGC GAGCCTGGGA CTGGAATCCG CGTTTGGGAC AACGCCGGGG CCCGGTACAG CTGCGTACCG TGATTTCCGC GCAGTTGCGC ACTGGTTTGA TCCGGTCTTC TTCCAGTACA ACCTTGTCAT GACGCTGCTG GCGGTTCTGG TTGTGCCATC GCTCGCCTTA AGCTACATGA TGAAGATGGC GAACCGCAAG GAACGGCGTT TATACCGCGA CGTTCCACCT GAGCGGCGCG GCGAAATCCG TCAGCGGATG GGACGACGGG CTTCCTTTGG CACTTACAGG GGAAGCGTCT GCCTGACCAC GGTCGTCGTG CTGCTCGGTA GTTCCATTCT GCTGCTGTTC AAGCCCGTTT CCTCGTCTGC GGAACTTGGG GTGGATTTCA GTCTCGGGGC GAACATGTTG ATGATGGGCC CCTTCATGGA ACTGTATGAA ACGGGTAATG GTTCCTATTA TTCCCACCTG GTCCGCAACC TGACGGCTTT CCAGTTCGGA TTCCTTGGCG CTTACGTCTA TTTCATCGGC TCTGTCGTCC GTGCCTACTT CACCATGGAC CTGACTTCCA ATACGTTCGT CGATGGCGCT ATCCGCATGA TCGTCGCAAG CCTGCTGGCG CTGGTGCTCT CCTTTGCGTT CGATCTGCTG CTGCCACACG AACTCGATGT GAATGTGTCC TCAACTGCCC CCTCAACCGC TGCGCCCTCC GACGTGAATG CCACGACACC CGCGCTCCCC CCACCCTCAA CGCCTTCTAC CGGTGAGGAA TCCGGAGAAA CCACATCTCC TGAAAGATCA GGCAATAGCG AAAAACTCCT TTCGAAACAG GAAGTCCCGT TGCCTGCCAG ATTGAGCCTC CTGCCTCTTG TTGCTTTCTT TCTCGGTTTC TATCCGAAAC GGACTGCACT CGGAATTGAG CGGATCGTCA TTAAACTGAC CAGAGGAATC ATTCCCACCA TGAGCTACCG CGCGCTGCCG CTTTCCATGC TGGCAGGAAT GAGCTACTCG CACGAATTGA GACTGGAGCG GGAAGGGTTC GATAATATCG AAAACCTCAG CAATGCTGAC CCTGTGGATC TCGCGATACG CACCTGCTTC AGCTATAGCC AGTTGAAGCA ATGGATCGAT CAGGCATGGC TGGCTTCTCA TCTGCGCGAA GATTATCCCG GCTTCGCGCA GCGGACAGGC ATCACGAACA GTGAGGAACT TCGCTGTTTC TTCTCCACCT GCGACACCTC CCATACCGAC GGAGTGGAAC AATTGCTAGC CGCCCTGTCA GCAGACCCGG CAACCGTCGC TTCGTGGAGA TTGAGGCTCA ATACTCTCCG GATACTGCTG GATACAAACT CTTCCCGGCA GGGGAGCGGT CACGACTTCA CCGAAAGGGA ACTCACCCCG CCTCAAGTTA CACATCCGGA AGCAGCAACC AGCTCACCTA AATCACAAAC AGATCGATAA
|
Protein sequence | MDPNESKKWD MHWYDWLVFA VPTIFIASLG LESAFGTTPG PGTAAYRDFR AVAHWFDPVF FQYNLVMTLL AVLVVPSLAL SYMMKMANRK ERRLYRDVPP ERRGEIRQRM GRRASFGTYR GSVCLTTVVV LLGSSILLLF KPVSSSAELG VDFSLGANML MMGPFMELYE TGNGSYYSHL VRNLTAFQFG FLGAYVYFIG SVVRAYFTMD LTSNTFVDGA IRMIVASLLA LVLSFAFDLL LPHELDVNVS STAPSTAAPS DVNATTPALP PPSTPSTGEE SGETTSPERS GNSEKLLSKQ EVPLPARLSL LPLVAFFLGF YPKRTALGIE RIVIKLTRGI IPTMSYRALP LSMLAGMSYS HELRLEREGF DNIENLSNAD PVDLAIRTCF SYSQLKQWID QAWLASHLRE DYPGFAQRTG ITNSEELRCF FSTCDTSHTD GVEQLLAALS ADPATVASWR LRLNTLRILL DTNSSRQGSG HDFTERELTP PQVTHPEAAT SSPKSQTDR
|
| |