Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A2198 |
Symbol | |
ID | 3786223 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | - |
Start bp | 2495984 |
End bp | 2497012 |
Gene Length | 1029 bp |
Protein Length | 342 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637812285 |
Product | hypothetical protein |
Protein accession | YP_412882 |
Protein GI | 82703316 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR02595] PEP-CTERM putative exosortase interaction domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000179032 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGAATC AAAAACAGAA AGCGATTGCA CAGGCAACGC GCGCGGTAGT TAACGGAGGA TTGTTTGCGG GTTGCCTGTC GTTCAGTCTC ATCGCAAACG CGGGCATCAA TTTCCAGTTC GTATACAAGG ATGCGCCTGG TACGGGATTT CTCGATCCCG TAAACGGCGC CAGTCGGCAA GCTGCGCTCA ACACTGCTGC CACGGAATTC TCCAGAATGT TTGGCACCCA CTTCGCCAAC TCGGGGACCA TTGTGCTGGA AGCAACGGCC ACCAATGATC CCCAGAGCAG TACCCTGGCA GGGGCGGGCA GTGAATATGT CGTTCCCCCG GTACCGGGAT TCAACCTCAA CGAGGTCGTG CGCGAGAAAC TCCAGACAGG GATTGATTCC AACGGAAGCA GACCCGATGG CTCGCTCGAT ATCAACTTTG GGAGCAAATG GGAGCTTGGC TTCAACACGC CCGTGTCGAG TGAGCGCTAC GACTTCTATT CCACGATGTT TCATGAATTT ACGCATACGC TTGGTTTTTC TTCATCCATA GGGCAATTCG GTGATCCGAT CGGGGGTACG AAGGATGCGG GAAGCTGGAG CAGCTTCGAC AGCTACCTGG TAAACAAGAG TGGAACTCCG GTCATTGATC CTGCAACTTT CGCACTCGAT CAGACTGTCT GGGATGCAGG CAGCGTGGGA GGCACCAGTC CCTCGGGAGG CCTGTTCTTC GATGGAGCTC ACGCCATGGC GGCAAACGGA GGCAATCCGG TGGGTCTGTA CACACCGTTT CCGTGGGAGG AGGGGAGCAG CGTTTCCCAC CTGGATGATA ATAATAGCGC TTATGCGGGA ATGATGATGC TGGCCGCCTC TGAGACGGGA CCTTATGCCC GGGACTACAG TGCGGTCGAG ATTGGCATGC TCCAGGATCT TGGATATACG GTAACGGCTG TGCCAGAGCC GGAGGTCTAC GCGATGATGC TGGCCGGCTT GGGGTTGCTG GGCTGGGGAA CGCGGCGCAA AAAGCGCCAT GACCAGTAA
|
Protein sequence | MKNQKQKAIA QATRAVVNGG LFAGCLSFSL IANAGINFQF VYKDAPGTGF LDPVNGASRQ AALNTAATEF SRMFGTHFAN SGTIVLEATA TNDPQSSTLA GAGSEYVVPP VPGFNLNEVV REKLQTGIDS NGSRPDGSLD INFGSKWELG FNTPVSSERY DFYSTMFHEF THTLGFSSSI GQFGDPIGGT KDAGSWSSFD SYLVNKSGTP VIDPATFALD QTVWDAGSVG GTSPSGGLFF DGAHAMAANG GNPVGLYTPF PWEEGSSVSH LDDNNSAYAG MMMLAASETG PYARDYSAVE IGMLQDLGYT VTAVPEPEVY AMMLAGLGLL GWGTRRKKRH DQ
|
| |