Gene Information |
Locus tag | Nmul_A2651 |
Symbol | |
ID | 3785262 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | + |
Start bp | 3040242 |
End bp | 3041543 |
Gene Length | 1302 bp |
Protein Length | 433 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 637812740 |
Product | hypothetical protein |
Protein accession | YP_413330 |
Protein GI | 82703764 |
COG category | [S] Function unknown |
COG ID | [COG5563] Predicted integral membrane proteins containing uncharacterized repeats |
TIGRFAM ID | [TIGR02595] PEP-CTERM putative exosortase interaction domain; [TIGR02913] probable extracellular repeat, HAF family |
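The coordinate and length fields above are related by simple arithmetic, which the following sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 3040242
end_bp = 3041543

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and encodes no residue.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1302
print(protein_length)  # 433
```

Both results agree with the Gene Length (1302 bp) and Protein Length (433 aa) fields.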
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAAAACCT TCGTTGATTT CAAAATCCGT GGCTTTATCC TCCTTGCAAC ATTTTTCACC GGCTTGGGTT TCGGTACCAG CGCCTTCGCT CAAACACTAC TACGTGAATA TCAATACCTC GTCGACCTCA GCAGCAGGAC AACAACCCGC CTTTATCAAT CCCCTTTCGG CGACGTTTTT TATCGTAACA TCAACGATTC GGGGCAGTTA GTGGGCAATT TTGGGGAAGC TCCTTTCCAT GCTTTCATCA CCGGCCCCAA CGGGATAGGT ATGAGAGACT TAGGCACCCT GGGGGATAAT CCGGCACGTT CGAATTCATC TGCCTTTGCC ATCAACAACT CAGGGCAGGT GGCAGGATTT TCTGATTCGA TTCGTGACCG TCTTCAGTTT GAGTCCCATG CTTTCATCAC GGGTCCTGAT GGGATGGGTA TGAGGAGTCT GGGCACCTTG GCCGGTAATC ACCCCGCTGC TTCCAGCAGT GCTTCTGGCG TCAATGAGGC TGGCCAGGTA GTTGGTGGCT CGGTTGTTGG TGCCTCTTAT CATGCTTTCA TCACGGGCCC CGGTGGGATA GGTATGAGGG ACTTAGGCAC CTTGGGTGGT ACTAACAGCC GTGCTTCTGG CATCAATGAG GCTGGCCAGG TAGTTGGTGG CTCGGTTGTT GGTGCCTCTT ATCATGCTTT CATCACGGGC CCCGGTGGGA TAGGTATGAG GGACTTAGGC ACCTTGGGTG GTACTAACAG CCGTGCTTCT GGCATCAACG AGGCCGGGCA GGTGATAGGG AACTCTCTCA CGGCTCAAAA CGTTTGGCAT GCTTTCATTA CGGGCCCGGA CGGGACGGGT ATGAAAGACC TGGGCACCCT GGGCGGTACT AGCAGCAGTG CTGTTGGCAT CAGCGATATC GGGCAGGTGG CGGGGAACGC TGACACGGCT GGAGGTGCCT CTCATGCTTT CGTCACCGGG GCGGATGGGA TAGGTATGAG GGACTTGGGC ACCTTGGGCG GAACTTCTAG CGAGGCGTAT GGCATCAACG AGGCCGGGCA AGTAATAGGG GGCTCTCTCA CGGCTGAAAA TGTTTGGCGT GCTTTCATCA CCGGCCCCGA GGGCGAAGGC ATGACAGACC TCAATTCACT GGTTGATATG CCGACTGGAG AAGTTCTACT CCAGGCTACT GCTATCAATA ACGCAGGGCA AGTCCTTGCA ATCGGACTAA TCCCTGAACC GGAAATCTAT GCCTTGATAC TCCCTGGGTT AGGGTTGGTC GGATTTATAG CGCGGCAAAA GAAGGCGAAG AAGCCTTGTT AA
|
Protein sequence | MKTFVDFKIR GFILLATFFT GLGFGTSAFA QTLLREYQYL VDLSSRTTTR LYQSPFGDVF YRNINDSGQL VGNFGEAPFH AFITGPNGIG MRDLGTLGDN PARSNSSAFA INNSGQVAGF SDSIRDRLQF ESHAFITGPD GMGMRSLGTL AGNHPAASSS ASGVNEAGQV VGGSVVGASY HAFITGPGGI GMRDLGTLGG TNSRASGINE AGQVVGGSVV GASYHAFITG PGGIGMRDLG TLGGTNSRAS GINEAGQVIG NSLTAQNVWH AFITGPDGTG MKDLGTLGGT SSSAVGISDI GQVAGNADTA GGASHAFVTG ADGIGMRDLG TLGGTSSEAY GINEAGQVIG GSLTAENVWR AFITGPEGEG MTDLNSLVDM PTGEVLLQAT AINNAGQVLA IGLIPEPEIY ALILPGLGLV GFIARQKKAK KPC
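The protein sequence follows from the gene sequence under translation table 11 (the bacterial, archaeal and plant plastid code). The sketch below is a minimal illustration, not the tool used to produce this record: `translate`, `BASES`, `AMINO_ACIDS` and `START_CODONS` are hypothetical names, the input is assumed to begin at the first base of the CDS and to contain only A, C, G and T, and an alternative start codon such as GTG is read as M at the first position only.

```python
# Codon table in the compact NCBI layout: bases ordered T, C, A, G.
# Table 11 uses the same amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Initiation codons permitted by table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna: str) -> str:
    """Translate a CDS (or its 5' end) with table 11, stopping at a stop codon."""
    # The record prints sequences in space-separated blocks of 10, so strip whitespace.
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        # str.index raises ValueError on any character outside A, C, G, T.
        index = (16 * BASES.index(codon[0])
                 + 4 * BASES.index(codon[1])
                 + BASES.index(codon[2]))
        residue = AMINO_ACIDS[index]
        if i == 0 and codon in START_CODONS:
            residue = "M"  # an initiation codon is read as methionine
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("GTGAAAACCT TCGTTGATTT CAAAATCCGT"))  # MKTFVDFKIR

# Last 15 bases of the gene sequence above (in frame, ending with the TAA stop).
print(translate("AAGAAGCCTTGTTAA"))  # KKPC
```

The two fragments reproduce the first ten and last four residues of the protein sequence. Because the start-codon rule is applied to the first codon of whatever string is passed in, an interior fragment that happens to begin with one of the listed start codons would be reported with a leading M.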