Gene Information |
Locus tag | Nmul_A1904 |
Symbol | |
ID | 3784142 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | + |
Start bp | 2193591 |
End bp | 2194796 |
Gene Length | 1206 bp |
Protein Length | 401 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637811990 |
Product | general secretion pathway protein F |
Protein accession | YP_412591 |
Protein GI | 82703025 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG1459] Type II secretory pathway, component PulF |
TIGRFAM ID | [TIGR02120] general secretion pathway protein F |
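The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 2193591
end_bp = 2194796

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is an unspliced open reading frame whose last codon
# is a stop codon, which does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(f"Gene length: {gene_length} bp")
print(f"Protein length: {protein_length} aa")
```

Under these assumptions the computed values agree with the Gene Length (1206 bp) and Protein Length (401 aa) fields.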
| |
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAGCGT ACCGCTACGA AGCGCTTGAT CCCGAAGGCC GCAAGGTAAC GGGTGTGCTG CAGGCTGATA CCGCGCGCCA GGCGCGGGCA CAGTTGCGCG CGCAAGGCTT GCTGCCTTCG ACCGTCGATC AGGTCCGCGC TCGCGAGGGC GGTCAGGCAC CCTGGGTCCG AGGCCTGCGT CCGGAGGAGC TGAATTTGCT GACACGGCAG ATGGCTACTT TGCTGACCGC CGGTCTGACA GTCGAGCAGT CGCTTGCTGC ACTGATCGAA TCCGCCGAGG AGCCGATGAC ACGCGAAGTG CTCGGCGGTG TCAAAACAGA GGTGATCGCG GGGCTTTCCC TCTCCGCCGC GCTGGGCAGC TACAGCAGGA GTTTTCCCGA TTTCTATCGG GCGCTGGTGC ATGGTGGGGA AGAATCGGGC ACATTGCCGC TGGTACTGCG TCATCTTGCC GAGTATCTCG ATGCACGTCA GACACTAAAA CAGAAAACCA GTCTTGCGCT CCTTTACCCG GCACTGGTGA CCATCATTGC CATTATCATT GTTGCCGGCC TGCTCATGTA TGTGGTTCCC CAGGTAGTGC AGGTATTCCA GCATTCTCGC CAGAGCCTGC CCCTCCTGAC TCGCGCACTG ATCGGGTTGA GTGATTTCCT TCTCATGTCA TGGCCTTATC TGATTATTGC CATTGTCGGC GGGGCACTTT CCGCACGCGT CGCGCTACGG CATGAGAACA TCAGATACCG ATGGCACGCT CTGCTGCTGC GCACCGCATG GCTGGGATCG TTGATTCGCA GCAGCAACAC ATCCCGTTTC GCCAGCACGC TTTCCATTCT GGTCGGAGGA GGTGTGCCGC TTCTTAAAGC TCTCAGCTCC GGTGCCCGCG TGATGAGCAG CATGGTCATG CGTAAAGCGA TCGAAAATAC CATCGAACAG GTCCGCGAAG GCGCGAGCCT CTCCAGAGCG CTGCGGGAAA CCCGCGTGTT TCCGCCCCTG CTTGTGCATC TGGTGGCAAG CGGGGAAATG AGCGGCAAGC TGAAAGAAAT GCTCGAACGC GCCGCCCAGC TCGAAGCCCA GGCGCTGGAA CGGCGACTGG GCGTCTTTTT AACGCTGCTG GAACCAGTAA TGATCCTGGT AATGGGGGGC GTGGTGCTGA TGATCGTGCT TGCCATACTG CTCCCCATCA TGGAAATCAA CCAGCTGGTG CATTAG
Protein sequence | MEAYRYEALD PEGRKVTGVL QADTARQARA QLRAQGLLPS TVDQVRAREG GQAPWVRGLR PEELNLLTRQ MATLLTAGLT VEQSLAALIE SAEEPMTREV LGGVKTEVIA GLSLSAALGS YSRSFPDFYR ALVHGGEESG TLPLVLRHLA EYLDARQTLK QKTSLALLYP ALVTIIAIII VAGLLMYVVP QVVQVFQHSR QSLPLLTRAL IGLSDFLLMS WPYLIIAIVG GALSARVALR HENIRYRWHA LLLRTAWLGS LIRSSNTSRF ASTLSILVGG GVPLLKALSS GARVMSSMVM RKAIENTIEQ VREGASLSRA LRETRVFPPL LVHLVASGEM SGKLKEMLER AAQLEAQALE RRLGVFLTLL EPVMILVMGG VVLMIVLAIL LPIMEINQLV H
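The relationship between the gene and protein sequences can be illustrated with a short translation sketch. It removes the spaces that separate the 10-character display blocks and translates codon by codon. The codon assignments used here are the standard ones, which translation table 11 shares with the standard code (the two differ in permitted start codons). The helper names `clean` and `translate` are hypothetical, and only the first three blocks of the gene sequence are used to keep the example short.

```python
# Standard codon-to-amino-acid assignments, in NCBI "TCAG" order.
# Translation table 11 uses these same assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between display blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence from its first base, stopping at a stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence listed above.
gene_prefix = "ATGGAAGCGT ACCGCTACGA AGCGCTTGAT"

print(translate(gene_prefix))  # prints MEAYRYEALD
```

The output matches the first block of the protein sequence in the record. Passing the full gene sequence to `translate` in the same way would allow the complete protein sequence to be checked.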
| |