Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A1903 |
Symbol | |
ID | 3784275 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | + |
Start bp | 2192118 |
End bp | 2193581 |
Gene Length | 1464 bp |
Protein Length | 487 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637811989 |
Product | type II secretion system protein E |
Protein accession | YP_412590 |
Protein GI | 82703024 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB |
TIGRFAM ID | [TIGR02533] general secretory pathway protein E |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGTCTG ACAGGATTCC TTATGTCTTT GTCAAGACAA ATGGAGTGGC CGTGACGAGC GTCACGAGCG ATCACGCAGA GGTGGTGGTG CGCGGCGAAG TCCAGGCCGG GGCACTGGCG GAGTTGCGTC GCGTTCTGGG CGTGCCTCTG CGGGCGCGAC GCCTCGCTAC CGACGAGTTC AACGAGATCG TTGCGGTGCT GTATAACGGC GCAAACGAAG GCGCCGCCGC ACTCGCAGAT GATCTCGCAC AGGATATAGA CCTCTCGAGA TTACTGCAGG AACTGCCGAA GGTCGAGGAC CTGCTCGAAA GCCAGGATGA TGCTCCTGTC ATCCGGCTGA TCAACGCGCT CTTTACACAG GCGTTACGCA CCGCGGCTTC CGATATCCAC ATTGAGCCAT ATGAAACACG CTCGGTTGTA CGGTTGCGAG TGGACGGCAC ATTGCGCGAC CTGATCGAAC CGGCGCGCGC ATTACATGCC GCCCTCATCT CACGCATCAA GATCATGGCG CAGCTCGACA TTGCGGAAAA ACGCCTTCCG CAGGACGGCC GGATTACATT GAGGATGGCA GGCAGGCCAG TCGACGTGCG CGTATCCACC ATCCCCACCG CACACGGCGA ACGCGCCGTA TTGCGTTTGC TGGACAAGCA GGCTGGCCGC CTGGACCTCC CTCGGCTTGG CATGGATGAA ATCACCTTGA CTCGCATGGA CAGGCTTATT CGCGAGCCCC ATGGCATTAT TCTCGTAACC GGCCCCACCG GATCGGGTAA AACAACCACG CTTTACGCCG CCCTGTCGAG GCTGGATTCC GCGTCGCTCA ACATCATGAC GGTTGAGGAC CCCATCGAGT ACGATCTTGA CGGCATCAGT CAGACTCAGG TCAATCCGCG AATCGAAATG ACGTTTGCGC GCGCCTTGCG GACAATCCTG CGGCAAGACC CGGATGTCAT CATGATTGGA GAGATTCGCG ACCTCGAAAC CGCACAGATC GCGGTGCAGG CCAGCCTTAC GGGCCATCTG GTATTTGCGA CTCTGCATAC CAATGATGCG ATAAGCGCCG TGACCCGGCT TGTCGACATG GGAGTCGAGC CGTTTCTGCT GGCATCGAGT CTCATCGGCG TAGGTGCGCA GCGTCTGGTG CGGCGGCTCT GTCTGGAATG CCGCCAGCCC TGGGACGAGG CCATGGGAAA ATCCCCGAGC TCTTTTTCGG CTTCCGGAAT TTTATACAAG GCGCAGGGCT GTGCGGCATG CAATCACTCC GGCTATCAGG GACGCACCGG GATTTATGAG TTGCTCGCGG TTGACAACGA CCTGCGCCGG AGAGTTCATG ATCGCGCTTC CGAACAAGAC CTGCGAGAAT ATGTGATTTC CGCCGGAATG CGCTCGTTAC GTGACGACGG CATGCGCCTC GCTACCCAGG GCATCACCAG CCTGGAGGAA GTCGTGCGTG TAACACGCGA ATAG
|
Protein sequence | MASDRIPYVF VKTNGVAVTS VTSDHAEVVV RGEVQAGALA ELRRVLGVPL RARRLATDEF NEIVAVLYNG ANEGAAALAD DLAQDIDLSR LLQELPKVED LLESQDDAPV IRLINALFTQ ALRTAASDIH IEPYETRSVV RLRVDGTLRD LIEPARALHA ALISRIKIMA QLDIAEKRLP QDGRITLRMA GRPVDVRVST IPTAHGERAV LRLLDKQAGR LDLPRLGMDE ITLTRMDRLI REPHGIILVT GPTGSGKTTT LYAALSRLDS ASLNIMTVED PIEYDLDGIS QTQVNPRIEM TFARALRTIL RQDPDVIMIG EIRDLETAQI AVQASLTGHL VFATLHTNDA ISAVTRLVDM GVEPFLLASS LIGVGAQRLV RRLCLECRQP WDEAMGKSPS SFSASGILYK AQGCAACNHS GYQGRTGIYE LLAVDNDLRR RVHDRASEQD LREYVISAGM RSLRDDGMRL ATQGITSLEE VVRVTRE
|
| |