Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A1902 |
Symbol | |
ID | 3784274 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | + |
Start bp | 2189733 |
End bp | 2192102 |
Gene Length | 2370 bp |
Protein Length | 789 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637811988 |
Product | type II and III secretion system protein |
Protein accession | YP_412589 |
Protein GI | 82703023 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG1450] Type II secretory pathway, component PulD |
TIGRFAM ID | [TIGR02517] general secretion pathway protein D |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0584402 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGCTACA GAGGTAAAAT CGTGCGGCAT CGCTCGCTAT GTGCAGTTTT TTTATGGATG GCAGTGGCAG GCTGGGTTAC CGAGGCTTGG GCCGTAAATC CCTCTGCCAC CCTGCCCGTC GAGGAAATTG GCGTCCCGCC GAGCGTCCTG CCGGAAGCGG ATTCCCTGCG CGCCAGTAAA TCCAGGTCTT CTGCCGGTCA GGATCTCGTT ACACTCAATT TCGTCAATGC CGATATTGAA GGGGTAGTGA AAGCAGTCAG CGAAATTACC CGAAAAAATT TCATGCTCGA CCCCCGCGTC AAGGGTACGA TCAACATCGT TTCAGCCAAG CCAGTGCCGA GGTCGTCGGT CTACGAAGTA TTCCTGTCGG CACTGCGATT GCATGGATAT GCAGTCGTTG AGGATTACGG CATCATCAGG ATCGTTCCGG AAAGTGATGC CAAGCTATAC CAGGGCCCGA CACTTGGTCC CACGAACAAG CGGCAGCTCG CGGGCGACCG TATCCAGACA CAGGTATTCA CGCTGCAGTA CGAATCCGCG GTGCAGATGG TGCCGATCCT GCGTCCGCTG ATCGCTCCAA ACAACAGTAT CACTGCAAAT CCCAACAGTA ACACCCTGGT TATTACAGAC TACGCGAGCA ATCTCCAACG CCTGGCGAAA ATAATTGATT CGGTGGATCA GCCAAGTGGA ACCGAGCCTG TCTCGATACC CCTTCAGCAC GCCTCGGCGA TCGATGTCGC GCAAACCGTG AACCGGCTGT TTTCAGAATC GACGCAGTCC CAGGCCGAGG GCGCCGCGGA CCCCACCCAG CAGCGTTTCA CAGTCGTCGC CGACGCCCGC TCGAATACCC TCCTTGCACG CTCCGGAAAC CGGGCAGCGC TTGCGCGTCT GCGCCAGCTG GTAACAGTGC TCGATTCTCC CACCAGCGCT GCCGGCAACA TGCACGTCGT CTTTCTCAAG AATGCCGATG CAGTCAGGCT TGCCGAAACC CTCAGGGCGA TCTATCACAA CATGGCGTCC CCGGTTTCCT CATCTTCGGG ACTGAGCCAG GGCACCGGCA CAGCTTTTGG AACATCTTCC CTGGGTACAT CCACCGGCGG GGGGATGGGT GCCTCGTCAG GCACCTCAAC AGGGGGGTCG ATGGGCACTT CCATGCCCGG TTCCAGCCTT GGTGCGGGGA CTGTTCCCGC TGCTTCCACC GTCACCCCGG CTCCGATGCA AACTGGCGCA ACTTCCGCCA CCCCCGGCAT CATTCAGGCG GATGCAGCCA CCAACTCGAT CATTATTACT GCCCCGGATG CTATTTATAA TAATTTGCGC GCGGTGGTGG AGAAGCTCGA CGTGCGCCGC GTGCAGGTTT ATATTGAAGC GCTGATTGCC GAAATCACTG CCGACAGAGC CGCGGAATTC GGCATCCAGT GGCAGAATCT GAGCAATGCC GCGCAAGGTG GCACCCAGGT TTTCGGTGGC ACCAACTTCA ATGCCGGCAC TGCCGGAGGC GGCAGTATCA TCTCCACCGC CCAGAATCCG ATAGCGAATG CAGCCTCCGG TCTGACTATC GGCATCATGA ATGGCCTTGT CACGGCTATT CCCGGCATCG GCCCTGTTCT CAACATTCAT ACGCTCATCC GCGCGCTGGA AACGGATGCC AATGCCAACA TTCTTTCCAC CCCCACCCTG CTGACACTGA ATAACGAAGA AGCCAGGATC ATCATCGGGC AGAACGTTCC GATTCCCACC GGCCAATTCA TTCCGCCAGT AGGAGGCGCC GTTACCTCCC CGTTTCAAAC CGTTTCACGC CAGGACGTGG GACTATCATT GAAGATCAAG CCCCTTATCT CGGAAGGCAA TACTGTCCGT GTGCAGATTT TTCAGGAAGT CTCGAGCGTC GTTCCTGGCA CGGTCAACGC CACCAACGGG TTGATTACCA ACAAACGCTC GATAGAATCG ACAGTGCTGG TTGACGACGG GCAGATTCTC GTGCTCGGCG GTCTGATGCA GGATTCGGTA AATGACTCGG TTGAAAGAAT TCCACTGGTC GGGGCGATTC CGCTGTTCGG ACAATTGTTC AGTTACAACA AGCGCTCACG CAACAAAACC AATCTGATGG TATTCCTGCG GCCGACGCTG ATGCGCGCGG GCGACGCCGC CGATCCGCTT TCTGACGCAC AGTACGATCG GGTGCTGGGC GAACAGAAAA AAGTGAGACC CAAGTTTAAT CTTGTGCTTC CGGATATGGA ATCGCCTACT TTGCCGCCGC GTCAACCACC TCCTGTCATC CTTGATGACA GCATCACTCC CGATGATCCC GGAATTTCCA ATGTTCAAGG CAACTGGGAT ACCGGGGGAG TGATGGATAA TACACCCTGA
|
Protein sequence | MGYRGKIVRH RSLCAVFLWM AVAGWVTEAW AVNPSATLPV EEIGVPPSVL PEADSLRASK SRSSAGQDLV TLNFVNADIE GVVKAVSEIT RKNFMLDPRV KGTINIVSAK PVPRSSVYEV FLSALRLHGY AVVEDYGIIR IVPESDAKLY QGPTLGPTNK RQLAGDRIQT QVFTLQYESA VQMVPILRPL IAPNNSITAN PNSNTLVITD YASNLQRLAK IIDSVDQPSG TEPVSIPLQH ASAIDVAQTV NRLFSESTQS QAEGAADPTQ QRFTVVADAR SNTLLARSGN RAALARLRQL VTVLDSPTSA AGNMHVVFLK NADAVRLAET LRAIYHNMAS PVSSSSGLSQ GTGTAFGTSS LGTSTGGGMG ASSGTSTGGS MGTSMPGSSL GAGTVPAAST VTPAPMQTGA TSATPGIIQA DAATNSIIIT APDAIYNNLR AVVEKLDVRR VQVYIEALIA EITADRAAEF GIQWQNLSNA AQGGTQVFGG TNFNAGTAGG GSIISTAQNP IANAASGLTI GIMNGLVTAI PGIGPVLNIH TLIRALETDA NANILSTPTL LTLNNEEARI IIGQNVPIPT GQFIPPVGGA VTSPFQTVSR QDVGLSLKIK PLISEGNTVR VQIFQEVSSV VPGTVNATNG LITNKRSIES TVLVDDGQIL VLGGLMQDSV NDSVERIPLV GAIPLFGQLF SYNKRSRNKT NLMVFLRPTL MRAGDAADPL SDAQYDRVLG EQKKVRPKFN LVLPDMESPT LPPRQPPPVI LDDSITPDDP GISNVQGNWD TGGVMDNTP
|
| |