Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A2583 |
Symbol | |
ID | 3785464 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | - |
Start bp | 2968191 |
End bp | 2970440 |
Gene Length | 2250 bp |
Protein Length | 749 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637812674 |
Product | type II and III secretion system protein |
Protein accession | YP_413264 |
Protein GI | 82703698 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG4796] Type II secretory pathway, component HofQ |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAACTG GCAAACACGT TCTGCTTGCC CTGGTTCTTG CCGGAATTGC AGGATGCGCG ACAAACAATG CGTTCGTGGA GGGCAAGCGG CTCATAGCCG AGGGCAAGCT CGACACTGGG CTTGCCAGTC TCGAACAGGC GGCGCGCGAA AATCCGGATA ATCTTGAAAT TCGCGCGGTG CTCGCGCGCC AGCGCGAAGC GATAGCCGCG CGTTTCGTAT TCGAGGCGGA AGCCGCAAGG TCGACAGGTG ATCTGGAAGC GGCTGAACGG AGTTTTCGCC GGGCGCTTGA AATAAATCCA CGTCATGAAC GCGCACTCGC CGGGTTGGAG GCGATCAATA TGGATCGCCG CCATACAGCG GCAATAAAAC GAGCGGAAGA ATTGCTGGAG CGGGGGGAGT ATGCAGCTGC TTCAAGCGAG GTCCGCTCCG TGCTGCAGGA AAATCCGATG CAGCGGGATG CACGCAGGCT CATTCAGCGC ATCACCGAAA TGGAAGTGCA AGCCGCGCAA GCGGGCCCTA TCCTGAAAAC CGCGTTTAAA AAACCTATCA CGCTCGAGTT TCGAGATACG GGCCTGAAAT CGGTATTCGA AATCCTGGCG CGCACCGGCG GCATCAACGT GGTCTTTGAC AAGGACGTCA AGCAGGACAG CAAGACGACG ATTTTCGTGC GCGAGACCAA TATCGAGGAT GTATTCAAGC TCCTTCTGGT GACGAACCAG CTCGCACACA AGGTCCTCAA CGAAAATTCG GTGCTGATCT ATCCGAATAC TCCGGCCAAG CAGAAGGAAT ACCAGGAATT GATGGTGCGC AGTTTCTTTG TCACGAATAC CGATGTGAAG CAGATGGTCG CGATGGTCAA GGGGTTGATC AAAACCAAGG ACATGCACGT CGACGAAAAG CTGAACCTGT TCGTCATGAA AGACACGCCG GAGGCGATCC GGCTGGCCGA GCGGCTGGTT ACGCTGAATG ATCTGGCCGA TCCCGAAGTC ATGCTGGAAG TGGAGGTGCT GGAAATCGGG CGCAACAAAC TGCTCAACCT CGGATTGCAG TATCCTGAGA AGATCAATGT AAATTATCTT ACCGCAGCCA GTGCTGCCGG CGCGGCTCCT TTTCCGCCCT TTCAGATCAG CCGTGCAGGG GTTAACGGCA ATGGTTCCAA TCTCGACAAC CTGATTGGAT TCGTTGCCAA TCCCGCCCTG ATCGTCAACC TCAAGCAGCA GGACGGGCTG ATCAATGTGC TTGCCAATCC GCGCATTCGC GTCAAGAACC GGGAGAAAGC GAAGGTCCTG ATCGGCGACA AGGTTCCAGT CGTCACGACG ACCGCAGCCG CCAACGTGGG TGTGGCATCA TCGGTCAGCT ACCTCGATGT GGGACTCAAG CTGGACGTCG AATCCACCAT CTCGCTGCAG GATGAGGTTT CCATGAAGGT CAGCCTGGAA GTGAGCAATA TCGTCAAGGA AGTGCCGGTA ACGGGAGGGG GCCTTGCCTA TCAGGTAGGG ACGCGCACGG CAGCGACCAC GCTGGCGTTG AAGGATGGCG AAACCCAGGT TCTGGCAGGA TTGATCAGCG ATGACGAGCG CACCACGCTT TCCAAGGTTC CCGGGCTGGC AGAGTTGCCG TGGATCGGAA AGCTTTTCAT CAATAAGAAC GTTGTCCGAA ACAAGACTGA AATCGCGCTG CTGATTACGC CGCGTATCGT GCGCAATATT GCCCGTCCGG CGAAAGCGGT CAGCGAAGTG CCCTTCGGTA CGGAGAATGC GATAGCGGTT TCGCCGCTGA TGATCGGCAA AGTGGCGCCG CGGGCGCTGG CGATGGCTTC TTCCTCTCCA ACCTCCGATG CTAACGCCAG CCGGCAGTGG CCATCCGCGC CTTCCAGGGA GGAAACCCGT CCGCCGGCGC CGCCCGAGGG GGCACCGGTT GCGACACTGG CTGCGCCGGA ACAGGTGTCC GCAGGACAGG AATTTGTGGT GAGCGTGACC CTCGCCGGTG CAGATGCACT CCCGCCCGCG GAACTGGATC TGAATTATGA CCCTGTGGCG CTTGAACCGG TGGACGAGGG TGATAAATCC GGAACACGCG TATTGAAACT GAGCAAGGGG GGCGGCGCCG CGGATGTGCG ATTCAGGATT TTGGCGCAAA AGCCTGTCAC GACTCAGATC AGTATCGGGA ATATCAGTTT CAAGGACGAA AGCGGGCCGC TTCCCGTTCC GGTGCCTTTG CCACCGGCTG TCAATGTGGA TATTCGTTGA
|
Protein sequence | MKTGKHVLLA LVLAGIAGCA TNNAFVEGKR LIAEGKLDTG LASLEQAARE NPDNLEIRAV LARQREAIAA RFVFEAEAAR STGDLEAAER SFRRALEINP RHERALAGLE AINMDRRHTA AIKRAEELLE RGEYAAASSE VRSVLQENPM QRDARRLIQR ITEMEVQAAQ AGPILKTAFK KPITLEFRDT GLKSVFEILA RTGGINVVFD KDVKQDSKTT IFVRETNIED VFKLLLVTNQ LAHKVLNENS VLIYPNTPAK QKEYQELMVR SFFVTNTDVK QMVAMVKGLI KTKDMHVDEK LNLFVMKDTP EAIRLAERLV TLNDLADPEV MLEVEVLEIG RNKLLNLGLQ YPEKINVNYL TAASAAGAAP FPPFQISRAG VNGNGSNLDN LIGFVANPAL IVNLKQQDGL INVLANPRIR VKNREKAKVL IGDKVPVVTT TAAANVGVAS SVSYLDVGLK LDVESTISLQ DEVSMKVSLE VSNIVKEVPV TGGGLAYQVG TRTAATTLAL KDGETQVLAG LISDDERTTL SKVPGLAELP WIGKLFINKN VVRNKTEIAL LITPRIVRNI ARPAKAVSEV PFGTENAIAV SPLMIGKVAP RALAMASSSP TSDANASRQW PSAPSREETR PPAPPEGAPV ATLAAPEQVS AGQEFVVSVT LAGADALPPA ELDLNYDPVA LEPVDEGDKS GTRVLKLSKG GGAADVRFRI LAQKPVTTQI SIGNISFKDE SGPLPVPVPL PPAVNVDIR
|
| |