Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A0376 |
Symbol | |
ID | 3784071 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | - |
Start bp | 410657 |
End bp | 413446 |
Gene Length | 2790 bp |
Protein Length | 929 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 637810452 |
Product | tetratricopeptide TPR_4 |
Protein accession | YP_411076 |
Protein GI | 82701510 |
COG category | [N] Cell motility [R] General function prediction only [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3063] Tfp pilus assembly protein PilF [COG4783] Putative Zn-dependent protease, contains TPR repeats |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type [TIGR02917] putative PEP-CTERM system TPR-repeat lipoprotein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCGCTT TTAGAAAGAC TGTTTTAAAA AAGACTGGGC TGACGCTTGC AATCGCTGCC GCCCTCACCG TGAGCAGCAT GCAAGGCTGT AACAAAGCGG TGGATCCGGT GAAACTGATG GCGGATGCGA AGCGCTACGA AGAAAGCGGG GATCACAAGT CAGCCATCAT CCAGCTAAAA AATGCCTTGC AGCAGAACCC GGATAATTCG GAAGCGCGAT ATCTGCTTGG CGCAATATAC AACAAGACTG GGGATTTTCA GTCCGCGGAG AAGGAGCTTC GCAAGGCGTT GAGCTTGGGA ATGGATGCGG GCAAGGTGCT GCCTGAACTC GGACAAGCCC TGCTCAGGCT AGGCGCCTAT CAACAAATCC TGGATGAAAC GAAAGAACTG GCCGACAAAA CAAAGTCGGC GCAGATACTT ACGCTCCGGG GGAATGCACA GCTCGGATTG GGCAAGACCG CGGAAGCAAA GGCACTGTTC GAGCAAGCGC TCGGACACAA TCCCGATTCT GCCGACGCAC TGATCGGCTT GGCGAAATAT TCCCTGGTTC AAAGGGATGT GGAAGGCGCA ACCCATTTCT CGGAGCAGGC GGTTTCCCGG AGCCCACAAA ATGTCGAGGC CTGGTTGTTC AGGGGCGACC TTCTGCGGAT GCAAGGCAAA TCCGGGGAAG CTCTGGCGGC GTATGACCAG GTGGTAAAGC TGAAACCGAA CGCAGCCATT GCCTATATCA ACAAAGCATT CATCGAAATT GGAACGGGCA AGTTCGAGGC GGCAAAGGCA GATATCGACG CGGCACGGAA GATCAGTCCC AGCTTGATGG TGTTTTATAC CCAGGCTTTG CTCGATTTCA GCCAGCAAAA GCCCGCTGCC GCGCTGGAAT CGCTCCAGCA GGTACTCAGC AAGTCGCCAG ATCACATGCC CAGCGTGTTG CTCGCGGGCG CAGTCCAGTT TGCTCTCGGA TCCATGCCCC AGGCGGAACA ACACCTGAAA CATTATCTGG AGAAAGATCC CGGAAACATC TACGCGCGCA AGCTGCTTGC TTCCGCGCTG TTAAAAAATG GCGAGACGAA ACGCGCGATC GACATCCTGA CCCCACCGCT CAAAAATGTG AAAGAGGATC CCCAGTTATT CGCCTTGGCT GGGGAAGCTT ACATGCAAGC CAAGGATTTT GCGAAAGCCA CGGAATACTT TGAAATGGCC AGCGATATCG CACCGAGAAG TGCGATGCTT CATACCGCTT TAAGCATGAG CAGGTTGGGG CAGGGGGAAA ACGCCCGGGC CATCTCCGAA CTGGAAACAG CGACAAAACT CGACCCCAAA TCGCCGCGGG CAGGAGTGTT GCTGGTCATG ACCCATTTAC GCCTCAAAGA ATTCGACAAG GCGCTCGCCG CGGTAAAGGC ATTGGAGAAG GAGAATCCTG ATAATCCCCT CATTCAAAAC CTGAAAGGCG GCGTGTATCT TGGCAAGAAC GATATAGCGA ATGCAAGAGC GAGCTTTGAG AAAGCGCTTG CTATTCAACC AAACTATTTT CCGACTATAG CAAATCTCGC ACGACTCGAC ATTCAGGATA AAAAACCGGA TGCCGCAAGA AAACGCTTTG AAGCAATTCT GGAAAAGGAC AAGAAGAATA TCCAGGCCAT GGTCGCGCTG GCGGGTCTTG CTGTCAACCA GGGACAGAAC CAGAAGGCTA CCGAGTGGCT GGAGCGTGCA ATGCAAGCAA ATCCGGATGT CCTTCAGCCG GCCATCCTGC TTGGAACACA TTATCTGCGT TTGGGCGAAA AGCAGAAGGC CCTGGCTCTC GCCAAGAAAC TGGAGGGGAC ACATTCCAAG GACTCCGCTG TGCTGGATTT ACTGGCACAG GCTCAACTAG CTAACAATGA CAAATCGAGT GCCTTGGAGA GCTACGCCCG GCTCGCTGTC GTACAACCGG AATCCCCGCT TGTTCAATTC CGCATCGCCT CCATTCATAT GGCAATGCGG AACCTCTCCG CCGCATCAGA TGCGCTGAAA AAATCGCTGG CTATAAAACC CGATTATCTG GCTGCGCAAT TGGCGCAGAT CGACATTGAA GCAGAACAGG GTAATTATGA GAAAGCAATC GCGATGGCGC GCCAAGTTCA GAAGCAGCAT AAAGGGTCAC CCGCAGGTTA TATAGCAGAG GGCGATCTGT TGCTGAAACA GAAAAAACCC GCGCTTGCCG CAAGCGCCTA TGAACAGGCA TTCGCGGTTA ATAAAACCTC GCCCTTGTTG ATCAAGCTGC ACGCATCGCT CAGGCAGGCT GGAAAAGATA AGGAGGCAGA CCTTCGCTTG ATCGAATGGC TGAAAAAACA CCCGAATGAT TTATCTGCTC GAATGTACCT CGCGGATACG TTTCTCACAG AGGGAAAACT GGCTGCTGCA GTGGAGCAAT ATCAAACTGT CCTGAAAGAA CAACCGAAAT TTGCTCCGGC GCTTAATAAC CTGGCTACCG CTTATCAGCG TCAAAAGGAC CCCAGGGCCT TGGAATACGC GGAAAAGGCA TACCAGCTTG CTACGGAAAA CCCGGCAGTA CTGGATACGC TGGGTTGGGT ACTGCTGGAG CAGGAAAATA TCGCGCGAGC TCTACCGCTT TTACAGAAAG CCGCTTCTTT GGCGCCTCAA GCAGGGGAAA TCCGCTACCA TTTTGCATCC GCACTGGTTA AATCTGGCAA TAAAAGCCAG GCGCGTAAAG AGCTTGAGCA AATTCTGGCT ACTGGAAAAA CCTTTTCAGG CATAGATGAA GCCAGAGCTC TTCTTGAGCG GATACAGTAG
|
Protein sequence | MRAFRKTVLK KTGLTLAIAA ALTVSSMQGC NKAVDPVKLM ADAKRYEESG DHKSAIIQLK NALQQNPDNS EARYLLGAIY NKTGDFQSAE KELRKALSLG MDAGKVLPEL GQALLRLGAY QQILDETKEL ADKTKSAQIL TLRGNAQLGL GKTAEAKALF EQALGHNPDS ADALIGLAKY SLVQRDVEGA THFSEQAVSR SPQNVEAWLF RGDLLRMQGK SGEALAAYDQ VVKLKPNAAI AYINKAFIEI GTGKFEAAKA DIDAARKISP SLMVFYTQAL LDFSQQKPAA ALESLQQVLS KSPDHMPSVL LAGAVQFALG SMPQAEQHLK HYLEKDPGNI YARKLLASAL LKNGETKRAI DILTPPLKNV KEDPQLFALA GEAYMQAKDF AKATEYFEMA SDIAPRSAML HTALSMSRLG QGENARAISE LETATKLDPK SPRAGVLLVM THLRLKEFDK ALAAVKALEK ENPDNPLIQN LKGGVYLGKN DIANARASFE KALAIQPNYF PTIANLARLD IQDKKPDAAR KRFEAILEKD KKNIQAMVAL AGLAVNQGQN QKATEWLERA MQANPDVLQP AILLGTHYLR LGEKQKALAL AKKLEGTHSK DSAVLDLLAQ AQLANNDKSS ALESYARLAV VQPESPLVQF RIASIHMAMR NLSAASDALK KSLAIKPDYL AAQLAQIDIE AEQGNYEKAI AMARQVQKQH KGSPAGYIAE GDLLLKQKKP ALAASAYEQA FAVNKTSPLL IKLHASLRQA GKDKEADLRL IEWLKKHPND LSARMYLADT FLTEGKLAAA VEQYQTVLKE QPKFAPALNN LATAYQRQKD PRALEYAEKA YQLATENPAV LDTLGWVLLE QENIARALPL LQKAASLAPQ AGEIRYHFAS ALVKSGNKSQ ARKELEQILA TGKTFSGIDE ARALLERIQ
|
| |