Gene Information |
Locus tag | Nmul_A2599 |
Symbol | |
ID | 3785480 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | + |
Start bp | 2984187 |
End bp | 2987309 |
Gene Length | 3123 bp |
Protein Length | 1040 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637812688 |
Product | DNA polymerase III, alpha subunit |
Protein accession | YP_413278 |
Protein GI | 82703712 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0587] DNA polymerase III, alpha subunit |
TIGRFAM ID | [TIGR00594] DNA-directed DNA polymerase III (polc) |
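
The coordinate and length fields above are mutually consistent, and the relationship can be sketched as follows. This is only an illustrative check, assuming the start and end positions are 1-based and inclusive and that the CDS length includes the stop codon (the listed gene sequence ends in TAG). The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(2984187, 2987309)
print(length)                     # 3123
print(protein_length_aa(length))  # 1040
```

Both results match the Gene Length and Protein Length fields of the record.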
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCTCTT CTGCTCCGGA TTACGCTGAA CTTCATTGCT GTTCCACCTT CAACTTCCTC AAAGGCGCTT CCATGCCGGA AGAGCTGGTG GAACGTGCCG CCACCTTGGG TTACAGCGCC CTCGCTATCA CGGATGAGTG CTCTATGGCG GGAGTGGTGA GAGCGCATGC GGAAGCCAAA AAAACGGGAT TGCATCTTCT GATCGGCAGC GAATTCGTGC TGGAGGATGG GCTGCGGCTG GTGGGTCTCG CCATGAACCG GGAAGGCTAT GGCAATCTGT GCGAACTGAT CACGCTGGCA AGGCAGCGTA GCGCGAAAGG GACTTATCGC ATCACTCGCG CGGACTTTGA AAGCTGTACG GATGCACCTC ATCTTGCCCG CTTGCCCGAT TGCCTGATCC TGCTGATTCC TCGCCGGAAT CAAGCAGACG GTTCCTGGGT CGAGGATATG CGCTGGGTGC GGTCGCTGTT TCCCGGCCGC GGCTGGATCT CTGTCGAACG GCTGCTTCAT GCCGGGGAAG AGGTGTGGTT CGAAAGAATA CGCCACGCGG CGGAAACCAT TGGCATGCCG CTGGTGGCGG CAGGCGATGT GCATATGCAC GATCGTGCCC ACAAGCCCTT GCAGGATACG ATGACCGCAA TTCGGCTGAG CCAACCGCTA GCGGCATGCG GTTATGCGTT GCAGCCTAAT GCAGAACAGC ATCTGCGCAC CCGATTGGAG CTGGCGCAAC TCTATCCGCC GGAACTCCTG CAGGAAACCC TTTCCATTGC CGCGCGCTGT ACCTTTTCAT TAGATGAGCT GCGCTATGAA TATCCTGATG AATTAACGGG AACTGACGAG ACTTATACCG ACTATCTGCG GCGCATGGTC GAAAACGGCA TGAGGTGGCG TTTTCCCAAA GGTGCCTCAA TCAAGGTTAG GAAACAGATC GAGCATGAAT TGAAGCTGAT TGCTGATCTC AAGTATGAAC CTTATTTCCT GACAGTATTC GATATCGTAA GCTTCGCCCG GTCACAAGGG ATTTTGTGCC AGGGACGCGG ATCGGCAGCG AATTCCGCAG TCTGCTATTG CCTGGGTATT ACCGAGATCG ATCCGGAGCG TAGCAATCTG CTGTTCGAAC GCTTTATTTC CAGGGAACGC AACGAACCTC CCGACATCGA CGTCGATTTC GAGCACCAGC GTCGCGGGGA AGTCATTCAG TACATCTACC GGAGATACGG CCGTGATCGG GCCGCCATTG CTGCAACTGT CATCACCTAT CGCACCCGCT CGGCAGTTCA GGATGTGGGT AAAGCGCTGG GACTCGATAT CGAACGTATC AGACGCCTAT CCACCTCCCT TGCCCGATGG GATAAAAGCT CGGATATCAA TAAGCGCCTG GGCGAGAACG GCTTCGATCC CGGCAGTCCG GTAATCCGCC AACTGATTCT GCTGACCACT CTCCTGTATG GCTATCCGCG CCATTTGTCC CAGCACGTGG GCGGGTTTGT GCTGACCCGG GACAAACTCT GCCGCATTGT CCCCATCGAA AATGCCGCCA TGCCGAATCG CCGCATCATC CAATGGGATA AGGACGACCT GGCGGAGGTG GGGTTGATGA AAGTCGACGT GCTTGCGCTC GGAATGCTAT CGGCAATACA CCGCGCGCTT GATCTGATTT CGAGCCGGCG CGGTGTGCCC TTCCAGATGC AGGATATACC CCCGGAAGAC GAAAAGACTT ACGACATGAT CTGCAAGGCG GACACGATCG GCGTCTTTCA GATCGAATCC CGTGCGCAAA TGTCCATGCT GCCGCGTTTG 
AGGCCACGGA CTTTCTACGA TCTGGTAGTA GAAGTCGCGA TCGTCCGGCC CGGCCCGATT CAGGGCGACA TGGTGCATCC TTATCTGCGC CGCAGGCAGG ATAAGGAAAA GCCCGATTAT CCAGAAGGAG TGGAAACCGC CCTAGCGCGA ACCTATGGCG TCCCAATCTT CCAGGAACAG GTAATGCAGG TGGCCATGCT CGCCGCCGGC TTCACTTCAG GGGAAGCAGA CCAGTTGAGG CGCTCCATGG CTGCCTGGAA GAGAAAAGGA GGCTTGGGTG CCTTCCATGA GAAGCTGATC GATGGGATGA CAAACCGAGG CTACACACGG GAATTCGCCG AGCGTATCTT CCGGCAGATC GAGGGCTTTG GCGAATACGG GTTTCCGGAA TCCCACGCCG CAAGCTTTGC ATTGCTGGTA TATGTCTCCG CCTGGCTCAA ATGCCACGAG CCTGCCGCTT TCCTTTGCGC CCTGCTGAAC AGCCTGCCTA TGGGTTTCTA TAGCGCCTCG CAACTGATTC AGGATGCCGG GCGGCACGGG GTTCAAATCA AGCCGGTGGA CGTCACCATC AGCAATTGGG AGTGCATCCT TGAGGAACAG GAGGATAAAG CGCTTCAGCC GATTGTCCGG CTGGGCCTGA ACAGGGTGAA AGGGATGGGA CTGGAAGCCG CCACCCGCAT TGTTGAAGCA AGAGAAATCG CATCTTTTGA AAACCCCGAC GACCTGGCAA ACCGCGCTTC CTTGAATACC GCTGAAGTTC ATGCGCTTGC CCGTGCAGAT GCCTTGCGAA CCTTGTCGGG ACATCGGCGC CAGACGTTGT GGTCAGTTGC ATCGCATATC ACACAGCGGG ACCTGATGCG GCATGCTCCG GCGAAGGAAA CGTTGCCAGT CATTCCGGCC GCGCCGGAGG GAGAGGAAAT CATCGCCGAT TACGCCAGTA CCAGACTTAC CCTGCGCAGA CACCCCCTGG CACTGCTGCG CCCGACGCTG GCAAGCATGA ACCTGCGCTC GGCAATGGAG TTATCTGATT ACCCTACCGG GCGGTTGGTC CGCACCACCG GCATCGTGAC CTGCCGACAA CGCCCCGGCA CCGCCAGCGG CGTCATGTTC GTCACGCTCG AAGACGAAAC CGGGATAACG AACGTCGTGC TGTGGAACCA GGTCATCCTG AAGTACCGCC GCGAAACCTT GAACTCGAAG CTTCTGACAG TATACGGCGT ATGGCAATCC GAAAGCGGCG TCAAACACCT TATCGCCAAA CGGCTGGTGG ATCATAGCCA CCTGCTGGGC AGCCTTGTGG TGGAAAGCAG GGATTTTCAT TAG
|
Protein sequence | MPSSAPDYAE LHCCSTFNFL KGASMPEELV ERAATLGYSA LAITDECSMA GVVRAHAEAK KTGLHLLIGS EFVLEDGLRL VGLAMNREGY GNLCELITLA RQRSAKGTYR ITRADFESCT DAPHLARLPD CLILLIPRRN QADGSWVEDM RWVRSLFPGR GWISVERLLH AGEEVWFERI RHAAETIGMP LVAAGDVHMH DRAHKPLQDT MTAIRLSQPL AACGYALQPN AEQHLRTRLE LAQLYPPELL QETLSIAARC TFSLDELRYE YPDELTGTDE TYTDYLRRMV ENGMRWRFPK GASIKVRKQI EHELKLIADL KYEPYFLTVF DIVSFARSQG ILCQGRGSAA NSAVCYCLGI TEIDPERSNL LFERFISRER NEPPDIDVDF EHQRRGEVIQ YIYRRYGRDR AAIAATVITY RTRSAVQDVG KALGLDIERI RRLSTSLARW DKSSDINKRL GENGFDPGSP VIRQLILLTT LLYGYPRHLS QHVGGFVLTR DKLCRIVPIE NAAMPNRRII QWDKDDLAEV GLMKVDVLAL GMLSAIHRAL DLISSRRGVP FQMQDIPPED EKTYDMICKA DTIGVFQIES RAQMSMLPRL RPRTFYDLVV EVAIVRPGPI QGDMVHPYLR RRQDKEKPDY PEGVETALAR TYGVPIFQEQ VMQVAMLAAG FTSGEADQLR RSMAAWKRKG GLGAFHEKLI DGMTNRGYTR EFAERIFRQI EGFGEYGFPE SHAASFALLV YVSAWLKCHE PAAFLCALLN SLPMGFYSAS QLIQDAGRHG VQIKPVDVTI SNWECILEEQ EDKALQPIVR LGLNRVKGMG LEAATRIVEA REIASFENPD DLANRASLNT AEVHALARAD ALRTLSGHRR QTLWSVASHI TQRDLMRHAP AKETLPVIPA APEGEEIIAD YASTRLTLRR HPLALLRPTL ASMNLRSAME LSDYPTGRLV RTTGIVTCRQ RPGTASGVMF VTLEDETGIT NVVLWNQVIL KYRRETLNSK LLTVYGVWQS ESGVKHLIAK RLVDHSHLLG SLVVESRDFH
|
| |
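
The sequences above are displayed in space-separated blocks of ten characters, so they need to be normalized before any computation. The sketch below shows one way to strip the block spacing and compute a GC percentage. The helper names are hypothetical, and the record does not state how its own GC content figure was computed or rounded, so treat this as an approximation of that field rather than a reproduction of it.

```python
def normalize_sequence(raw: str) -> str:
    """Remove all whitespace (block spacing, line breaks) and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already normalized nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# The first two ten-base blocks of the gene sequence listed above.
first_blocks = "ATGCCCTCTT CTGCTCCGGA"
seq = normalize_sequence(first_blocks)
print(len(seq))         # 20
print(gc_percent(seq))  # 60.0
```

Applying the same two helpers to the full gene sequence should give a value that can be compared against the GC content field.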