Gene Nmul_A2599 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A2599 
Symbol 
ID3785480 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp2984187 
End bp2987309 
Gene Length3123 bp 
Protein Length1040 aa 
Translation table11 
GC content57% 
IMG OID637812688 
ProductDNA polymerase III, alpha subunit 
Protein accessionYP_413278 
Protein GI82703712 
COG category[L] Replication, recombination and repair 
COG ID[COG0587] DNA polymerase III, alpha subunit 
TIGRFAM ID[TIGR00594] DNA-directed DNA polymerase III (polc) 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCCTCTT CTGCTCCGGA TTACGCTGAA CTTCATTGCT GTTCCACCTT CAACTTCCTC 
AAAGGCGCTT CCATGCCGGA AGAGCTGGTG GAACGTGCCG CCACCTTGGG TTACAGCGCC
CTCGCTATCA CGGATGAGTG CTCTATGGCG GGAGTGGTGA GAGCGCATGC GGAAGCCAAA
AAAACGGGAT TGCATCTTCT GATCGGCAGC GAATTCGTGC TGGAGGATGG GCTGCGGCTG
GTGGGTCTCG CCATGAACCG GGAAGGCTAT GGCAATCTGT GCGAACTGAT CACGCTGGCA
AGGCAGCGTA GCGCGAAAGG GACTTATCGC ATCACTCGCG CGGACTTTGA AAGCTGTACG
GATGCACCTC ATCTTGCCCG CTTGCCCGAT TGCCTGATCC TGCTGATTCC TCGCCGGAAT
CAAGCAGACG GTTCCTGGGT CGAGGATATG CGCTGGGTGC GGTCGCTGTT TCCCGGCCGC
GGCTGGATCT CTGTCGAACG GCTGCTTCAT GCCGGGGAAG AGGTGTGGTT CGAAAGAATA
CGCCACGCGG CGGAAACCAT TGGCATGCCG CTGGTGGCGG CAGGCGATGT GCATATGCAC
GATCGTGCCC ACAAGCCCTT GCAGGATACG ATGACCGCAA TTCGGCTGAG CCAACCGCTA
GCGGCATGCG GTTATGCGTT GCAGCCTAAT GCAGAACAGC ATCTGCGCAC CCGATTGGAG
CTGGCGCAAC TCTATCCGCC GGAACTCCTG CAGGAAACCC TTTCCATTGC CGCGCGCTGT
ACCTTTTCAT TAGATGAGCT GCGCTATGAA TATCCTGATG AATTAACGGG AACTGACGAG
ACTTATACCG ACTATCTGCG GCGCATGGTC GAAAACGGCA TGAGGTGGCG TTTTCCCAAA
GGTGCCTCAA TCAAGGTTAG GAAACAGATC GAGCATGAAT TGAAGCTGAT TGCTGATCTC
AAGTATGAAC CTTATTTCCT GACAGTATTC GATATCGTAA GCTTCGCCCG GTCACAAGGG
ATTTTGTGCC AGGGACGCGG ATCGGCAGCG AATTCCGCAG TCTGCTATTG CCTGGGTATT
ACCGAGATCG ATCCGGAGCG TAGCAATCTG CTGTTCGAAC GCTTTATTTC CAGGGAACGC
AACGAACCTC CCGACATCGA CGTCGATTTC GAGCACCAGC GTCGCGGGGA AGTCATTCAG
TACATCTACC GGAGATACGG CCGTGATCGG GCCGCCATTG CTGCAACTGT CATCACCTAT
CGCACCCGCT CGGCAGTTCA GGATGTGGGT AAAGCGCTGG GACTCGATAT CGAACGTATC
AGACGCCTAT CCACCTCCCT TGCCCGATGG GATAAAAGCT CGGATATCAA TAAGCGCCTG
GGCGAGAACG GCTTCGATCC CGGCAGTCCG GTAATCCGCC AACTGATTCT GCTGACCACT
CTCCTGTATG GCTATCCGCG CCATTTGTCC CAGCACGTGG GCGGGTTTGT GCTGACCCGG
GACAAACTCT GCCGCATTGT CCCCATCGAA AATGCCGCCA TGCCGAATCG CCGCATCATC
CAATGGGATA AGGACGACCT GGCGGAGGTG GGGTTGATGA AAGTCGACGT GCTTGCGCTC
GGAATGCTAT CGGCAATACA CCGCGCGCTT GATCTGATTT CGAGCCGGCG CGGTGTGCCC
TTCCAGATGC AGGATATACC CCCGGAAGAC GAAAAGACTT ACGACATGAT CTGCAAGGCG
GACACGATCG GCGTCTTTCA GATCGAATCC CGTGCGCAAA TGTCCATGCT GCCGCGTTTG
AGGCCACGGA CTTTCTACGA TCTGGTAGTA GAAGTCGCGA TCGTCCGGCC CGGCCCGATT
CAGGGCGACA TGGTGCATCC TTATCTGCGC CGCAGGCAGG ATAAGGAAAA GCCCGATTAT
CCAGAAGGAG TGGAAACCGC CCTAGCGCGA ACCTATGGCG TCCCAATCTT CCAGGAACAG
GTAATGCAGG TGGCCATGCT CGCCGCCGGC TTCACTTCAG GGGAAGCAGA CCAGTTGAGG
CGCTCCATGG CTGCCTGGAA GAGAAAAGGA GGCTTGGGTG CCTTCCATGA GAAGCTGATC
GATGGGATGA CAAACCGAGG CTACACACGG GAATTCGCCG AGCGTATCTT CCGGCAGATC
GAGGGCTTTG GCGAATACGG GTTTCCGGAA TCCCACGCCG CAAGCTTTGC ATTGCTGGTA
TATGTCTCCG CCTGGCTCAA ATGCCACGAG CCTGCCGCTT TCCTTTGCGC CCTGCTGAAC
AGCCTGCCTA TGGGTTTCTA TAGCGCCTCG CAACTGATTC AGGATGCCGG GCGGCACGGG
GTTCAAATCA AGCCGGTGGA CGTCACCATC AGCAATTGGG AGTGCATCCT TGAGGAACAG
GAGGATAAAG CGCTTCAGCC GATTGTCCGG CTGGGCCTGA ACAGGGTGAA AGGGATGGGA
CTGGAAGCCG CCACCCGCAT TGTTGAAGCA AGAGAAATCG CATCTTTTGA AAACCCCGAC
GACCTGGCAA ACCGCGCTTC CTTGAATACC GCTGAAGTTC ATGCGCTTGC CCGTGCAGAT
GCCTTGCGAA CCTTGTCGGG ACATCGGCGC CAGACGTTGT GGTCAGTTGC ATCGCATATC
ACACAGCGGG ACCTGATGCG GCATGCTCCG GCGAAGGAAA CGTTGCCAGT CATTCCGGCC
GCGCCGGAGG GAGAGGAAAT CATCGCCGAT TACGCCAGTA CCAGACTTAC CCTGCGCAGA
CACCCCCTGG CACTGCTGCG CCCGACGCTG GCAAGCATGA ACCTGCGCTC GGCAATGGAG
TTATCTGATT ACCCTACCGG GCGGTTGGTC CGCACCACCG GCATCGTGAC CTGCCGACAA
CGCCCCGGCA CCGCCAGCGG CGTCATGTTC GTCACGCTCG AAGACGAAAC CGGGATAACG
AACGTCGTGC TGTGGAACCA GGTCATCCTG AAGTACCGCC GCGAAACCTT GAACTCGAAG
CTTCTGACAG TATACGGCGT ATGGCAATCC GAAAGCGGCG TCAAACACCT TATCGCCAAA
CGGCTGGTGG ATCATAGCCA CCTGCTGGGC AGCCTTGTGG TGGAAAGCAG GGATTTTCAT
TAG
 
Protein sequence
MPSSAPDYAE LHCCSTFNFL KGASMPEELV ERAATLGYSA LAITDECSMA GVVRAHAEAK 
KTGLHLLIGS EFVLEDGLRL VGLAMNREGY GNLCELITLA RQRSAKGTYR ITRADFESCT
DAPHLARLPD CLILLIPRRN QADGSWVEDM RWVRSLFPGR GWISVERLLH AGEEVWFERI
RHAAETIGMP LVAAGDVHMH DRAHKPLQDT MTAIRLSQPL AACGYALQPN AEQHLRTRLE
LAQLYPPELL QETLSIAARC TFSLDELRYE YPDELTGTDE TYTDYLRRMV ENGMRWRFPK
GASIKVRKQI EHELKLIADL KYEPYFLTVF DIVSFARSQG ILCQGRGSAA NSAVCYCLGI
TEIDPERSNL LFERFISRER NEPPDIDVDF EHQRRGEVIQ YIYRRYGRDR AAIAATVITY
RTRSAVQDVG KALGLDIERI RRLSTSLARW DKSSDINKRL GENGFDPGSP VIRQLILLTT
LLYGYPRHLS QHVGGFVLTR DKLCRIVPIE NAAMPNRRII QWDKDDLAEV GLMKVDVLAL
GMLSAIHRAL DLISSRRGVP FQMQDIPPED EKTYDMICKA DTIGVFQIES RAQMSMLPRL
RPRTFYDLVV EVAIVRPGPI QGDMVHPYLR RRQDKEKPDY PEGVETALAR TYGVPIFQEQ
VMQVAMLAAG FTSGEADQLR RSMAAWKRKG GLGAFHEKLI DGMTNRGYTR EFAERIFRQI
EGFGEYGFPE SHAASFALLV YVSAWLKCHE PAAFLCALLN SLPMGFYSAS QLIQDAGRHG
VQIKPVDVTI SNWECILEEQ EDKALQPIVR LGLNRVKGMG LEAATRIVEA REIASFENPD
DLANRASLNT AEVHALARAD ALRTLSGHRR QTLWSVASHI TQRDLMRHAP AKETLPVIPA
APEGEEIIAD YASTRLTLRR HPLALLRPTL ASMNLRSAME LSDYPTGRLV RTTGIVTCRQ
RPGTASGVMF VTLEDETGIT NVVLWNQVIL KYRRETLNSK LLTVYGVWQS ESGVKHLIAK
RLVDHSHLLG SLVVESRDFH