Gene Namu_3155 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_3155 
SymboldnaE2 
ID8448769 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp3474461 
End bp3477739 
Gene Length3279 bp 
Protein Length1092 aa 
Translation table11 
GC content74% 
IMG OID645042236 
Producterror-prone DNA polymerase 
Protein accessionYP_003202477 
Protein GI258653321 
COG category[L] Replication, recombination and repair 
COG ID[COG0587] DNA polymerase III, alpha subunit 
TIGRFAM ID[TIGR00594] DNA-directed DNA polymerase III (polc) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00446252 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000661117 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGGCTGGA CCAACCCGAA CGTGCCGTGG AGCGAGCTGG AACGGGCCCT GTCCGGCCGC 
CCACCCGCGC CCGGCCGCGG GCCGCTGCAG GCGATCCGGC ACTGGCAGCA GGACCGGCCC
GACCCGCCGC CGATCCCGAC CGGGCCGCCG ACCGCGGTGC CCTACGCCGA GCTGCACTGC
CACTCCGCGT TCAGCTTCCT GGACGGCGCG TCCACCCCCG AGCAGCTGGT CGCCGAGGCG
GTGCGGCTGG GGCTGGAGGC GCTGGCGATC ACCGACCACG ACGGCCTGTA CGGCATCGTC
CGGTTCGCCG AGGCGGCTCG CGAGGCCGGC CTGCGCACCC TGTACGGCGC CGAGCTGTCC
CTGGGCCTGC CCGAACCGCA GCTGGGCGTG CCCGACCCGG TCGGCGAGCA CCTGCTGGTG
CTGGCCCGCG GCCAGGAGGG CTACCACCGG CTGTCCACCC AGATCAGCCG GGCTCAGCTG
GCCGGCGGCG CGAAGGGCCG GCCGGTCTAC GACCTGGACG AGCTGGCCGA CGCCGCCGGT
GGGCACTGGC TGATCCTCAC CGGCTGCCGC AAGGGCAGCG TCCGCCGCGC CCTGCAGCGG
CACGGCCCGG CCCGCGCCGC CGAGGAGCTG CGCGCGCTGA TGGACCGGTT CGGCCGGGAC
AACGTCGTCG TCGAACTGAC CACCAGCCTG GCGCCGACCG ACGACGAGGA CAACGACGCC
CTGGCCGCGC TGGCCGCCGA GCTGGACCTG CCGATCGTCG CGACCACGGC CGCCCACTAC
GCCACCCCGG CCGAGGGGCG GATCGCCGCG GCGATGGCCG CGGTCCGGGC CCGCCGCAGC
CTGGAGGAGG CCGACCCCTA CCTGCCCGCC GGCCCGGGGT CGCACCTGCG CTCGGGCGCC
GAGATGGCCG CCCTGTTCGC CCGGTACCCG GCCGCGGTGC CCACCGCCGC CGCGATCGGC
CGGGAGTGCT CGTTCGACCT GCGGCTGCTC GCGCCCAAGC TGCCGCCCTA CGACGTGCCC
CCCGGCCACG ACGAGAACTC GCACCTGCGC CAGCTCACCC TGGCCGGCGC GGCCCGCCGC
TACGGCTCGC CCGACGACCG GCCGGACGCG TACGCGCAGA TCGAGAAGGA GCTGGGGATC
ATCGCCCGGC TCGACTTCCC CGGCTACTTC CTGATCGTCA ACGACATCGT CGACTTCTGC
CGCCGCAGCG ACATCCTGTG CCAGGGCCGC GGGTCGGCCG CCAACTCCGC GGTCTGTTAC
GCGCTGGGCA TCACCGCGGT GGACGCGGTG CGCTACGAGC TGCTGTTCGA ACGGTTCCTG
GCCCCCGAGC GGGACGGCCC GCCGGACATC GACGTGGACA TCGAGTCCGA CCGGCGGGAG
GAGGTGATCC AGTACGTGTT CGCCACCTAC GGCCGGGACA AGGCGGCCCA GGTCGCCAAC
GTCAACACCT ACCGGCCGCG GATGGCGGTG CGGGACATGG CCAAGGCGCT GGGCCATTCA
CCCGGCCAGC AGGACGCCTA CTCCAAGCAG CTGGACGGCT GGGTGCCGCT GGCCGGGCAG
GACGCCAGCG GCATCCCGGC GGACGTGCTG GCCCTGGCCC ACCGGATCCA GGACTACCCC
CGGCACCTGA GCATCCACTC CGGCGGCATG GTCATCTGCG ACCGGCCGGT CGCCTCGGTG
TGCCCGGTGG AGTGGGCCCG GATGGAGAAC CGGTCGGTGC TGCAGTGGGA CAAGGACGAC
TGCGCCGCGA TGGGCCTGGT CAAGTTCGAC CTGCTGGGCC TGGGCATGCT CTCGGCCCTG
CACTACGCGG TCGACCTGAT CCGGGAACAC GAGGGCCTGG AGATCGACTT CGCCCAGCTC
GACCTGGAGG AGCCGGCGGT CTACGAGATG CTGCAGCGGG CCGACTCGGT CGGGGTGTTC
CAGGTGGAGT CCCGCGCCCA GATGGCCACC CTGCCCCGGC TCAAGCCGCG CACCTTCTAC
GACCTGGTGG TCGAGGTCGC GCTGATCCGC CCCGGGCCGA TCCAGGGCAA CGCGGTGCAC
CCCTACATCC GCCGCCGCAA CGGTCAGGAG GAGGTCACCT ACGACCACCC GGTGATGGAG
AAGTCGCTGG CCAAGACCCT GGGCATCCCG CTGTTCCAGG AGCAGCTGAT GCAGCTGGCG
GTGGACGTCG CCGGGTTCGA CGCCGGGGAG GCCGACCAGC TGCGCCGGGC GATGGGGGCC
AAGCGGTCCA AGGGAAAGAT GGAAAAGCTC AAGGCCCGCC TGTACGACGG GATGCGGGAG
CGGCACGGCA TCACCGGCCC GGTCGCCGAC CGCATCTACG AACGGCTGCT GGCGTTCGCC
AACTTCGGGT TCGCCGAGTC GCACGCGCTC TCGTTCGCGG CGCTGGTCTT CTACTCGGCC
TGGCTCAAGC TGCACCACCC GGCCGCCTTC TGCGCCGCGC TGCTGCGGGC CCAGCCGATG
GGCTTCTACT CGCCGCAGTC GCTGGTCGCC GACGCCCGGC GGCACGGGGT GAGCATCCGC
CGCCCGGACA TCAATGCCTC CCGGGCCCAT GCCGACCTGG AACCCGGGCC CGACGGCCGG
CCCGCGGTCC GGCTGGGCCT GGACGAGGTG CGCACGATCG GCGACGAGCT GGCCGCCCGC
ATCGTCGCCC GCCGCCCGAC CGAGGGGTAT CGCAGCCTGG ACGAGCTGAC CCGGACGGTG
ACCCTGACCG CGCCGCAGGC CGAGGCGTTG GCCACCGCCG GGGCGCTGGA CACGCTGGTC
GGCACCCGGC GACACGCGTT GTGGGCGGCC GGCGCCGCGG CCGGGGACCG GGCCGGCACC
CTGGCCGGCA GCGTCGTCGG GCTGGACGCG CCCGCGCTGC CCGGAATGAG CGACATCGAG
CTGACCGTCG CCGACATCTG GGCCACCGGC ATCTCGCCGC AGACCTACCC GACCGAGTTC
TCCCGGGACC AGCTCGACGC CTGGGGGGTC AAGACCGCCG CCGCCCTCAA GGACACGGCG
CACGGCACCC GGGTGCTGGT CGCCGGGGTG GTCACCCACC GGCAGCGCCC GGCCACCGCC
TCCGGCGTCA TCTTCATCAA CCTGGAGGAC GAGACCGGGA TGCTCAACGT CATCTGCTCG
GTCGGCCTGT GGGCCAAGTA CAAGCAAATC GCCCGCGGCT CACCGGCGCT GCTGGTGCGC
GGGGTGGTCG AACGCGCCGG TGGGGCGATC TCGATCGTCG CCGACCGGAT CGCCCGGGTG
AACCTGATCG CGGCCACCCC CTCCCGCGAC TTCCGCTGA
 
Protein sequence
MGWTNPNVPW SELERALSGR PPAPGRGPLQ AIRHWQQDRP DPPPIPTGPP TAVPYAELHC 
HSAFSFLDGA STPEQLVAEA VRLGLEALAI TDHDGLYGIV RFAEAAREAG LRTLYGAELS
LGLPEPQLGV PDPVGEHLLV LARGQEGYHR LSTQISRAQL AGGAKGRPVY DLDELADAAG
GHWLILTGCR KGSVRRALQR HGPARAAEEL RALMDRFGRD NVVVELTTSL APTDDEDNDA
LAALAAELDL PIVATTAAHY ATPAEGRIAA AMAAVRARRS LEEADPYLPA GPGSHLRSGA
EMAALFARYP AAVPTAAAIG RECSFDLRLL APKLPPYDVP PGHDENSHLR QLTLAGAARR
YGSPDDRPDA YAQIEKELGI IARLDFPGYF LIVNDIVDFC RRSDILCQGR GSAANSAVCY
ALGITAVDAV RYELLFERFL APERDGPPDI DVDIESDRRE EVIQYVFATY GRDKAAQVAN
VNTYRPRMAV RDMAKALGHS PGQQDAYSKQ LDGWVPLAGQ DASGIPADVL ALAHRIQDYP
RHLSIHSGGM VICDRPVASV CPVEWARMEN RSVLQWDKDD CAAMGLVKFD LLGLGMLSAL
HYAVDLIREH EGLEIDFAQL DLEEPAVYEM LQRADSVGVF QVESRAQMAT LPRLKPRTFY
DLVVEVALIR PGPIQGNAVH PYIRRRNGQE EVTYDHPVME KSLAKTLGIP LFQEQLMQLA
VDVAGFDAGE ADQLRRAMGA KRSKGKMEKL KARLYDGMRE RHGITGPVAD RIYERLLAFA
NFGFAESHAL SFAALVFYSA WLKLHHPAAF CAALLRAQPM GFYSPQSLVA DARRHGVSIR
RPDINASRAH ADLEPGPDGR PAVRLGLDEV RTIGDELAAR IVARRPTEGY RSLDELTRTV
TLTAPQAEAL ATAGALDTLV GTRRHALWAA GAAAGDRAGT LAGSVVGLDA PALPGMSDIE
LTVADIWATG ISPQTYPTEF SRDQLDAWGV KTAAALKDTA HGTRVLVAGV VTHRQRPATA
SGVIFINLED ETGMLNVICS VGLWAKYKQI ARGSPALLVR GVVERAGGAI SIVADRIARV
NLIAATPSRD FR