Gene Strop_3239 details


Gene Information

Locus tag: Strop_3239
Symbol: dnaE2
ID: 5059704
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora tropica CNB-440
Kingdom: Bacteria
Replicon accession: NC_009380
Strand:
Start bp: 3712867
End bp: 3716241
Gene Length: 3375 bp
Protein Length: 1124 aa
Translation table: 11
GC content: 71%
IMG OID: 640475487
Product: error-prone DNA polymerase
Protein accession: YP_001160051
Protein GI: 145595754
COG category: [L] Replication, recombination and repair
COG ID: [COG0587] DNA polymerase III, alpha subunit
TIGRFAM ID: [TIGR00594] DNA-directed DNA polymerase III (polc)
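
The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and that the final codon is a stop codon that is not counted in the protein length; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for a complete CDS, excluding the terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Values copied from the record above.
length = gene_length_bp(3712867, 3716241)
print(length)                     # 3375, matching "Gene Length"
print(protein_length_aa(length))  # 1124, matching "Protein Length"
```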


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.023295
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.114564
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGAGCTTCT ACAACCCGGA TCTGCCGTGG TCAGAGCTGG AGCGGGTGCT CAGTGGCGAA 
TCCACCCGGG GGCAACGCCG TACCCGAGAC GGCCAGCGGC ATCTGCACGT GGTGGACCCG
CTGGCGGTGG AGGCCGACGG CGGCGATGCT CCCGCACGGG CCGGTAGGCG TCCGCACCAC
CCGCCGCCCG CGACAGCCCG CCCGGATGAT GTGGTGCCGT ACGCCGAGCT GCACACGCAC
ACCAACTTCA GCTTCCTCGA CGGCGCCAGC CACCCGGAGG AGTTGGCCGA GGAGGCCGTC
CGGCTGGGGC TCACCGCTCT CGCGGTGACC GACCACGATG GCTTCTACGG CGTGGTGCGC
TTCGCCGAGG CGGCACGTGC CCTGCACCTG CCGACGGTCT TCGGCGCGGA GCTCTCCCTC
GATCTGCCGG GCGCGCCCAA CGGCGAGCCG GACCCGCTCG GGCGGCACCT GTTGCTGCTC
GCCCACGGCC ACGAGGGGTA CGCGCGGTTG GCCGCCACCA TCTCCCGGGC CCAGCTGCGC
GGTGGCGAGA AGGGCCGCCC GATCTACGGG GAGTTGGCGG AGGTCGCGGC CGAGCTGCGG
GACCACGTGC TGGTGCTCAC CGGCTGCCGC AAGGGTCACG TGCCGGCCGC GCTGCTCACC
GAGGGAGTTG ACGCGGCAGC CCGGGAACTG GATCGGTTGA CCGCGCTGTT CGGTGCGGAG
ACGGTCGCGG TGGAACTGAC CGACCACGGT CATCCGATTG ACGCCGACCG GAACGACGCC
CTCGCCGAGC TGGCGGAGGC CGCCGGGCTG CCGACAGTGG CCACCAACAA CGTGCACTAC
GCCACTCCGG GGCGGCGTCG GCTGGCCACC ACACTCGCCG CGGTCCGGGC CCGACGCAGC
CTGGACGAGA TCGACGGCTG GCTACCCGCC GCCGGCGCCG CCCATCTGCG CAGCGGTGCC
GAGATGGCGG CCCGGTTCGC CGCCTATCCG GGGGCCGTGG CCCGAGCCGC CGAGTTCGGC
GCGGAACTCG CCTTCGACCT GCAGCTCGTC GCCCCGGCGC TACCGGCGTA TCCGGTGGCG
TCCGGACACA CCGAGATGAG CTGGCTGCGT CATCTCACGG CGCGGGGGGC TCGAGAGCGC
TACGGCCCAC CGGAGGCGCA CCCCGAGGCG TACGCGCAGC TTGAACATGA GCTGAACATG
ATCGAGGAAC TGGGCTTTCC CGGCTACTTC CTGGTGGTCT ACGACATCGT CACGTTCTGC
CGTGAGCAGG ACATCTACTG TCAGGGCCGA GGGTCGGCGG CGAACTCGGC GGTCTGCTAT
GCGTTACGGA TAACCAACGT TGACGCAGTC CGCCACCGGC TGCTCTTCGA GCGCTTCCTC
GCCCCGGAGC GGGATGGGCC ACCGGACATC GACGTGGACA TCGAGTCCTC CCGTCGGGAG
GAGGTCATCC AACACGTCTA CGCCCGGTAC GGGCGGGAGC ACACTGCTCA GGTCGCCAAT
GTCATCTCGT ACCGGCCCCG GTCGGCGGTG CGGGACGTGG CGAAGGCGTT CGGGTTCTCG
CCCGGCCAGC AGGACGCCTG GAGCAAGCAG ATCGATCGGT GGGGTTCGGT CGCCGCGGCC
GATGTCGAGG GCATCCCCGA GCCGGTGGTG GCGTACGCCG ACGCGGTGCA GACCTTTCCC
CGGCACCTGG GCATCCACTC CGGCGGCATG GTGATCTGCG ACCGACCGGT GATCGAGGTG
TGCCCGGTGG AGTGGGGGCG GATGCCCGGC CGGAGCGTGC TCCAGTGGGA CAAGGACGAC
TGCGCCGCCG TTGGGCTGGT CAAGTTCGAC CTGCTCGGCC TCGGCATGCT CGCCGCGTTG
CACCACGGCT ACGACATGAT CGGGGCGCGG CTCGAGCTGG GCGACATGAC TCTGGATGAC
GACGAGGTCT ACGACATGCT CTGCCGAGCC GACGCGGTCG GGGTGTTCCA GGTGGAGAGC
CGCGCTCAGA TGGCCACCCT GCCCCGGCTC AAACCCCGCT GCTTCTACGA CCTGGTGGTG
GAGGTGGCGC TGATCCGGCC TGGCCCGATC CAGGGCGGCT CGGTGCACCC GTACATCCGG
CGTAAGAACG GCCAGGAACC GGTCACCTTC CCGCATCCGC TGATGCGCAA CGCGCTGGAG
AAGACCTTGG GCGTGCCGCT GTTCCAGGAG CAGCTCATGC AGCTCGCCAT CGACCTGGCC
GGCTTCGACG CGGCCGAGGC GGACCAGCTG CGCCGGGCGA TGGGGGCCAA GCGGTCGGCG
GAGCGGATGG CCAGGATCGC CGATCGGCTC TACGCCGGTA TGGCCGAGCG GGGCATCACC
GGCGAGTTGG CCGACGATGT CTACCGCAAG CTCACCGCGT TCGCCAGTTA TGGCTTCCCG
GAGAGCCACG CGATGAGCTT CGCCTATCTG GTCTACGCCA GTTCCTGGCT CAAGCGGTAC
CACCCGGCGC CGTTTCTGGC CGCGCTGCTC AACGCCCAAC CGATGGGGTT CTATTCGCCG
CAGACCCTGG TGGATGACGC TCGTCGGCAC GGGGTGGAGG TCCGTCGGCC GGATGTCAAT
GCCAGCGGTG CCGGGGCGGT CCTGGAGTCC ACCCCGAGCA CCCGTTGGGG AAGCCAGCCG
GGGGAGCCGC CGCACGCGTG GGGGCTGGGT GGTCCGGCGG TTCGGCTCGG GCTGGACAGT
GTGCGTACGC TCGGTGGGGA GGTGGCCGGG CGAATCGAGA CCGAGCGGGC AGCGCGTGGG
CCGTATCGGG ATATGTCGGA CCTGGCCCGG CGGGTGGGTC TGACCGCCGC GCAACTGGAG
GCGCTTGCCA CCGCGGATGC CTTCGCGTGT TTTGGCGTGA CCCGGCGGCA GGCACTCTGG
AGCGCTGGGG CGGCAGCCCA GGAGCATCCC GACCGCCTGC CCGGCACCGT GCCCGGCACG
GTCGCGCCGA CGCTGCCCGG GATGGGGGCG GTCGATCGTC TCGTCGCGGA TGTCTGGGCG
ACGGGGCTGT CGCCGGAGAG CCATCCGGCC CGGTTCGTCC GAGAACAGCT CGACGCCCTG
GGGGCGGTGC CGATCGCCCG GCTCGCGCGG GTGGAGCCGG GTCGGCGGGT TCGGGTTGGC
GGGATCGTCA CCCACCGGCA GCGTCCGGCA ACCGCGGGTG GGGTCACCTT CCTCAACCTG
GAGGACGAGA CGGGGATGCT CAATGTCACC TGTTCCCCTG GGTTGTGGCA GCGCTACAGG
CAGGTGGCGA AGAACAGCGG GGGGCTGGTG GTTCGGGGTC TCCTCCATCG GCACGAGGGG
GTGATCAATT TCACCGCTGA CCGGTTGGAC CCGATTGAGC TCCCGGTCCG TCCGGCCGCC
CGCGACTTTC GGTGA
 
Protein sequence
MSFYNPDLPW SELERVLSGE STRGQRRTRD GQRHLHVVDP LAVEADGGDA PARAGRRPHH 
PPPATARPDD VVPYAELHTH TNFSFLDGAS HPEELAEEAV RLGLTALAVT DHDGFYGVVR
FAEAARALHL PTVFGAELSL DLPGAPNGEP DPLGRHLLLL AHGHEGYARL AATISRAQLR
GGEKGRPIYG ELAEVAAELR DHVLVLTGCR KGHVPAALLT EGVDAAAREL DRLTALFGAE
TVAVELTDHG HPIDADRNDA LAELAEAAGL PTVATNNVHY ATPGRRRLAT TLAAVRARRS
LDEIDGWLPA AGAAHLRSGA EMAARFAAYP GAVARAAEFG AELAFDLQLV APALPAYPVA
SGHTEMSWLR HLTARGARER YGPPEAHPEA YAQLEHELNM IEELGFPGYF LVVYDIVTFC
REQDIYCQGR GSAANSAVCY ALRITNVDAV RHRLLFERFL APERDGPPDI DVDIESSRRE
EVIQHVYARY GREHTAQVAN VISYRPRSAV RDVAKAFGFS PGQQDAWSKQ IDRWGSVAAA
DVEGIPEPVV AYADAVQTFP RHLGIHSGGM VICDRPVIEV CPVEWGRMPG RSVLQWDKDD
CAAVGLVKFD LLGLGMLAAL HHGYDMIGAR LELGDMTLDD DEVYDMLCRA DAVGVFQVES
RAQMATLPRL KPRCFYDLVV EVALIRPGPI QGGSVHPYIR RKNGQEPVTF PHPLMRNALE
KTLGVPLFQE QLMQLAIDLA GFDAAEADQL RRAMGAKRSA ERMARIADRL YAGMAERGIT
GELADDVYRK LTAFASYGFP ESHAMSFAYL VYASSWLKRY HPAPFLAALL NAQPMGFYSP
QTLVDDARRH GVEVRRPDVN ASGAGAVLES TPSTRWGSQP GEPPHAWGLG GPAVRLGLDS
VRTLGGEVAG RIETERAARG PYRDMSDLAR RVGLTAAQLE ALATADAFAC FGVTRRQALW
SAGAAAQEHP DRLPGTVPGT VAPTLPGMGA VDRLVADVWA TGLSPESHPA RFVREQLDAL
GAVPIARLAR VEPGRRVRVG GIVTHRQRPA TAGGVTFLNL EDETGMLNVT CSPGLWQRYR
QVAKNSGGLV VRGLLHRHEG VINFTADRLD PIELPVRPAA RDFR
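
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence and comparing the result with the listed protein. The following is a minimal sketch, not the tool that generated this record: it builds a codon table by hand, strips the 10-character block spacing used above, and translates up to the first stop codon. Translation table 11 assigns the same amino acids to codons as the standard code (it differs in the permitted start codons), and this gene begins with ATG, so the standard assignments are assumed to be sufficient here. All function names are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order; shared by translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 0 and stop at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence above.
first_line = "ATGAGCTTCT ACAACCCGGA TCTGCCGTGG TCAGAGCTGG AGCGGGTGCT CAGTGGCGAA"
print(translate(first_line))  # MSFYNPDLPWSELERVLSGE
```

The printed residues match the first 20 residues of the protein sequence. Pasting the full gene sequence into `translate` and `gc_content` would allow the same comparison against the complete protein and the reported GC content.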