Gene Sare_3466 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_3466 
SymboldnaE2 
ID5708068 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp3996723 
End bp4000097 
Gene Length3375 bp 
Protein Length1124 aa 
Translation table11 
GC content71% 
IMG OID641272893 
Producterror-prone DNA polymerase 
Protein accessionYP_001538259 
Protein GI159039006 
COG category[L] Replication, recombination and repair 
COG ID[COG0587] DNA polymerase III, alpha subunit 
TIGRFAM ID[TIGR00594] DNA-directed DNA polymerase III (polc) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.14636 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00222326 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGAGCTTCC ACAATCCGGA TCTGCCGTGG TCGGAACTGG AGCGGGTGCT CACCGGTAGA 
CCCGCCCGGG GCCAACGCCG CACCCAGGAC GGTGAGCGGC ACCTGCACGT GGTGGACCCG
CTGGCGGTGG ACGCCGACGG CGGAGATGCT GCCGCGTGGG CCGGCAAACG TCAGCACCGT
ACGCCGCCCG CGGTGAGCCG CCCGGATGAT GCGGTGCCGT ACGCCGAGCT GCACACGCAC
ACCAACTTCA GTTTCCTCGA CGGGGCCAGC CACCCGGAGG AACTGGCCGA GGAGGCCACC
CGGCTGGGAC TCACCGCTCT CGCGGTGACC GACCACGACG GCTTCTACGG CGTGGTGCGC
TTCGCCGAGG CGGCCCGCGC CCTGCACCTG CCGACGATCT TCGGCGCGGA GCTCTCCCTC
GACCTGCCCG GCCCACCCAA TGGTGAGCCG GATCCGCTCG GGCGGCACCT GTTGCTGCTC
GCCCACGGCC ACGAGGGGTA CGCCCGGCTG GCCGCCACCA TCTCCCGGGC CCAGCTGCGC
GGCGGGGAGA AGGGCCGCCC GGTCTACGGA GAGCTGGCGG AGGTCGCGGC CGAGTTGCGT
GACCATGTGC TGGTGCTCAC CGGCTGCCGC AAGGGCCACG TGCCAGCCGC GCTGCTCACC
GAGGGCGTTG ACGCGGCAGC CCGGGAACTG GACCGGCTGA CCGCGCTGTT CGGCGCGGAG
ACGGTCGCGG TGGAGTTGAC CGACCACGGC CACCCGGTCG ACGCCGACCG CAACGACGCC
CTCGCCGAGC TGGCGGATGC CGCCGGGTTA CCGACGGTGG CTACCAACAA CGTGCACTAC
GCCACTCCGG GACGGCGCCG GCTGGCCACC ACACTCGCCG CGGTTCGGGC CCGGCGTAGC
CTGGACGAGA TCGATGGCTG GTTACCCGCC GCCGGCACCG CCCACCTGCG GAGCGGTGCC
GAGATGGCGG CCCGGTTCGC CGCGTACCCA GGGGCTGTGG CCCGGGCCGC CGAGTTCGGT
GCGGAACTCG CCTTCGACCT GCAACTCGTC GCCCCAGCGC TACCGGCCTA TCCGGTTCCG
TCCGGGCACA CCGAGATGAG CTGGCTGCGT CACCTCACGG CGCGGGGGGC ACGAGAACGC
TACGGCCCGC CGGAGGCGCA CCCCGAGGCG TACGCGCAGC TGGAGCACGA GCTGAACATG
ATCGAGGAGC TGGGCTTTCC CGGCTACTTC CTGGTGGTCT ACGACATCGT CACGTTCTGC
CGTGAGCAGG ACATCTACTG CCAGGGTCGG GGGTCGGCGG CGAACTCGGC GGTCTGCTAC
GCCTTGCGGA TCACCAACGT CGACGCGGTC CGTCATCGGC TGCTCTTCGA GCGCTTCCTC
GCCCCGGAGC GGGACGGGCC GCCGGACATC GACGTGGACA TCGAGTCCCA CCGCCGGGAG
GAGGTCATCC AACACGTCTA CGCCCGGTAC GGGCGGGAGC ACACCGCCCA GGTCGCCAAC
GTCATCTCGT ATCGGCCCCG GTCGGCGGTG CGGGACGTGG CGAAGGCGTT CGGGTTCTCC
CCGGGTCAAC AGGACGCCTG GAGCAAGCAG GTCGACCGGT GGGGCTCGGT TGCCGAGGTC
GATGCCGAGG GCATTCCCGA ACCGGTGGTG GCGTACGCCG ACGCGGTGCA GACCTTCCCG
CGGCATCTGG GCATCCACTC CGGCGGCATG GTGATCTGCG ACCGTCCGGT GATCGAGGTG
TGCCCGGTGG AGTGGGGGCG GATGCCCGGC CGGAGCGTGC TCCAGTGGGA CAAGGACGAC
TGTGCCGCCG TCGGGCTGGT CAAGTTCGAC CTGCTCGGTC TCGGCATGCT CGCCGCGTTG
CACCACGGCT ACGACATGAT CGGGACCCGG CTCGAGCTGG GCGACATGAC CCTGGACGAT
TCCGAGGTCT ACGACATGCT CTGCCGGGCG GACGCGGTCG GGGTGTTCCA GGTGGAGAGC
CGTGCCCAGA TGGCTACCCT GCCCCGGCTC AGACCCCGCT GCTTCTACGA CCTGGTGGTG
GAGGTGGCGC TGATCCGGCC CGGCCCGATC CAGGGCGGCT CGGTGCATCC GTACATCCGG
CGTAAGAACG GCCAGGAGCC GGTCACCTAC CCCCATCCGC TGATGCGCAA CGCGCTGGAG
AAGACCTTGG GCGTGCCGCT GTTCCAGGAG CAGCTCATGC AGCTCGCCAT CGACCTGGCC
GGCTTCGACG CGGCCGAGGC GGATCAGCTG CGTCGGGCGA TGGGGGCAAA GCGGTCGGTG
GAGCGGATGG CCCGGATCGC CGACCGGCTC TACGCCGGTA TGGCCGAGCG GGGTATCACC
GGTGAGCTGG CTGACGACGT CTACCGCAAG CTCACCGCGT TCGCCAGCTA CGGTTTCCCG
GAGAGCCACG CGATGAGCTT TGCTTACCTG GTCTACGCCA GTTCCTGGCT CAAGCGGTAC
CACCCGGCAC CGTTCCTGGC CGCGCTGCTC AATGCCCAAC CGATGGGGTT CTACTCGCCG
CAGACTCTGG TCGATGACGC CCGCCGGCAC GGGGTGGAGG TCCGCCGGCC AGACGTCAAC
GCCAGCGGTG CCGGAGCGGT CCTGGAGTCC ACCCCGAACA CTCGGTGGGG GAGCCAGCCG
GGGGAGCCGC CGCACGCGTG GGGGCTTGGC GGTCCGGCGG TCCGGCTCGG GCTGGACGGC
GTGCGTGCCC TCGGTGCGGA GGTGGCCGAG CGGATCGTGG CCGAGCGGAC GGCGCATGGG
CTGTATCAGG ATATGTCGGA TCTGGCCCGG CGTGCCGGTC TCACCGCCGC ACAACTGGAG
GCGCTCGCCA CCGCGGATGC CTTCGCGTGC TTCGGTGTGA CCCGACGACA GGCGCTCTGG
GCCGCCGGGG CGGCGGCCCA GGACCGCCCC GACCGTCTGC CCGGTACCGT GCCCGGCACG
GTCGCACCGA CGCTGCCCGG GATGGCGGCG GTCGACCGTC TCGTCGCGGA TGTGTGGGCG
ACCGGGTTGT CGCCGGAGAG TCACCCGGCC CGTTTCATCC GGGACCAACT CGACGCCCTG
GGGGCGGTAC CGATCGACCA GCTCGCGCGG GTGGAGCCGG GTCGGCGGAT TCGGGTCGGC
GGGATCGTCA CCCACCGGCA GCGTCCGGCG ACCGCGGGTG GGGTCACCTT CCTCAACCTG
GAGGACGAGA CGGGGATGCT CAATGTCACC TGTTCCCCCG GGCTGTGGCA GCGCTACCGG
CAGGTGGCGA AGAGCAGTGG GGCGCTGGTG GTCCGGGGTC TCCTCCAACG GTACGACGGA
GTGATCAATC TCATCGCTGA CCGGTTGGAC CCGATTGATC TCCAGGTTCG TCTGGCCGCT
CGCGACTTCC GTTGA
 
Protein sequence
MSFHNPDLPW SELERVLTGR PARGQRRTQD GERHLHVVDP LAVDADGGDA AAWAGKRQHR 
TPPAVSRPDD AVPYAELHTH TNFSFLDGAS HPEELAEEAT RLGLTALAVT DHDGFYGVVR
FAEAARALHL PTIFGAELSL DLPGPPNGEP DPLGRHLLLL AHGHEGYARL AATISRAQLR
GGEKGRPVYG ELAEVAAELR DHVLVLTGCR KGHVPAALLT EGVDAAAREL DRLTALFGAE
TVAVELTDHG HPVDADRNDA LAELADAAGL PTVATNNVHY ATPGRRRLAT TLAAVRARRS
LDEIDGWLPA AGTAHLRSGA EMAARFAAYP GAVARAAEFG AELAFDLQLV APALPAYPVP
SGHTEMSWLR HLTARGARER YGPPEAHPEA YAQLEHELNM IEELGFPGYF LVVYDIVTFC
REQDIYCQGR GSAANSAVCY ALRITNVDAV RHRLLFERFL APERDGPPDI DVDIESHRRE
EVIQHVYARY GREHTAQVAN VISYRPRSAV RDVAKAFGFS PGQQDAWSKQ VDRWGSVAEV
DAEGIPEPVV AYADAVQTFP RHLGIHSGGM VICDRPVIEV CPVEWGRMPG RSVLQWDKDD
CAAVGLVKFD LLGLGMLAAL HHGYDMIGTR LELGDMTLDD SEVYDMLCRA DAVGVFQVES
RAQMATLPRL RPRCFYDLVV EVALIRPGPI QGGSVHPYIR RKNGQEPVTY PHPLMRNALE
KTLGVPLFQE QLMQLAIDLA GFDAAEADQL RRAMGAKRSV ERMARIADRL YAGMAERGIT
GELADDVYRK LTAFASYGFP ESHAMSFAYL VYASSWLKRY HPAPFLAALL NAQPMGFYSP
QTLVDDARRH GVEVRRPDVN ASGAGAVLES TPNTRWGSQP GEPPHAWGLG GPAVRLGLDG
VRALGAEVAE RIVAERTAHG LYQDMSDLAR RAGLTAAQLE ALATADAFAC FGVTRRQALW
AAGAAAQDRP DRLPGTVPGT VAPTLPGMAA VDRLVADVWA TGLSPESHPA RFIRDQLDAL
GAVPIDQLAR VEPGRRIRVG GIVTHRQRPA TAGGVTFLNL EDETGMLNVT CSPGLWQRYR
QVAKSSGALV VRGLLQRYDG VINLIADRLD PIDLQVRLAA RDFR