Gene Sare_1784 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_1784 
SymboldnaE 
ID5706595 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp2055942 
End bp2059475 
Gene Length3534 bp 
Protein Length1177 aa 
Translation table11 
GC content66% 
IMG OID641271287 
ProductDNA polymerase III subunit alpha 
Protein accessionYP_001536662 
Protein GI159037409 
COG category[L] Replication, recombination and repair 
COG ID[COG0587] DNA polymerase III, alpha subunit 
TIGRFAM ID[TIGR00594] DNA-directed DNA polymerase III (polc) 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00195815 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGGCGATT CGTTCGCGCA TCTGCACGTG CACACGGAGT ACTCGATGCT CGACGGTGCG 
GCCCGGCTGA AGGACCTGTT CGCCGAGGTC AACCGCCAGG GGATGCCGGC CGTGGCAATG
ACCGACCACG GAAACATGCA CGGCGCGAAC GACTTCTACA AGCAGGCGAT GGCGGCCGGC
GTCACACCGA TCCTGGGGAT CGAGGCGTAC GTCGCACCGG AGTCGCGGTT CCACAAGCAG
CGGGTGCGGT GGGGCCGGCC GGAGCAGAAG AGCGACGACG TCTCCGGCAG CGGTGGTTAC
ACCCACATGA CCATCTGGGC GCGTAACAAG GTGGGGTTGC ACAACCTGTT CAAGCTGACC
AGTCGGTCGT TCACCGAGGG GTTCTTCGTC AAGTGGCCGC GGATGGACGC GGAGTTGCTC
GCTGAGCACG CCGACGGGTT GATGGCCACG ACCGGCTGCC CCTCCGGCGA GGTGCAGACC
CGGTTACGGC TGGAACAGTA TGACGAGGCC CTGAAGGCTG CCGCGAGATA CCAGGACATC
TTCGGCAAGG AGAACTTCTT CCTGGAGGTC ATGGACCACG GCATCGACAT CGAGCGTCGG
GTTCGCCAGG AGCTGGCGGA GATCTCGCAC AAGCTGGATA TTCCGCCCGT TGTCACCAAC
GACACGCACT ACACCCACCA GGAGCAGTCC GAGGCGCACG ATGTGCTGCT CTGCGTGCAG
ACCGCGGCGA ACGTCAGCGA CCCCAACCGG TTCCGGTTCG ACGGCAGCGG CTACTACATC
AAGTCCGCCG ACGAGATGCG CGCCATCGAC TCGTCCGACC TGTGGCTTCA GGGCTGCCGC
AACACGCTGC TGGTCGCGGA GAAGGTCGAC CCGACCGGCA TGTTCGACTT CCGGAACCTG
ATGCCGCGCT TCCCGGTGCC GGAGGGGGAG ACCGACGAGT CCTGGTTCCG CAAGGAGACG
TTCAAAGGGC TCGCCCGCCG CTTCCCCGAC GGCGTCCCGG AGACCCACGT CAAGCAGGCC
GAGTACGAAC TGGGTGTCAT CATCCAGATG GGTTTCCCGT CGTACTTCCT CGTGGTCGCC
GACTTCATCC AGTGGGCCAA GGGCGAGGGC ATCGCCGTGG GGCCCGGCCG TGGTTCGGCC
GCCGGTTCCC TGGTCGCGTA CGCCCTGGGC ATCACCGACC TGGATCCGTT GCCGCACGGG
CTGATCTTCG AGCGATTCCT CAATCCCGAG CGGGTCTCGA TGCCGGATGT CGACATCGAC
TTCGACGAGC GTCGGCGCGG CGAGGTGATC AAGTACGTCA CGGAGAAGTG GGGTGAGGAC
AAGGTCGCGC AGATCGCGAC CTTCGGCACG ATCAAGGCGA AGGCCGCGAT CAAGGACTCG
GCGCGGGTGC TCGGCTACCC GTACGCGGTC GGTGACCGGA TCACCAAGGC GATGCCACCG
GCCGTGATGG GCAAGGACAT CCCGCTGACC GGCATCTTCG ACCCGAAGCA CTCCCGGTAC
GCGGAGGCCG GCGAGATCCG CGGTCTCTAC GAGTCCGACC CGGACGTCAA GAAGGTCATC
GACACCGCGC GTGGCATCGA GGGTCTGATC CGGCAGACGG GTGTGCACGC CGCCGGGGTC
ATCATGTCCG CCGAGCCGAT CATCGACCAC ATCCCGCTGA TGCGCCGGGA CGCCGACGGG
GCGATCATCA CGCAGTTCGA CTACCCGACC TGTGAGTCGC TCGGGCTGCT GAAGATGGAC
TTCCTCGGCC TGCGCAACCT GACCATCATC GACGACGCGC TGAAGAACAT CGAGAGCAAC
CACGGGCGTA CGGTCGACCT GCTCAAACTG CCGTTGGACG ACACGCCGAC CTACGAGCTG
CTGTCCCGTG GCGACACCCT GGGCGTGTTC CAGCTCGACG GCGGGCCGAT GCGGTCGCTG
CTGCGAACGA TGAAGCCGGA CAACTTCGAG GACATCTCCG CGGTCCTGGC GCTGTACCGG
CCGGGCCCGA TGGGCGCCAA CTCGCACACC AACTACGCGT TGCGCAAGAA CGGCCTCCAG
GAGATCACCC CGATCCACCC GGAGCTGGCC GAACCGCTGG AGGAGATCCT TGGGCCCACC
TACGGCCTGA TCGTCTACCA GGAGCAGGTG CAGCGTGCGG CCCAGGTCCT CGCCGGCTAC
AGCCTCGGCA AGGCCGACCT GCTGCGGCGG GCGATGGGCA AGAAGAAAAA AGAGGTGCTC
GACAAGGAGT TCGTGCCGTT CCGCGACGGA ATGCGCGGCA ACGGCTACTC CGACGAGGCC
ATCCAGACGC TGTGGGACAT CCTCGTCCCC TTCGCCGACT ACGCGTTCAA CAAGGCCCAC
ACCGCCGGGT ACGGCCTGGT GTCGTACTGG ACCGGCTACC TGAAGGCGAA CTACCCGGCC
GAGTACATGG CGGCACTGCT CACCTCCGTG GGCGATGACA AGGACAAGAT GGCGCTCTAT
CTGTCGGAGT GCCGCCGGAT GGGCATCCAG GTTCTCCCGC CGGACGTGAA CACCTCGGCC
GGGCCGTTCA CTCCGGTCGG CCGGGACATT CGCTTCGGCC TGGCCGCGAT CCGTAACGTC
GGTGCGAACG TGGTCAGCGC GATCATGCGC TGCCGCGGGG AGAAGAGCGC CTACACCGAC
TTCTACGACT TCCTGTCCAA GGTGGATGCG GTCGTCTGCA ACAAGAAGAC CATCGAATCT
CTGATCAAGG CGGGCGCGTT CGACTCACTC GACCATCCCC GCCGGGGCCT GCTCGCGGTG
CACGCCGACG CCATCGACGC GTACGCCGAC GTCAAGCGCA AGGAGGCCGT CGGCCAGTAC
GACCTGTTCG GCGCCGGGTT CGCCGACCCG GAGGTCGGCA CCAGCACCAC GGTGATGCCG
GCCATCGTCG ACGGGGAGTG GGACAAGCGG GAGAAGCTTG CCTTCGAGCG CGAGATGCTC
GGCCTCTACG TCTCCGACCA TCCGCTGTTC GGCCTGGAGC ATGTGCTCGG CAAGGAAGCC
GACACCACCA TCGCCGCCCT CTCGGAGGAG GGGACCATCC CCGACGGGAC GGTGGTGACA
CTCGCGGGCA TCCTCTCCGG GGTGCAGCGC CGGGTCACCA AGCAGGGCCG GGCGTGGGCG
TCGGCGACGC TGGAGGACCT GGCCGGCGGG GTGGAGACGC TGTTCTTCCC CAACACCTAC
GAGGTGATCG GGCAGTACAT CGCCGAGGAC GCGATCGTGG TGGTCAAGGG GCGGGTCGAC
CGCCGTGACG ACACACCGCG GATCATGGCG ATGGACATGT CGATCCCGGA TGTCAGCAGC
AACTCGGCCA ACAAGCCGGT CACCCTCACC ATTCCGGTCA CCCGGTGCAC GCCGCCGCTG
GTGGAGCGGC TCAAGGAGAC GCTGGTGCTG CATCCCGGCG ACGCGGAGGT GCACGTCAAG
CTCCTCAACG GCAGCAAGGT GACCCGGCTG CGACTCGGTC CGTTCCGGGT GGCGGCCACC
ACCGCCCTGA TGGCCGACCT GAAGAGTGTC CTCGGCCCGG CCAACGTGGG CTGA
 
Protein sequence
MGDSFAHLHV HTEYSMLDGA ARLKDLFAEV NRQGMPAVAM TDHGNMHGAN DFYKQAMAAG 
VTPILGIEAY VAPESRFHKQ RVRWGRPEQK SDDVSGSGGY THMTIWARNK VGLHNLFKLT
SRSFTEGFFV KWPRMDAELL AEHADGLMAT TGCPSGEVQT RLRLEQYDEA LKAAARYQDI
FGKENFFLEV MDHGIDIERR VRQELAEISH KLDIPPVVTN DTHYTHQEQS EAHDVLLCVQ
TAANVSDPNR FRFDGSGYYI KSADEMRAID SSDLWLQGCR NTLLVAEKVD PTGMFDFRNL
MPRFPVPEGE TDESWFRKET FKGLARRFPD GVPETHVKQA EYELGVIIQM GFPSYFLVVA
DFIQWAKGEG IAVGPGRGSA AGSLVAYALG ITDLDPLPHG LIFERFLNPE RVSMPDVDID
FDERRRGEVI KYVTEKWGED KVAQIATFGT IKAKAAIKDS ARVLGYPYAV GDRITKAMPP
AVMGKDIPLT GIFDPKHSRY AEAGEIRGLY ESDPDVKKVI DTARGIEGLI RQTGVHAAGV
IMSAEPIIDH IPLMRRDADG AIITQFDYPT CESLGLLKMD FLGLRNLTII DDALKNIESN
HGRTVDLLKL PLDDTPTYEL LSRGDTLGVF QLDGGPMRSL LRTMKPDNFE DISAVLALYR
PGPMGANSHT NYALRKNGLQ EITPIHPELA EPLEEILGPT YGLIVYQEQV QRAAQVLAGY
SLGKADLLRR AMGKKKKEVL DKEFVPFRDG MRGNGYSDEA IQTLWDILVP FADYAFNKAH
TAGYGLVSYW TGYLKANYPA EYMAALLTSV GDDKDKMALY LSECRRMGIQ VLPPDVNTSA
GPFTPVGRDI RFGLAAIRNV GANVVSAIMR CRGEKSAYTD FYDFLSKVDA VVCNKKTIES
LIKAGAFDSL DHPRRGLLAV HADAIDAYAD VKRKEAVGQY DLFGAGFADP EVGTSTTVMP
AIVDGEWDKR EKLAFEREML GLYVSDHPLF GLEHVLGKEA DTTIAALSEE GTIPDGTVVT
LAGILSGVQR RVTKQGRAWA SATLEDLAGG VETLFFPNTY EVIGQYIAED AIVVVKGRVD
RRDDTPRIMA MDMSIPDVSS NSANKPVTLT IPVTRCTPPL VERLKETLVL HPGDAEVHVK
LLNGSKVTRL RLGPFRVAAT TALMADLKSV LGPANVG