Gene Daro_1942 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_1942 
Symbol 
ID3567871 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp2090875 
End bp2093958 
Gene Length3084 bp 
Protein Length1027 aa 
Translation table11 
GC content61% 
IMG OID637680413 
ProductDNA polymerase III, alpha subunit 
Protein accessionYP_285158 
Protein GI71907571 
COG category[L] Replication, recombination and repair 
COG ID[COG0587] DNA polymerase III, alpha subunit 
TIGRFAM ID[TIGR00594] DNA-directed DNA polymerase III (polc) 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0322539 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCACTGC CCGAGTACGC AGAACTCCAC TGTCTTTCCA ACTTCAGCTT CCTACGCGGT 
GCCTCGCATC CGGAAGAGCT GGCGGCACGT GCGCTAGCGC AAGGCTATGC TGCCCTGGCG
CTGACTGATG AATGCTCCCT GGCCGGTGTT GTTCGGGCGC ATCTGGCGGC CAAGAAGCAT
GGCCTGAAAT TCATCGTCGG TAGCGAGATG ATGACCGTCG ATGGCCTGAA GCTGGTTTTT
CTGGCCTGCA ATCGCCATGG TTACGGCAAT CTCTCGGCGC TAATCACGCT GGCTCGGCGG
CGCGCGGAGA AGGGGGGCTA CACCCTGCAC CGCAATGATC TGGAGTCCAT ATCGCCGAGT
GGTGCGCTAC CTGACTGTCT GGTGCTCTGG GTGCCGGGCA ATAATCCATC ATCCGCGGAT
GGCGAGTGGC TGGTGAGTCG CTTTCCTGGT CGGAGTTGGA TTGCAGTCGA ATTGCATGCC
GGGCCGGAGG ATGCGGAGAG GCTGGCTGGC TTGCAGTCCT TGGGTGAGGT CTGCGGCCTG
CCATTGGTCG CGACCGGCGA TGTCCATATG CACATCAAGG CCCGTCGGCC GGTGCAGGAT
GTGCTGACGG CCTTGCGTCT GAAAAGTACG GTTTTCGAGG CTGGCTACGC GCTATTCCCC
AACGGCGAGC GTCATTTGCG CTCCCGGCTC CGGCTGTCTC GACTCTATCC GCCAGAACTT
CTGGCTGAAA CCTTGAACAT CGCGGCACGC TGTGAGTTTT TACTCGATGA ACTGCGCTAT
GAATACCCGG AAGAAATCAT TCCATCGGGT GAAACACCGG CCAGCTGGCT GCGCAGCGAA
ACCGAGCGCG GCCTGCAACG GCGTTACCCG GCCGGTGTTC CTGGCAGTGT TCGCGAGCGG
ATTGAACACG AACTGGGCCT GATTGCCGAA ATGGCCTATG AGGCCTATTT CCTGACGGTT
TACGACATCG TCTGCTACGC CCGTAGCCAG GATATTCTTT GTCAGGGCCG TGGTTCGGCC
GCCAATTCGG CCGTCTGCTA CGCGCTGGGC GTGACTGAAG TCGATCCAGC GCGTTCGGCC
TTGCTGTTCG AGCGCTTCGT TTCGAAAGAA CGCGGCGAAC CGCCGGATAT CGATGTCGAT
TTCGAGCACG AACGGCGCGA GGATGTCATC CAGTACATCT ACACCAAGTA CGGTCGTGAG
CGGGCGGCAC TGGCCGCAGC GTTGATCACC TACCGGACCA AGGGCGCCCT GCGCGATGCT
GGCCGGGCGC TGGGTTTCGG CATTGCCCAG ATCAACGCGC TGACCGCTTC GCTGGCCTGG
TGGGACAAGC GCGAGCAGTT GCCGGAACGC TTCGCCGAGC TTGGTCTCGA TCCGCATGCG
CCGCGTGTCG AGAAATGGCT GGCCATCGCC GAGGCTCTGC GTGGATTTCC CCGTCACCTG
ACGCAGCATG TCGGCGGCTT CGTCATCTCG CGCGGGCCGC TGTCCCGGCT GGTGCCGGTT
GAAAATGCGG CGATGTCGGC GCGCAGTGTG ATTCAGTGGG ACAAGGATGA TCTTGATGCG
ATGGGGCTGA TGAAAGTCGA CATCCTGGCT CTCGGCATGC TGTCGGCGAT CCGCCGGATG
CTGCAGATCG TTGGCGAGAC GACCGGTCGA CCGATGAAAA TGCAGGATAT TCCGGCCGAA
GACCCGGCCA CCTATGAAAT GCTCTGTCAC GCCGACAGCA TGGGTGTTTT CCAGGTCGAA
TCCCGTGCCC AGATGGCCAT GTTACCGCGC CTCAGGCCAC AAAATTTCTA CGATCTGGTT
GTCGAAGTGG CGCTGGTGCG GCCGGGGCCG ATTCAGGGCG ACATGGTGCA TCCCTACCTG
AAGCGGCGGC AGGGCAGGGA GCGGATCGAG GAAATTTCGC CAGCCGTCGA TGCCGTGCTC
GAACGCACCT ATGGTGTGCC GATCTTTCAG GAGCAGGTCA TGCAACTGGC CGTCGTCGCT
GCCAACTTCA CGCCCGGCGA GGCCGACCAG CTACGCCGGG CAATGGCGGC CTGGAAGCGC
AAGGGCGGCT TGGAACCATT CGAGCAGAAA CTGCTGGCTG GCATGGCCGC CAACGATTTG
CCGGAGAGCT TCGCCCGACG GATCATCGCC CAGATCCAGG GCTTCGGTGA ATACGGCTTT
CCCGAATCCC ATGCCGCCAG TTTTGCCTTG CTCGTTTATG CCTCGGCCTG GCTCAAGCGC
CACCACCCGG CCGCCTTCCT GTGCGGTCTG CTCAACAGCC AGCCGATGGG TTTCTACTCG
CCGTCGATGC TGATTCAGGA TGCTCGTCGT CATGGTGTTA GGGTACTGCC CCCCGATGTT
ATGACCAGCG ACTGGGATAG CCGGCTTGAC GAGCGCGGTG CCGTGCGTCT CGGTCTGCGC
GAGATTAGTG GCTTCTCGGT GGCCGCCGCC AAACGCATTA CTGCGGTCTG CCGGGAAAAT
CAGCCCTTTT TGAATGTCGC CGATCTGGCA GCGAGGGCCG GGCTGCAACG GCGTGATCTC
GACCTGCTGG CCGCTGGCGA TGCCTTGCAG GGGCTGGCCG GCCATCGTCG GCAAGCCGCC
TGGGCTGCCA CGGTAGCTGT TGTACAGGGA GACCTGTTTG ACGGTACGCC GGTGGTCGAA
GCAGAAATCG AGCTGCCAGC GCCGAGTGAT GGTGAAAATC TGGTTGCCGA TTACCGCAGC
CTTGGCCTGA CGCTACGTTC GCATCCGCTG AGCCTGTTGC GTCAGTATCT GGCTGAACGT
CGATTCGTGA CTGCGGCCGA TCTAAAAATG GCGGGGCACC ATACCTTGAT CCGATCCGTT
GGCATTGTCG TCGGCCGTCA GCGACCGGGC ACCGCAACCG GCATTGTCTT TGTCACGCTG
GAAGACGAAA CCGGATTGAG CAACGTGGTC GTTCATCCGC AACTCGTCGA AAAACAGCGC
CGCGAACTAC TTGGCTCAAC GCTGCTCGGG GTTTACGGAC AATTGCAGGT CGAAGGAGAG
GTGGTGCATC TGGTCGCCAA GCGCTTGGTT GATCTCTCCG CCTGGCTGGG GCGGCTGGAA
ACGGTCAGCC GGGATTTTCA CTGA
 
Protein sequence
MSLPEYAELH CLSNFSFLRG ASHPEELAAR ALAQGYAALA LTDECSLAGV VRAHLAAKKH 
GLKFIVGSEM MTVDGLKLVF LACNRHGYGN LSALITLARR RAEKGGYTLH RNDLESISPS
GALPDCLVLW VPGNNPSSAD GEWLVSRFPG RSWIAVELHA GPEDAERLAG LQSLGEVCGL
PLVATGDVHM HIKARRPVQD VLTALRLKST VFEAGYALFP NGERHLRSRL RLSRLYPPEL
LAETLNIAAR CEFLLDELRY EYPEEIIPSG ETPASWLRSE TERGLQRRYP AGVPGSVRER
IEHELGLIAE MAYEAYFLTV YDIVCYARSQ DILCQGRGSA ANSAVCYALG VTEVDPARSA
LLFERFVSKE RGEPPDIDVD FEHERREDVI QYIYTKYGRE RAALAAALIT YRTKGALRDA
GRALGFGIAQ INALTASLAW WDKREQLPER FAELGLDPHA PRVEKWLAIA EALRGFPRHL
TQHVGGFVIS RGPLSRLVPV ENAAMSARSV IQWDKDDLDA MGLMKVDILA LGMLSAIRRM
LQIVGETTGR PMKMQDIPAE DPATYEMLCH ADSMGVFQVE SRAQMAMLPR LRPQNFYDLV
VEVALVRPGP IQGDMVHPYL KRRQGRERIE EISPAVDAVL ERTYGVPIFQ EQVMQLAVVA
ANFTPGEADQ LRRAMAAWKR KGGLEPFEQK LLAGMAANDL PESFARRIIA QIQGFGEYGF
PESHAASFAL LVYASAWLKR HHPAAFLCGL LNSQPMGFYS PSMLIQDARR HGVRVLPPDV
MTSDWDSRLD ERGAVRLGLR EISGFSVAAA KRITAVCREN QPFLNVADLA ARAGLQRRDL
DLLAAGDALQ GLAGHRRQAA WAATVAVVQG DLFDGTPVVE AEIELPAPSD GENLVADYRS
LGLTLRSHPL SLLRQYLAER RFVTAADLKM AGHHTLIRSV GIVVGRQRPG TATGIVFVTL
EDETGLSNVV VHPQLVEKQR RELLGSTLLG VYGQLQVEGE VVHLVAKRLV DLSAWLGRLE
TVSRDFH