Gene Sare_3372 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_3372 
Symbol 
ID5703406 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp3894281 
End bp3896980 
Gene Length2700 bp 
Protein Length899 aa 
Translation table11 
GC content68% 
IMG OID641272798 
ProductDNA polymerase I 
Protein accessionYP_001538165 
Protein GI159038912 
COG category[L] Replication, recombination and repair 
COG ID[COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains 
TIGRFAM ID[TIGR00593] DNA polymerase I 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0398668 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0449106 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACAGCCA CGAAGCCGCG CCTGCTCCTC GTTGACGGAC ACTCCCTGGC ATACCGGGCG 
TTCTTTGCCC TGCCGGTGGA GAACTTCTCC ACCACGACGG GCCAACCGAC CAACGCGGTC
TACGGCTTCA CCTCGATGTT GATCAACGTG TTGCGGGACG AGCGGCCAAC GCACATCGCG
GTGGCCTTTG ACGTGTCCCG GCGTTCCTTC CGCACCGAGA AGTACGCGGC GTACAAGGCC
GGCCGCAGTG AGACCCCGGC CGACTTCAAG GGCCAGGTCA GCCTGGTCAA GGAGATCCTG
GCGGCGTTGC GCGTCCCGGC GGTGGAGCTG GAGGGCTACG AGGCGGACGA CGTGATTGCC
ACGCTCACCC GCCAGGCTCG CGAGCAGGGC ATGTCGGTGC TGATCACCAC CGGCGACCGG
GACGCCTTCC AACTCGTCGA CGACGATGTC ACCGTGCTCT ACCCGCGGAA GGGCGTCTCC
GACCTGGCCC GGATGGATCC GGCGGCGGTG ACGGCGAAAT ACGGTGTACC CCCGGAGCTA
TACCGGGACC TCGCCGCGCT GGTCGGCGAG ACCAGTGACA ACTTGCCCGG CATTCCGGGG
GTCGGTCCGA AGACCGCCGC CAAGTGGATC ACCACCTACG GCGGGGTCGA GGGGATCGTG
GCCCGGGCCG ACGAGATCAA GGGTAAGGCC GGGGTCAATC TGCGGGAGCG TCTTGCCGAT
GTGATCCGTA ACCACGAGAT CAACTGGCTG GTCGCCGACC TGGATCTGCC CTTGCGTCCA
GAGGACACGC TCTGGGCCGG ATGGGACCGG GAGGCGGTGC ACCAGGTGTT CGACACCCTG
GAGTTCCGTA TCCTTCGCGA CCGGCTCTAC CAATACCTGG AGGCCGTCGA GCCGGAGGCC
GAAGCGGGCT TTGACCTGGC CGGTGAGGTC CTCGTCGAGC CGGGGGCGCT CACCGCGTGG
CTGGGCACGC ACGCGACGGC GGACACGCCG GTTGGCCTCG CCGTGACACT CGACACCGGC
CCCAATCGAC GGCACGTCGC CGGGGTGACC GGCATGGCGC TGGCGACCGC TGCCGGGGCG
GGGGCGTGGT TCGACCCGGG CCGGCTCGAG GCCGCCGACG AGAGTGCCCT GGCCGGGTGG
CTGGCTGATG AGACCCGACC AAAGGTGCTG CACGACAGCA AGCCGGCAGT GCTCGCGTTC
GCCGCGCACG GGTGGTCGAT CGAGGGCATC ACCCGCGACA CCCAGATCGC CGCCTACCTG
GCCCGGCCCG ACCAGCGATC CTACGATCTG GTCGACCTGG CCCTGCGGTA CCTGCATCGG
GAGCTGCGGG TCGATACGCC GGATACCGGC CAGCTCACCT TCGAGGGCCT GGGTAACGAC
GGCGAGGCTG AGCAGAACCT GATGCTGCGG GCGCGAGCCA CCCTCGACCT CGCCGACGCG
ATCGACGCCG AGCTGTCTCG CGACGGCGAA CAGTCGTCCC GGTTGATGGC CGGGGTGGAG
CTACCGCTGA TGCGGGTCCT CGCGAGTATG GAACGCACCG GCATCGCCGC CGACACGCAG
TACCTGTCCG AGTTGGAGGC GCAGTTCGCC GCGGAGGTGA AGGCCGCCGC GCAGGGCGCG
TACGAGGCGG TGGGGCGCGA GTTCAACCTC GGCTCCCCCA AGCAGTTGCA GGAGATCCTC
TTCGTCGAGC TGGGCCTGCC GAAGACCAAG CGGATCAAGA CGGGTTACAC CACCGACGCG
GATGCCCTGC AGTGGCTCCA CGCGCAGAAT CCGCACCCGG TGCTCGGGCA CCTGTTGCGG
CACCGGGACG TGGCGAAGCT GAAGACCACG GTCGACGGTC TCCTCAAGTC GGTCTCGGAC
GACGGCCGGA TCCACACCAC CTTCAACCAG ACCGTGGCGG CCACCGGTCG GCTCTCCTCC
ACCGAGCCGA ATCTGCAGAA CATCCCGATC CGTACCGAGG AAGGGCGGCG GATTCGTCGG
GCGTTCGTGG TCGGTGAGGG GTACGAGTGC CTACTGACCG CCGACTACAG CCAGATCGAG
ATGCGGATTA TGGCGCACCT GTCGGCGGAC GACGTTCTGA TCGACGCGTT CAACTCGGGC
CGCGACTTTC ACGCGGCCAC CGCCTCGTCG GTGTTCCAGG TTCCGGTGGA GGAGGTCACC
ACCGACCAGC GGCGCAAGAT CAAGGCGATG AACTACGGCC TGGCGTACGG GCTGAGTGCG
TTCGGTCTCT CCCAGCAGCT CGGCGTCACC GCCGAGGAGG CGCGCGGACT GATGGAGATC
TACTTCGCCG GCTTCGGTGG GGTGCGCGAC TACCTCCAGG AGGTGGTGGC ACGGGCCCGG
CACGACGGTT ACACCGAGAC CGTCCTCGGT CGTCGCCGCT ACCTGCCTGA CCTGGTCAGC
GACAACCGGC AGCGGCGGGA GATGGCCGAG CGAATGGCGC TCAACGCTCC GATCCAGGGC
TCCGCCGCCG ACATCATCAA GGTCGCGATG ATGCGTGTCG ACTCGGCGGT GCACGACGCC
GGTCTACGTT CCCGGATGTT GCTCCAGGTG CACGACGAAC TGGTCTTCGA GGTGGCCCCC
GGCGAGCGGG AGAGGCTCGA GGAACTGGTC CGCCGGGAGA TGGGCGAGGC GTACCCGCTG
TCGGTGCCGC TGGAGGTGTC AGTCGGCGAC GGGCGCGACT GGAACAGCGC CGACCACTGA
 
Protein sequence
MTATKPRLLL VDGHSLAYRA FFALPVENFS TTTGQPTNAV YGFTSMLINV LRDERPTHIA 
VAFDVSRRSF RTEKYAAYKA GRSETPADFK GQVSLVKEIL AALRVPAVEL EGYEADDVIA
TLTRQAREQG MSVLITTGDR DAFQLVDDDV TVLYPRKGVS DLARMDPAAV TAKYGVPPEL
YRDLAALVGE TSDNLPGIPG VGPKTAAKWI TTYGGVEGIV ARADEIKGKA GVNLRERLAD
VIRNHEINWL VADLDLPLRP EDTLWAGWDR EAVHQVFDTL EFRILRDRLY QYLEAVEPEA
EAGFDLAGEV LVEPGALTAW LGTHATADTP VGLAVTLDTG PNRRHVAGVT GMALATAAGA
GAWFDPGRLE AADESALAGW LADETRPKVL HDSKPAVLAF AAHGWSIEGI TRDTQIAAYL
ARPDQRSYDL VDLALRYLHR ELRVDTPDTG QLTFEGLGND GEAEQNLMLR ARATLDLADA
IDAELSRDGE QSSRLMAGVE LPLMRVLASM ERTGIAADTQ YLSELEAQFA AEVKAAAQGA
YEAVGREFNL GSPKQLQEIL FVELGLPKTK RIKTGYTTDA DALQWLHAQN PHPVLGHLLR
HRDVAKLKTT VDGLLKSVSD DGRIHTTFNQ TVAATGRLSS TEPNLQNIPI RTEEGRRIRR
AFVVGEGYEC LLTADYSQIE MRIMAHLSAD DVLIDAFNSG RDFHAATASS VFQVPVEEVT
TDQRRKIKAM NYGLAYGLSA FGLSQQLGVT AEEARGLMEI YFAGFGGVRD YLQEVVARAR
HDGYTETVLG RRRYLPDLVS DNRQRREMAE RMALNAPIQG SAADIIKVAM MRVDSAVHDA
GLRSRMLLQV HDELVFEVAP GERERLEELV RREMGEAYPL SVPLEVSVGD GRDWNSADH