Gene Sare_4039 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4039 
Symbol 
ID5705020 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4596467 
End bp4597426 
Gene Length960 bp 
Protein Length319 aa 
Translation table11 
GC content72% 
IMG OID641273465 
ProductDNA polymerase LigD polymerase subunit 
Protein accessionYP_001538820 
Protein GI159039567 
COG category[L] Replication, recombination and repair 
COG ID[COG3285] Predicted eukaryotic-type DNA primase 
TIGRFAM ID[TIGR02776] DNA ligase D
[TIGR02778] DNA polymerase LigD, polymerase domain 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.437287 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0570226 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTGAGG CCGAGGAGAC CCGGAACGGG GTCGCACTGA CGAACCTGAA CCAGCCGTTG 
TTCGACGGGG CGGGCGCGAC CAAGCGTGAC CTGGTCAACT ACCTGGACGC GGTGCGCGAC
CGTATCCTGC CCCACCTGCG GGACCGGCCA CTGTCGGTGA TGCGGGTGCG GCCCGGCCAA
CCTCCGTTCA TGCAGAAGAA CCTTCCGAAG TACACCCCGG ACTGGATCCC CCGGACGGGA
GTGTGGGCGG AGGCGTCGCA CCGCGAGATC TCGTACGCCC TCTGCGGCGA CCGGCGCACC
CTGCTCTGGC TCGCCAATCA GCGGGCGGTC GAATTTCACC CCACCCTCGC CACGGTCGCG
GACCTGCGCT GCCCGACTCA CCTCGTGCTC GACCTGGACC CGCCGGAGGG CGCACCGTTC
GAGTCGGCGG TGGCCGCGGC CCTCCTGGTT CGACAGGCTC TCACCGAGGC TGGGCTTGTC
GGGGCGGTGA AGACCAGCGG CGCCAAGGGG GTGCACGTGT TCGTGCCGGT GACGGCGGGT
GCAACGGCGG AGGACCTTGC TGCCGCCACC CGAGCGCTCG CGCTCCGGGC TGCGCGCCTC
GATCCGGACC TCGCGACGAC CGCCTTCATT CGGGAGGACC GGGGCGGAAA AGTCTTCATC
GACTCCACCC GGGCTGGTGG GGCAACGGTT GTGGCCGCGT ACAGCCCGCG GCTGCGGCCC
GGTGTGCCGG TCTCCTTCCC GGTGCCCTGG GCTACCCTGC CGTCGGTCAC ACCCTCCGAC
TTCACGGTCC GGACCGCGCC CGAACTGGTC GCATCGGGGG ACCCGTGGGC GGACGCGATG
CCCACGGCCC AGCGACTCCC GGACGACCTG GTCGCCGAAG GCCACACCAT CCCGGTGGCC
CGGGTGCAGG CGATGCACGA GGGGAAGCGA CGGGCGCGCG CCCGGCGGGC CGCCGGCTGA
 
Protein sequence
MGEAEETRNG VALTNLNQPL FDGAGATKRD LVNYLDAVRD RILPHLRDRP LSVMRVRPGQ 
PPFMQKNLPK YTPDWIPRTG VWAEASHREI SYALCGDRRT LLWLANQRAV EFHPTLATVA
DLRCPTHLVL DLDPPEGAPF ESAVAAALLV RQALTEAGLV GAVKTSGAKG VHVFVPVTAG
ATAEDLAAAT RALALRAARL DPDLATTAFI REDRGGKVFI DSTRAGGATV VAAYSPRLRP
GVPVSFPVPW ATLPSVTPSD FTVRTAPELV ASGDPWADAM PTAQRLPDDL VAEGHTIPVA
RVQAMHEGKR RARARRAAG