Gene SAG0939 details

Gene Information

Locus tag: SAG0939
Symbol: dnaE
ID: 1013743
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptococcus agalactiae 2603V/R
Kingdom: Bacteria
Replicon accession: NC_004116
Strand:
Start bp: 943547
End bp: 946651
Gene Length: 3105 bp
Protein Length: 1034 aa
Translation table: 11
GC content: 34%
IMG OID: 637316124
Product: DNA polymerase III DnaE
Protein accession: NP_687951
Protein GI: 22537100
COG category: [L] Replication, recombination and repair
COG ID: [COG0587] DNA polymerase III, alpha subunit
TIGRFAM ID: [TIGR00594] DNA-directed DNA polymerase III (polc)
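
The start, end, gene length and protein length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon, neither of which the record states explicitly. The function name cds_lengths is hypothetical.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS."""
    # Assumption: coordinates are 1-based and inclusive.
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Assumption: the last codon is a stop codon and is not translated.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


# Values taken from the Start bp and End bp fields of this record.
gene_len, protein_len = cds_lengths(943547, 946651)
print(gene_len, protein_len)  # 3105 1034
```

Under these assumptions the result agrees with the listed Gene Length (3105 bp) and Protein Length (1034 aa).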


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTTGCAC AACTTGATAC AAGAACCGTT TATTCTTTTA TGGATAGCTT GGTTGATTTA 
AAGACATATG TCTCTAAATC GAAATCATTA GGATATCAAA CTATTGGTAT CTTGGACCAC
AGTAATCTTT ATGCAGCATA TCATTTTATT CAAGAGGCGC AAAAGGCAAA CCTAAGACCA
ATAGTTGGAT TTAGCTTTGA TATAGTGGTT GAAAATAGAT CAATAGAAGT TTACTGTATT
GCGATAAATA CAGTTGGTTA TAAGAATCTC TTGAAGTTGT CAACGGCTCA AATGTCTGAG
AAGATGTCTC TAAATTTATT GACAGAACAT CTTGAAGGGG TGCAATTGAT TCTTCCCTAT
CAAGATGTTC TTGATCAGCT CAATTTACCC TTTGATTATG TTATTGGTGT TAATTTGACC
TCTCCGCAGA TTCCATATAC CAAACCCATT ATAGCCATAG ATACTGTTCG ATATTTTCAA
AAAAATGATA TTGAGACATT ACAGATGCTT CATGCTATAA GAGATAATGT CCTGTTAAAA
GACGCACAAT ATGCGAGCAA AAATCAAGAA TTAAAACCTT GTCAAGAAAT GACTCTAGCA
TTCAAGGAAC GTTTTCCTGA AGCTTTAGCT AACCTAGAAT CTTTGGTAGA GAATGTTGCT
TATCACTTTG ATTCTGACTT TAAGTTACCT ATTTTTAATA GAGAAATTCC TGCTGGAGAA
GAATTAAGAA CCTTAACTCA AAATAACTTA AAAAGTAAAG GTTTATGGAG CGATGCTTAT
CAAGAGCGTC TCGAAAAAGA ATTAACAATC ATTCACAAAA TGGGCTTTGA TGACTATTTC
CTTATTGTGT GGGATTTATT AAGATTTGGT CGTAGTAAGG GCTACTATAT GGGAATGGGA
CGTGGTTCAG CAGCGGGAAG TTTAGTCTCA TATGCTTTAA ATATTACCGG TATCGATCCT
GTCAAACATA ATTTAATTTT TGAACGTTTT TTAAATGAAG AGCGTTATTC GATGCCAGAT
ATTGATATTG ATCTTCCGGA TATTTATCGT GGAGAATTCT TGCGCTATGT ACGTAATCGA
TATGGTTCAA TGCATTCTGC ACAGATTGTA ACCTTCTCAA CTTTTGGTGC GAAACAGGCT
ATTCGAGATG TTTTTAAACG CTTTGGTGCA TCAGAGTATG AGTTAACAAA CATAACTAAA
AAAATTCATT TCAGAGATAA TTTGACCAGT GTTTATAACC GCAACTTAGC TTTCAGACAA
ATAATAGATA GTAAAATTGA ATACCAAAAA GCCTATGATA TTGCTAAGCG TATTGAAGGA
AATCCTAGAC AGACTTCTAT TCATGCCGCT GGGGTAGTAA TGAGTGACGA TTTGTTAACA
GATCATATTC CTTTAAAAAA CGGTGAGGAT ATGATGATAA CGCAATATGA TGCTAGTTCT
GTTGAGGATA ATGGTCTTCT AAAAATGGAC TTTCTAGGTC TTCGAAATTT GACATTTGTT
CAGAAAATGA AGGAGAAAGT CGATAAGGAC TACGGTATCT CTATACAATT AGAAACTATT
GATTTAGAAG ATAAAGAGAC TCTAAAACTA TTTGCAGCTG GTCAAACAAA GGGGATTTTT
CAATTTGAGC AAAGTGGAGC TATTAATTTG TTGCGACGTA TTAGGCCAGA GTGTTTTGAA
GATGTGGTAG CTACCACCAG CTTGAATAGG CCTGGTGCAA GTGATTATAC TGAGAATTTT
ATAAACCGTC GTTTTGGTAA AGAAAAAATC GATTTAGTAG ATCCTGTCAT AGCTCCAATT
TTACAACCTA CCTATGGCAT AATGCTCTAT CAAGAACAGG TTATGCAGAT TGCCCAGACC
TACGCAGGTT TTACGCTAGG CAAGTCTGAT TTGCTCAGAC GTGCTATGTC TAAGAAAAAT
TCTAAGGAGA TGCAAAAGAT GTCTCAATCC TTCTTAGAGG GTGCAGTATC TAAGGGACAT
CGTCAAGAAG ATGCTCGACT TATATTTGAA CGCATGGCTA AATTTGCAGG TTATGGGTTT
AATCGCAGCC ATGCGTTTGC TTATTCGGCT TTAGCTTTTC AATTAGCCTA TTTCAAAGCC
CATTATTCCG ATGTCTTTTA TGACATCATG ATAAATTATT CAAATAGTGA TTATCTAATC
GATGCTATTG ATTTTGGTTT TGTTATTGAA AAACCTTCAA TTAATACTAT TTCATACAGA
GACCGCATCT ATAAAAAGAA AATTTATTTA GGTTTAAAAA ATATTAAGGG TGTTCCTAAC
GATTTGGCTT ATTGGATTTC TAAAAATCAA CCGTTTCAGA GTATTGAAGA TTTTCTAATG
AGGCTGCCAC AGCAGTTTCA AAAAAGTGGT TTTATTTCTC CGCTAATTGC TATTGGAGCA
TTTGACGAAT TTGATAATAA TAGACGGAAA ATCACCTCTA ATCTTGATTC TTTATTTACA
TTTGTAAATG AACTAGGTAG TTTATTTGCA GATACCTCAT ATCATTGGCT AGAAGTAGAA
GATTTCTCTA ATTCTGAGAA ATATGAAATG GAACAAGATA TCTTAGGAGT AGGAATCAGC
CCGCACCCTT TAGTGGGGAT TTCACAAAAA GCTAGCCGTC CGTTTATACC AATTAGTCAA
GTTCAAGAAA ATAGCGAAGC TACTATTTTA GTTCAGCTAA AGCAGGTCAA AGTTATTAGG
ACAAAAAGTT CTGGGCAACA AATGGCTTTC TTGACTGTTA TGGATATTAA TTCAAAAATG
GATATTACTG TTTTTCCTGA AACTTTCAAT ATTGTAAGAG ATGATTTACA AGAAGGGAAA
TATTATTACT TGCATGGAAA AATTCAGAAG CGAGATGAGC GATTACAGAT GGTTTTAAAC
GGTGTTCAAG AAGCAACGGA GGAACGTTTT TGGATACTTC TAAAAAATCA TGATAACGAC
AAAAAAATAT CGGAAATATT GTCCAAATAC AAAGGGCATA TTCCTGTCTA TTTGCATTAT
GAAACAACTA AAGAAACTAT CCAAAGCAAA GTACATTTGG TTAGAAAAGA TAGTGGCTTA
GCCCTTGATT TATCTGAATT TGTTGTGAAA ACGGTTTATC AATAA
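
The gene sequence above is printed in blocks of 10 bases, 60 per line, so the spaces and line breaks need to be removed before the sequence can be used. The following is a minimal sketch of that step with a basic sanity check. The helper names join_blocks and looks_like_complete_cds are hypothetical, and the check is a simplification: it accepts only ATG as a start codon, although translation table 11 allows others.

```python
def join_blocks(lines):
    """Concatenate blocked sequence lines, dropping all whitespace."""
    return "".join("".join(line.split()) for line in lines).upper()


# Stop codons of translation table 11 (the table listed for this gene).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def looks_like_complete_cds(seq):
    """Simplified check: whole codons, ATG start, stop codon at the end."""
    return len(seq) % 3 == 0 and seq[:3] == "ATG" and seq[-3:] in STOP_CODONS


# Excerpt only: the first and last lines of the gene sequence in this record,
# not the full 3105 bp gene.
excerpt = join_blocks([
    "ATGTTTGCAC AACTTGATAC AAGAACCGTT TATTCTTTTA TGGATAGCTT GGTTGATTTA ",
    "GCCCTTGATT TATCTGAATT TGTTGTGAAA ACGGTTTATC AATAA",
])
print(len(excerpt), looks_like_complete_cds(excerpt))  # 105 True
```

Passing all of the gene sequence lines to join_blocks in the same way should give a single string whose length can be compared with the Gene Length field.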
 
Protein sequence
MFAQLDTRTV YSFMDSLVDL KTYVSKSKSL GYQTIGILDH SNLYAAYHFI QEAQKANLRP 
IVGFSFDIVV ENRSIEVYCI AINTVGYKNL LKLSTAQMSE KMSLNLLTEH LEGVQLILPY
QDVLDQLNLP FDYVIGVNLT SPQIPYTKPI IAIDTVRYFQ KNDIETLQML HAIRDNVLLK
DAQYASKNQE LKPCQEMTLA FKERFPEALA NLESLVENVA YHFDSDFKLP IFNREIPAGE
ELRTLTQNNL KSKGLWSDAY QERLEKELTI IHKMGFDDYF LIVWDLLRFG RSKGYYMGMG
RGSAAGSLVS YALNITGIDP VKHNLIFERF LNEERYSMPD IDIDLPDIYR GEFLRYVRNR
YGSMHSAQIV TFSTFGAKQA IRDVFKRFGA SEYELTNITK KIHFRDNLTS VYNRNLAFRQ
IIDSKIEYQK AYDIAKRIEG NPRQTSIHAA GVVMSDDLLT DHIPLKNGED MMITQYDASS
VEDNGLLKMD FLGLRNLTFV QKMKEKVDKD YGISIQLETI DLEDKETLKL FAAGQTKGIF
QFEQSGAINL LRRIRPECFE DVVATTSLNR PGASDYTENF INRRFGKEKI DLVDPVIAPI
LQPTYGIMLY QEQVMQIAQT YAGFTLGKSD LLRRAMSKKN SKEMQKMSQS FLEGAVSKGH
RQEDARLIFE RMAKFAGYGF NRSHAFAYSA LAFQLAYFKA HYSDVFYDIM INYSNSDYLI
DAIDFGFVIE KPSINTISYR DRIYKKKIYL GLKNIKGVPN DLAYWISKNQ PFQSIEDFLM
RLPQQFQKSG FISPLIAIGA FDEFDNNRRK ITSNLDSLFT FVNELGSLFA DTSYHWLEVE
DFSNSEKYEM EQDILGVGIS PHPLVGISQK ASRPFIPISQ VQENSEATIL VQLKQVKVIR
TKSSGQQMAF LTVMDINSKM DITVFPETFN IVRDDLQEGK YYYLHGKIQK RDERLQMVLN
GVQEATEERF WILLKNHDND KKISEILSKY KGHIPVYLHY ETTKETIQSK VHLVRKDSGL
ALDLSEFVVK TVYQ