Gene SAG1005 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSAG1005 
SymboltopA 
ID1013809 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptococcus agalactiae 2603V/R 
KingdomBacteria 
Replicon accessionNC_004116 
Strand
Start bp1014001 
End bp1016121 
Gene Length2121 bp 
Protein Length706 aa 
Translation table11 
GC content36% 
IMG OID637316189 
ProductDNA topoisomerase I 
Protein accessionNP_688016 
Protein GI22537165 
COG category[L] Replication, recombination and repair 
COG ID[COG0550] Topoisomerase IA
[COG0551] Zn-finger domain associated with topoisomerase type I 
TIGRFAM ID[TIGR01051] DNA topoisomerase I, bacterial 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00429671 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAACTA CGACAAAAAC GTCGACCAAA AAAACAAGTA AAAAAAAATC AGCTACTGCT 
AAAAAAAATC TAGTTATTGT GGAGTCTCCT GCAAAAGCAA AGACTATCGA AAAATACTTA
GGACGTAACT ATAAAGTTGT AGCCTCAGTA GGTCATATCC GTGATTTAAA AAAGTCAAGT
ATGTCCATTG ATTTTGAGAA TAATTACGAA CCACAATATA TTAATATTCG TGGTAAAGGA
CCTCTCATTA ATGATTTAAA AAAAGAAGCT AAAAAGGCTA AAAAAGTTTA CCTCGCGAGT
GACCCGGACC GTGAAGGAGA AGCTATTTCC TGGCATTTAG CGCATATTTT AGATTTAGAC
AAAGAAGACA GAAACCGTGT TGTTTTCAAT GAAATCACAA AAGATGCTGT TAAAAATGCT
TTTGTGGAAC CTCGTCAGAT TAACATGGAT CTCGTTGATG CTCAGCAAGC ACGACGTGTT
TTAGATCGCA TTGTTGGTTA TTCTATCTCA CCTATTTTAT GGAAAAAAGT AAAAAAAGGC
CTATCAGCTG GACGTGTACA ATCAGTTGCT CTTAAACTAA TCATTGATCG TGAAAATGAG
ATTAAGGCTT TCCAACCGGA GGAGTATTGG ACTATTGACG GTTCCTTTAA AAAAGGAACA
CGTAAATTCA ATGCCACTTT CTACGGTTTA GACGGAAAGA AATTCAAATT ATCGAATAAT
GAAGACGTCA AAACAGTTCT AAAACGTATT AAAACTGATG AATTCTTAGT TGAAAAAGTT
GAAAAAAAAG AGCGTCGTCG TAATGCACCA TTACCGTATA CAACTTCTTC ATTGCAACAA
GATGCAGCTA ATAAAATCAA TTTTCGAACT CGTAAAACCA TGATGATTGC GCAACAGCTG
TACGAAGGAC TTAGCTTAGG CACAGCAGGT CATCAAGGTC TGATTACCTA TATGCGTACT
GATTCTACAC GTATTAGTCC GTTAGCACAA AATGAAGCAA CAGAATTTAT TACTAACCGT
TTTGGTGCAA ATTATTCTAA ACATGGGAAT AAAGTTAAAA ATGCTTCTGG AGCACAAGAC
GCTCACGAAG CTATTCGCCC ATCTAGTGTT AATCATACAC CCGAAAGTAT TGCTAAGTAT
TTAGACAAAG ATCAACTAAA ACTTTACACT CTTATCTGGA ACCGTTTTAT TGCAAGCCAA
ATGACAGCAG CAGTCTTTGA CACAATGAAA GTTAATTTAA CGCAAAATGG TGTTACTTTT
ATTGCTAATG GTAGCCAAGT CAAGTTTGAT GGTTACATGG CTGTTTATAA CGATACTGAC
AAAAATAAAA TGTTACCAGA TATGGAGGAA GGAGAAAGTG TTAAAAAGGT TAATACGAAT
CCTGAACAAC ACTTCACTCA ACCTCCTGCA AGGTTTTCAG AAGCAAGTCT CATTAAAACA
CTTGAAGAAA ACGGTGTAGG TCGTCCTTCA ACTTATGCCC CAACGCTTGA GACTATTCAA
AAACGTTATT ATGTTAAGCT TGCAGCTAAA CGCTTCGAAC CAACTGAACT TGGTGAAATT
GTTAATAGCC TTATTGTAGA ATTTTTCCCA GATATTGTTG ATGTTACCTT CACTGCCGAA
ATGGAAGGGA AACTAGACGA AGTTGAGATT GGTAAAGAGC AGTGGCAAAA GATTATCGAT
GAATTTTATA AACCATTTGA AAAAGAACTT GCAAAAGCAG AAACTGAAAT GGAGAAAATT
CAAATAAAGG ATGAACCTGC TGGATTTGAT TGCGAGCTAT GTGGATCACC AATGGTAATA
AAACTAGGAC GTTATGGAAA GTTTTATGCG TGTAGCAATT TCCCTGAATG TCATAACACA
AAAGCTATCA CTAAGGAAAT AGGTGTTATT TGTCCTATCT GTCAAAAAGG ACAAGTTATT
GAGCGAAAAA CAAAACGCAA TCGTATCTTT TATGGCTGTG ACCGCTACCC AGAATGTGAG
TTTACATCTT GGGACAAACC TATTGGACGA ACTTGCCCCA AATCAAATGA TTTCTTAGTG
GAGAAAAAAG TACGCGGTGG TGGTAAACAA GTTGTTTGCT CAAATGAAAA ATGCGACTAC
CAAGAAGAGA AAATTAAATA A
 
Protein sequence
MATTTKTSTK KTSKKKSATA KKNLVIVESP AKAKTIEKYL GRNYKVVASV GHIRDLKKSS 
MSIDFENNYE PQYINIRGKG PLINDLKKEA KKAKKVYLAS DPDREGEAIS WHLAHILDLD
KEDRNRVVFN EITKDAVKNA FVEPRQINMD LVDAQQARRV LDRIVGYSIS PILWKKVKKG
LSAGRVQSVA LKLIIDRENE IKAFQPEEYW TIDGSFKKGT RKFNATFYGL DGKKFKLSNN
EDVKTVLKRI KTDEFLVEKV EKKERRRNAP LPYTTSSLQQ DAANKINFRT RKTMMIAQQL
YEGLSLGTAG HQGLITYMRT DSTRISPLAQ NEATEFITNR FGANYSKHGN KVKNASGAQD
AHEAIRPSSV NHTPESIAKY LDKDQLKLYT LIWNRFIASQ MTAAVFDTMK VNLTQNGVTF
IANGSQVKFD GYMAVYNDTD KNKMLPDMEE GESVKKVNTN PEQHFTQPPA RFSEASLIKT
LEENGVGRPS TYAPTLETIQ KRYYVKLAAK RFEPTELGEI VNSLIVEFFP DIVDVTFTAE
MEGKLDEVEI GKEQWQKIID EFYKPFEKEL AKAETEMEKI QIKDEPAGFD CELCGSPMVI
KLGRYGKFYA CSNFPECHNT KAITKEIGVI CPICQKGQVI ERKTKRNRIF YGCDRYPECE
FTSWDKPIGR TCPKSNDFLV EKKVRGGGKQ VVCSNEKCDY QEEKIK