Gene LGAS_0916 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagLGAS_0916 
Symbol 
ID4439940 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameLactobacillus gasseri ATCC 33323 
KingdomBacteria 
Replicon accessionNC_008530 
Strand
Start bp913680 
End bp915761 
Gene Length2082 bp 
Protein Length693 aa 
Translation table11 
GC content37% 
IMG OID639672772 
ProductDNA topoisomerase I 
Protein accessionYP_814743 
Protein GI116629571 
COG category[L] Replication, recombination and repair 
COG ID[COG0550] Topoisomerase IA
[COG0551] Zn-finger domain associated with topoisomerase type I 
TIGRFAM ID[TIGR01051] DNA topoisomerase I, bacterial
[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000916095 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value0.000814664 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCCTACTA AACGAAAAAA TAAAAAGAAT TTAGTTATTG TCGAGTCTCC ACATAAGGCT 
AAGACAATTG AAAAATATTT AGGTAGAAAT TATCATGTTA TTGCTTCAAA AGGTCATATT
CGTGATTTAC CCAAGTCCCA AATGGGTGTT GACGTCGAAC ACGATTATGA ACCAAAGTAT
ATTTCTATTC GCGGAAAAGG CGATACGATT AAAGAATTAA AGAGCGAAGC CAAAAAGGCA
AAGTATGTTT ATCTCGCTTC CGACCCCGAT CGTGAAGGAG AAGCTATTGC CTGGCACGTT
GCTCATGCTT TGAATTTAGA TCCTAAAGAG CACAACCGTG TTGCTTTTAA CGAAATTACT
AAAGATGCGG TTAAAAATGC CTTTAAGAAT CCAAGAACCA TTGATATGGA TATTGTAGAT
GCTCAACAAG CACGTCGTGT TCTTGACCGC TTGGTGGGTT ATTCAATTAG TCCTATCTTG
TGGCAAAAAG TTAAGAAAGG CTTATCCGCT GGACGTGTTC AATCAATTGC CTTGAAGTTA
GTAATTGATC GTGAAAACGA AATTAAGAAC TTTAAGCCAG AAGAATACTG GACAATTGAT
GCTGATTTTG AAAAAGGTAA GGAAAAGTTC AAGGGAGCCT TTTATGGCAT TAAGGGTAAG
AAACAAGACT TACCAAATAA TGAAGCAGTT CAAGATATTT TAAAACAAAT TGATAAACGT
AAGAACTTTG AAGTTACCAA AGTAGTTAAA AAAGAAAGAA GACGTCAGCC TGCTGCGCCT
TTTACTACCT CAACTATGCA GCAAGAAGCT AACAAGCGTT TAGGATATCG TACTCGTCGA
ACAATGAGAA TTGCTCAATC TCTTTATGAA GGGGTTAACC TGGGTAAAGG ATCAGTTGGT
TTAATTACTT ATATGCGTAC TGATTCTAAA CGTATTGCTA ATGTTGCTAA GCATGAAGCT
TCAAAATTCA TCCATGAAGA ATATGGTGCA AATTATGCAG CAATCAAGCC GCAACATTTT
AAAAACGATG CTGATGCTCA AGATGCCCAC GAAGCAATTC GTCCAACTTC AGCTTTCAGA
ACGCCAGCTT CAGTTAAAGA ATATTTGACT ACTGAAGAAT ATCGTCTCTA CACCTTAATT
TGGTCAAGAT TTATTGCTAG TCAAATGACG CCAGCTGTTT ACGATACAGT AAGAGCTGAT
ATTGAACAAA ATGACGTTAC CTTTAGAACA ACTGGTTCAA AACTTAAGTT TGCTGGTTTT
ACTAAGGTTT ATGATAACCA AAAAGAAAAG AATAATGAAT TACCTGAGTT AAATGAAGGC
GATAAGGTTA AGCTTAAAAA GACCGATGAT CGTCAGCACT TTACTCAGCC ACCAGCAAGA
TATACTGAAG CCAGTTTAGT TAGAGCTCTT GAAGAAAATG GCGTTGGTCG TCCATCAACT
TATGCACCAA CAATTGATAC GATTCAAAAA CGTTATTACG TAAAACTTGA AGGAAGATCA
ATTGTACCGA CTGAATTAGG TGAAATTGTC GATAAGTTAA TCGAAGAATT CTTCCCAGAT
ATTGTTAACG TCGATTTCAC CGCTCAACTA GAAGATGATC TTGACGGCGT TGAAGTAGGA
AAGAAGAACT GGATCAAAGT AGTAGATGAA TACTACAAGC CATTTTCTAA AGAATTAGAC
AAGGCTGATC AACAAATTGA AAAAGTTCAA ATTAAGGATG AACCAGCTGG TTTTAACTGC
GACATTTGTG GTGCACCGAT GGTAATTAAG ATGGGACGTT ATGGTAAGTT TTATGCTTGC
TCGCGCTTCC CAGATTGCCG TAACACTAAG CCAATTGTTA AAAAGGTTGG CGTAACTTGT
CCTAAGTGTG GTAAGGGAGA AGTTATTGAA AAGAAGTCTA AACGTAATCG TAAGTTCTAT
GGCTGCTCTC GTTATCCAGA TTGTGACTTT GTATCTTGGG ATCAACCAAT TGGTCGTAAT
TGTCCAAATG ATGGTCATTT CTTAGTTCAA AAGAAGAATA AGAAGGGCTT AGTTATTCTT
TGCCCAAACG GCGATTATCG TGAAGAACCA GAAGAAAATT AA
 
Protein sequence
MPTKRKNKKN LVIVESPHKA KTIEKYLGRN YHVIASKGHI RDLPKSQMGV DVEHDYEPKY 
ISIRGKGDTI KELKSEAKKA KYVYLASDPD REGEAIAWHV AHALNLDPKE HNRVAFNEIT
KDAVKNAFKN PRTIDMDIVD AQQARRVLDR LVGYSISPIL WQKVKKGLSA GRVQSIALKL
VIDRENEIKN FKPEEYWTID ADFEKGKEKF KGAFYGIKGK KQDLPNNEAV QDILKQIDKR
KNFEVTKVVK KERRRQPAAP FTTSTMQQEA NKRLGYRTRR TMRIAQSLYE GVNLGKGSVG
LITYMRTDSK RIANVAKHEA SKFIHEEYGA NYAAIKPQHF KNDADAQDAH EAIRPTSAFR
TPASVKEYLT TEEYRLYTLI WSRFIASQMT PAVYDTVRAD IEQNDVTFRT TGSKLKFAGF
TKVYDNQKEK NNELPELNEG DKVKLKKTDD RQHFTQPPAR YTEASLVRAL EENGVGRPST
YAPTIDTIQK RYYVKLEGRS IVPTELGEIV DKLIEEFFPD IVNVDFTAQL EDDLDGVEVG
KKNWIKVVDE YYKPFSKELD KADQQIEKVQ IKDEPAGFNC DICGAPMVIK MGRYGKFYAC
SRFPDCRNTK PIVKKVGVTC PKCGKGEVIE KKSKRNRKFY GCSRYPDCDF VSWDQPIGRN
CPNDGHFLVQ KKNKKGLVIL CPNGDYREEP EEN