Gene SAG0421 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSAG0421 
Symbol 
ID1013223 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptococcus agalactiae 2603V/R 
KingdomBacteria 
Replicon accessionNC_004116 
Strand
Start bp436461 
End bp439628 
Gene Length3168 bp 
Protein Length1055 aa 
Translation table11 
GC content35% 
IMG OID637315626 
Producthypothetical protein 
Protein accessionNP_687455 
Protein GI22536604 
COG category 
COG ID 
TIGRFAM ID[TIGR01167] LPXTG-motif cell wall anchor domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAAAAA AACATCTTAA AACGCTTGCC TTGGCACTTA CTACAGTATC AGTAGTGACA 
TACAGCCAGG AGGTATATGG ATTAGAAAGA GAGGAATCGG TCAAACAAGA ACAAACCCAG
TCAGCTTCAG AAGATGATTG GTTCGAAGAA GATAATGAGA GGAAAACAAA TGTTTCTAAA
GAGAATTCTA CTGTTGATGA AACAGTTAGT GATTTATTTT CTGATGGAAA TAGTAATAAC
TCTAGTTCTA AAACCGAGTC AGTGGTAAGT GACCCTAAAC AAGTCCCCAA AGCAAAACCA
GAGGTTACAC AAGAAGCAAG CAATTCTAGT AATGATGCTA GCAAAGTAGA AGTACCAAAA
CAGGATACAG CTTCAAAAAA GGAAACTCTA GAAACATCAA CTTGGGAGGC AAAAGATTTC
GTAACTAGAG GGGATACTTT AGTAGGTTTT TCAAAATCTG GAATTAATAA GTTATCTCAA
ACATCACACT TGGTTTTACC AAGTCATGCA GCAGATGGAA CTCAATTGAC ACAAGTAGCT
AGCTTTGCTT TTACTCCAGA TAAAAAGACG GCCATTGCAG AATATACAAG TAGGCTAGGA
GAAAATGGGA AACCGAGTCG TTTAGATATT GATCAGAAGG AAATTATTGA TGAGGGAGAA
ATATTTAATG CTTACCAGTT GACTAAGCTT ACTATTCCAA ATGGTTATAA GTCTATTGGT
CAAGATGCTT TTGTGGACAA TAAGAATATT GCTGAGGTTA ACCTTCCTGA GAGTCTCGAG
ACTATTTCAG ACTATGCTTT TGCTCACATG TCTTTAAAAC AAGTAAAGTT ACCAGATAAC
CTAAAGGTCA TTGGAGAATT AGCTTTTTTT GATAATCAGA TTGGTGGTAA GCTTTACTTG
CCACGTCACT TGATAAAATT AGCAGAACGC GCTTTCAAAT CTAATCGTAT TCAAACAGTT
GAATTTTTGG GAAGTAAGCT TAAGGTTATA GGAGAAGCAA GTTTTCAAGA TAATAATCTG
AGGAATGTTA TGCTTCCGGA TGGACTTGAA AAAATAGAAT CAGAAGCTTT TACAGGAAAT
CCAGGAGATG AACATTACAA CAATCAGGTT GTATTGCGCA CAAGGACAGG CCAAAATCCA
CATCAACTTG CGACTGAGAA TACTTACGTC AATCCGGACA AATCATTGTG GCGTGCAACA
CCTGATATGG ATTATACCAA ATGGTTAGAG GAAGATTTTA CCTATCAAAA AAATAGTGTT
ACAGGTTTTT CAAATAAAGG CTTACAAAAG GTAAGACGTA ATAAAAACTT AGAAATTCCA
AAACAACACA ATGGTATTAC TATTACTGAA ATTGGTGATA ACGCTTTTCG CAATGTTGAT
TTTCAAAGTA AAACTTTACG TAAATATGAT TTGGAAGAAA TAAAGCTCCC CTCAACTATT
CGGAAAATAG GTGCTTTTGC TTTTCAATCT AATAACTTGA AATCCTTTGA AGCAAGTGAA
GATTTAGAAG AGATTAAAGA GGGAGCCTTT ATGAATAATC GTATTGGAAC TCTAGACTTG
AAAGACAAAC TTATCAAAAT AGGTGATGCT GCTTTCCATA TTAATCATAT TTATGCCATT
GTTCTTCCAG AATCTGTACA AGAAATAGGA CGTTCAGCTT TTCGACAAAA TGGTGCGCTT
CACCTTATGT TTATCGGAAA TAAGGTTAAA ACAATTGGTG AAATGGCTTT TTTATCCAAT
AAACTGGAAA GTGTAAATCT CTCTGAGCAA AAACAATTAA AGACAATTGA GGTCCAAGCT
TTTTCGGATA ATGCCCTTAG TGAAGTAGTC TTACCGCCAA ATTTACAGAC TATTCGTGAA
GAGGCTTTCA AAAGGAATCA TTTGAAAGAA GTGAAGGGTT CATCTACATT ATCTCAGATT
ACTTTTAATG CTTTTGATCA AAATGATGGG GACAAACGCT TTGGTAAGAA AGTGGTTGTT
AGGACACATA ATAATTCTCA TATGTTAGCA GATGGTGAGC GTTTTATCAT TGATCCAGAT
AAGCTATCTT CTACAATGGT AGACCTTGAA AAGGTTTTAA AAATAATCGA AGGTTTAGAT
TACTCTACAT TACGTCAGAC TACTCAAACT CAGTTTAGAG AAATGACTAC TGCAGGTAAA
GCGTTGTTAT CAAAATCTAA CCTCCGACAA GGAGAAAAAC AAAAATTCCT TCAAGAAGCA
CAATTTTTCC TTGGTCGCGT TGATTTGGAT AAAGCCATAG CTAAAGCTGA GAAGGCTTTA
GTGACCAAGA AGGCAACAAA GAATGGTCAT TTGCTTGAGA GGAGTATTAA CAAAGCGGTA
TTAGCTTATA ATAATAGTGC TATTAAAAAA GCTAATGTTA AGCGCTTGGA AAAAGAGTTA
GACTTGCTGA CGGATTTAGT CGAGGGAAAA GGACCATTAG CGCAAGCTAC AATGGTACAA
GGAGTTTATT TATTAAAGAC GCCTTTACCA TTGCCAGAAT ATTATATCGG ATTGAACGTT
TATTTTGACA AGTCTGGAAA ATTGATTTAT GCACTTGATA TGAGTGATAC TATTGGCGAG
GGACAAAAAG ATGCATATGG TAATCCTATA TTAAATGTTG ACGAGGATAA TGAAGGTTAT
CATACCTTGG CAGTTGCCAC TTTAGCTGAT TATGAAGGTC TTTATATTAA AGATATTTTA
AATAGTTCCC TTGATAAGAT TAAAGCAATA CGCCAGATTC CTTTGGCAAA ATATCATAGA
TTAGGAATTT TCCAAGCTAT TCGAAATGCA GCGGCAGAAG CAGACCGATT GCTTCCTAAG
ACACCTAAGG GGTACCTAAA TGAAGTCCCA AATTATCGTA AAAAACAAGT GGAGAAAAAT
TTAAAACCAG TTGATTATAA AACGCCGATT TTTAATAAGG CTTTACCTAA TGAAAAGGTA
GACGGTGATA GAGCGGCTAA AGGTCATAAT ATAAATGCGG AGACTAATAA TTCTGTAGCT
GTAACACCAA TAAGGTCCGA GCAGCAATTA CATAAGTCAC AGTCTGATGT AAATTTACCT
CAAACAAGTT CTAAAAATAA TTTTATATAC GAGATTCTAG GATACGTTAG TTTATGTTTG
CTTTTCCTAG TAACTGCTGG GAAAAAAGGA AAACGAGCAA GAAAATAA
 
Protein sequence
MTKKHLKTLA LALTTVSVVT YSQEVYGLER EESVKQEQTQ SASEDDWFEE DNERKTNVSK 
ENSTVDETVS DLFSDGNSNN SSSKTESVVS DPKQVPKAKP EVTQEASNSS NDASKVEVPK
QDTASKKETL ETSTWEAKDF VTRGDTLVGF SKSGINKLSQ TSHLVLPSHA ADGTQLTQVA
SFAFTPDKKT AIAEYTSRLG ENGKPSRLDI DQKEIIDEGE IFNAYQLTKL TIPNGYKSIG
QDAFVDNKNI AEVNLPESLE TISDYAFAHM SLKQVKLPDN LKVIGELAFF DNQIGGKLYL
PRHLIKLAER AFKSNRIQTV EFLGSKLKVI GEASFQDNNL RNVMLPDGLE KIESEAFTGN
PGDEHYNNQV VLRTRTGQNP HQLATENTYV NPDKSLWRAT PDMDYTKWLE EDFTYQKNSV
TGFSNKGLQK VRRNKNLEIP KQHNGITITE IGDNAFRNVD FQSKTLRKYD LEEIKLPSTI
RKIGAFAFQS NNLKSFEASE DLEEIKEGAF MNNRIGTLDL KDKLIKIGDA AFHINHIYAI
VLPESVQEIG RSAFRQNGAL HLMFIGNKVK TIGEMAFLSN KLESVNLSEQ KQLKTIEVQA
FSDNALSEVV LPPNLQTIRE EAFKRNHLKE VKGSSTLSQI TFNAFDQNDG DKRFGKKVVV
RTHNNSHMLA DGERFIIDPD KLSSTMVDLE KVLKIIEGLD YSTLRQTTQT QFREMTTAGK
ALLSKSNLRQ GEKQKFLQEA QFFLGRVDLD KAIAKAEKAL VTKKATKNGH LLERSINKAV
LAYNNSAIKK ANVKRLEKEL DLLTDLVEGK GPLAQATMVQ GVYLLKTPLP LPEYYIGLNV
YFDKSGKLIY ALDMSDTIGE GQKDAYGNPI LNVDEDNEGY HTLAVATLAD YEGLYIKDIL
NSSLDKIKAI RQIPLAKYHR LGIFQAIRNA AAEADRLLPK TPKGYLNEVP NYRKKQVEKN
LKPVDYKTPI FNKALPNEKV DGDRAAKGHN INAETNNSVA VTPIRSEQQL HKSQSDVNLP
QTSSKNNFIY EILGYVSLCL LFLVTAGKKG KRARK