Gene SAG1197 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSAG1197 
Symbol 
ID1014004 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptococcus agalactiae 2603V/R 
KingdomBacteria 
Replicon accessionNC_004116 
Strand
Start bp1201497 
End bp1204715 
Gene Length3219 bp 
Protein Length1072 aa 
Translation table11 
GC content36% 
IMG OID637316382 
Producthyaluronate lyase 
Protein accessionNP_688206 
Protein GI22537355 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAATCA AAAAGAAACA TCGTATTATG CTTTATTCAG CCCTTATTTT AGGAACAATA 
TTGGTTAACA ATAGTTACCA AGCTAAAGCT GAAGAGCTTA CCAAAACTAC CTCAACGTCA
CAAATAAGAG ATACTCAAAC TAATAATGTT GAAGCTCCCC AGACTGAAAG TACCACTGTC
AAAGAGACTA GCACCACAAC CACACAACAA GATTTGTCTA ACTCCACAGC TTCAACCGCA
ACTGCAACAG CCACTCCTAG CACAATGAAA CAAGTAGTAG ATAATCAAAC TCAAAATAAG
GAGCTGGTGA AAAACGGAGA TTTTAATCAA ACTAACCCTG TATCTGGAAG CTGGTCACAT
ACAGGCGCTA GGGAATGGTC TGCTTGGATT GATAAAGAAA ATACTGCTGA TAAATCACCT
ATTATCCAAC GTACCGAACA AGGCCAAGTA AGTCTATCCA GCGACAAAGG CTTTAGAGGT
GCTGTAACAC AAAAAGTGAA CATTGATCCC ACTAAAAAAT ATGAGGTCAA GTTTGATATT
GAAACAAGTA ACAAGGCTGG ACAAGCTTTC CTTCGTATTA TGGAGAAAAA AGATAACAAT
ACGCGACTTT GGCTTTCTGA GATGACCAGC GGTACTACTA ACAAACATAC CTTAACAAAG
ATATATAACC CAAAGTTAGA TGTCTCCGAG GTGACACTTG AACTTTATTA TGAAAAAGGA
ACAGGTTCTG TTACTTTTGA TAATATATCA ATGAAAGCAA AAGGCCCTAA AGACTCAGAG
CATCCACAAC CCGTCACAAC ACAAATTGAA AAAAGCGTTA ATACGGCTTT AAACAAAAAT
TACGTTTTTA ATAAAGCTGA CTACCAATAC ACTCTAACCA ATCCGTCTCT TGGGAAAATT
GTTGGTGGAA TATTGTATCC AAACGCTACT GGTTCAACAA CTGTTAAAAT ATCTGATAAA
TCTGGCAAAA TAATTAAAGA AGTACCGTTA TCAGTTACAG CTTCAACAGA AGATAACTTT
ACAAAACTCC TCGACAAATG GAACGACGTG ACTATTGGTA ATCATGTTTA CGATACTAAT
GATTCGAACA TGCAAAAGCT TAATCAGAAA TTAGATGAAA CTAACGCCAA AAACATCGAA
GCTATCAAAC TGGATTCTAA TCGCACTTTC CTTTGGAAAG ATTTAGATAA TCTCAATAAT
TCAGCACAGT TAACCGCTAC TTATCGTCGT TTGGAAGATT TAGCTAAACA AATCACCAAT
CCACACTCTA CTATTTACAA AAATGAAAAA GCTATTCGTA CTGTAAAAGA GAGTCTGGCT
TGGCTTCATC AAAACTTCTA CAATGTTAAT AAAGATATAG AAGGCTCTGC CAATTGGTGG
GATTTTGAAA TCGGTGTTCC TCGCTCAATT ACAGGTACCC TAGCTCTCAT GTATAACTAC
TTCACTGACG CTGAAATAAA AACTTATACC GACCCAATTG AACATTTTGT TCCTGATGCA
GGATTTTTCC GTAAAACGCT TGTCAATCCA TTTAAAGCCC TTGGTGGTAA TCTAGTCGAT
ATGGGGCGCG TTAAAATCAT TGAAGGTTTA CTTCGTAAAG ACAATACTAT TATCGAAAAA
ACTTCTCATT CTCTAAAAAA TCTTTTTACT ACTGCTACTA AAGCTGAAGG CTTCTATGCT
GACGGTTCTT ACATCGACCA TACGAATGTT GCTTATACTG GCGCCTATGG TAATGTTCTG
ATAGATGGTT TGACACAATT GCTGCCTATC ATTCAAGAAA CTGACTATAA AATCTCTAAT
CAAGAACTTG ATATGGTTTA TAAATGGATT AATCAATCAT TTTTACCTTT AATTGTAAAA
GGTGAGTTAA TGGATATGAG TCGTGGACGC TCAATTAGTA GAGAGGCTGC TTCTTCTCAT
GCGGCTGCAG TTGAAGTTCT CAGAGGTTTC CTCAGATTGG CTAACATGTC TAATGAAGAG
CGAAACTTAG ACCTCAAATC AACTATTAAA ACGATTATCA CTTCAAATAA ATTCTACAAC
GTCTTCAATA ACCTCAAATC GTATTCCGAT ATTGCCAACA TGAATAAGCT GCTTAATGAC
AGTACAGTCG CTACTAAACC TTTAAAAAGT AATTTATCAA CCTTTAATAG CATGGACCGC
TTAGCTTATT ATAATGCCGA GAAAGACTTT GGTTTCGCAC TTTCATTACA TTCTAAACGT
ACCCTCAACT ATGAAGGAAT GAATGATGAA AATACACGTG GTTGGTATAC CGGAGATGGT
ATGTTCTATC TTTATAATAG TGATCAATCT CATTATAGTA ATCATTTTTG GCCAACCGTC
AATCCTTATA AAATGGCTGG AACAACTGAA AAAGATGCTA AGCGTGAAGA TACGACTAAG
GATTTCATGA GCAAACATAG CAAAGACGCT AAAGAAAAAA CCGGTCAAGT TACAGGAGCA
TCTGACTTTG TTGGTTCCGT CAAACTTAAT GATCACTTTG CTCTTGCCGC TATGGACTTT
ACTAACTGGG ATCGCACCTT AACAGCACAA AAAGGTTGGG TTATCTTAAA TGATAAGATT
GTCTTTTTAG GTAGCAACAT CAAGAATACT AACGGCATTG GAAATGTTTC TACAACAATT
GATCAACGAA AAGACGATTC TAAAACACCT TATACTACAT ACGTCAATGG AAAAACTGTT
GATTTAAAGC AAGCAAGTTC TCAACAATTT ACAGATACAA AGAGTGTCTT TTTAGAATCA
AAAGAACCTG GTCGCAATAT TGGTTATATC TTCTTTAAAA ATAGCACTAT TGATATTGAA
CGCAAAGAGC AAACAGGTAC TTGGAACAGC ATTAATCGTA CTTCTAAAAA TACCTCAATC
GTTAGCAATC CTTTTATCAC TATAAGCCAA AAGCATGACA ACAAAGGTGA TAGCTATGAT
TACATGATGG TTCCAAACAT TGATCGCACA AGTTTTGATA AATTAGCCAA TAGCAAAGAA
GTAGAATTAC TAGAAAACAG TTCAAAACAA CAAGTTATCT ATGATAAAAA CAGTCAAACT
TGGGCTGTTA TCAAACACGA TAATCAAGAG AGTCTCATTA ACAATCAATT CAAAATGAAT
AAAGCGGGAC TTTACCTAGT ACAAAAAGTT GGTAATGACT ATCAAAATGT CTATTACCAA
CCTCAAAGCA TGACAAAAAC AGACCAATTA GCTATCTAA
 
Protein sequence
MEIKKKHRIM LYSALILGTI LVNNSYQAKA EELTKTTSTS QIRDTQTNNV EAPQTESTTV 
KETSTTTTQQ DLSNSTASTA TATATPSTMK QVVDNQTQNK ELVKNGDFNQ TNPVSGSWSH
TGAREWSAWI DKENTADKSP IIQRTEQGQV SLSSDKGFRG AVTQKVNIDP TKKYEVKFDI
ETSNKAGQAF LRIMEKKDNN TRLWLSEMTS GTTNKHTLTK IYNPKLDVSE VTLELYYEKG
TGSVTFDNIS MKAKGPKDSE HPQPVTTQIE KSVNTALNKN YVFNKADYQY TLTNPSLGKI
VGGILYPNAT GSTTVKISDK SGKIIKEVPL SVTASTEDNF TKLLDKWNDV TIGNHVYDTN
DSNMQKLNQK LDETNAKNIE AIKLDSNRTF LWKDLDNLNN SAQLTATYRR LEDLAKQITN
PHSTIYKNEK AIRTVKESLA WLHQNFYNVN KDIEGSANWW DFEIGVPRSI TGTLALMYNY
FTDAEIKTYT DPIEHFVPDA GFFRKTLVNP FKALGGNLVD MGRVKIIEGL LRKDNTIIEK
TSHSLKNLFT TATKAEGFYA DGSYIDHTNV AYTGAYGNVL IDGLTQLLPI IQETDYKISN
QELDMVYKWI NQSFLPLIVK GELMDMSRGR SISREAASSH AAAVEVLRGF LRLANMSNEE
RNLDLKSTIK TIITSNKFYN VFNNLKSYSD IANMNKLLND STVATKPLKS NLSTFNSMDR
LAYYNAEKDF GFALSLHSKR TLNYEGMNDE NTRGWYTGDG MFYLYNSDQS HYSNHFWPTV
NPYKMAGTTE KDAKREDTTK DFMSKHSKDA KEKTGQVTGA SDFVGSVKLN DHFALAAMDF
TNWDRTLTAQ KGWVILNDKI VFLGSNIKNT NGIGNVSTTI DQRKDDSKTP YTTYVNGKTV
DLKQASSQQF TDTKSVFLES KEPGRNIGYI FFKNSTIDIE RKEQTGTWNS INRTSKNTSI
VSNPFITISQ KHDNKGDSYD YMMVPNIDRT SFDKLANSKE VELLENSSKQ QVIYDKNSQT
WAVIKHDNQE SLINNQFKMN KAGLYLVQKV GNDYQNVYYQ PQSMTKTDQL AI