Gene SAG2174 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSAG2174 
Symbol 
ID1014988 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptococcus agalactiae 2603V/R 
KingdomBacteria 
Replicon accessionNC_004116 
Strand
Start bp2158070 
End bp2159299 
Gene Length1230 bp 
Protein Length409 aa 
Translation table11 
GC content35% 
IMG OID637317342 
Productserine protease 
Protein accessionNP_689159 
Protein GI22538308 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00156367 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAAAAAA AATTAGTCTC ATCACTTCTA AAGTGTTCTC TAATCATTAT TGTTAGCTTT 
GCTGGTGGAG CATTTGCTAG TTTTGTCATG AATCATAATG ACAATATTCC AAATGGTGGT
GTCACTAAAA CTAGTAAAGT AAATTATAAT AACATAACGC CTACAACAAA AGCTGTTAAA
AAGGTACAAA ATAGTGTTGT TTCTGTTATC AATTATAAAC AACAAGAGAG TCGTTCTGAC
CTATCAGACT TCTATAGTCA TTTTTTTGGT AATCAGGGGG GCAACACTGA TAAGGGCTTA
CAAGTTTACG GTGAAGGCTC TGGAGTCATC TATAAAAAAG ATGGTAAAAA TGCCTATGTT
GTCACTAATA ACCACGTCAT TGATGGGGCT AAACAAATTG AAATTCAACT AGCTGATGGC
TCAAAAGCAG TTGGGAAACT TGTTGGGTCA GATACCTACT CTGATTTAGC CGTCGTCAAA
ATTCCATCAG ATAAGGTTTC AAATATTGCA GAATTTGCTG ATTCATCAAA ACTCAACATT
GGTGAAACTG CTATAGCGAT CGGAAGCCCT CTTGGAACTG AGTATGCAAA TTCTGTAACT
CAAGGTATTG TATCTAGTTT AAAAAGAACT GTAACAATGA CTAATGAAGA AGGACAAACA
GTTTCTACAA ATGCTATCCA GACTGATGCT GCTATCAATC CTGGTAATTC AGGTGGAGCA
CTTATCAATA TTGAAGGACA GGTTATTGGA ATTAATTCTA GTAAAATTTC TTCTACATCA
AATCAAACCT CAGGACAATC GTCAGGAAAT AGCGTTGAAG GTATGGGATT TGCCATTCCT
TCAAATGATG TTGTTAAGAT TATCAATCAA CTTGAGAGTA ACGGACAAGT AGAGAGACCT
GCTCTAGGTA TTTCTATGGC TGGATTAAGT AATTTACCAT CCGATGTTAT TAGTAAACTG
AAAATCCCAA GTAATGTTAC TAATGGTATT GTAGTAGCAT CTATCCAATC TGGCATGCCA
GCTCAAGGCA AACTAAAGAA ATACGATGTC ATTACTAAAG TTGACGATAA AGAAGTAGTA
TCTCCAAGTG ATTTACAAAG TTTACTCTAT GGCCACCAGG TAGGGGATTC CATAACAGTA
ACCTTTTATC GTGGTGAAAA TAAACAAACA GTCACTATAA AACTTACTAA AACTAGTAAA
GATTTAGCTA AACAACGAGC AAATAACTAA
 
Protein sequence
MKKKLVSSLL KCSLIIIVSF AGGAFASFVM NHNDNIPNGG VTKTSKVNYN NITPTTKAVK 
KVQNSVVSVI NYKQQESRSD LSDFYSHFFG NQGGNTDKGL QVYGEGSGVI YKKDGKNAYV
VTNNHVIDGA KQIEIQLADG SKAVGKLVGS DTYSDLAVVK IPSDKVSNIA EFADSSKLNI
GETAIAIGSP LGTEYANSVT QGIVSSLKRT VTMTNEEGQT VSTNAIQTDA AINPGNSGGA
LINIEGQVIG INSSKISSTS NQTSGQSSGN SVEGMGFAIP SNDVVKIINQ LESNGQVERP
ALGISMAGLS NLPSDVISKL KIPSNVTNGI VVASIQSGMP AQGKLKKYDV ITKVDDKEVV
SPSDLQSLLY GHQVGDSITV TFYRGENKQT VTIKLTKTSK DLAKQRANN