Gene SAG0742 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSAG0742 
Symbol 
ID1013546 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptococcus agalactiae 2603V/R 
KingdomBacteria 
Replicon accessionNC_004116 
Strand
Start bp734877 
End bp736163 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content39% 
IMG OID637315930 
ProductU32 family peptidase 
Protein accessionNP_687757 
Protein GI22536906 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0826] Collagenase and related proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.810338 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTAATG TAAAAAAACG CCCTGAGGTT TTATCACCTG CAGGAACACT AGAAAAATTA 
AAAGTTGCTA TTGATTATGG AGCAGATGCT GTATTTGTTG GAGGTCAAGC GTATGGTCTT
CGAAGTAGAG CTGGTAACTT TTCTATGGAA GAGTTACAAG AGGGAATAAA CTATGCTCAT
GCAAGAGATG CTAAAGTTTA TGTAGCGGCT AATATGGTTA CTCATGAAGG TAATGAGCTT
GGGGCAGGTC CGTGGTTTCG TGAATTACGT GATATGGGAC TAGATGCAGT CATTGTTTCA
GATCCTGCTC TTATTGTTAT TTGTGCTACA GAAGCACCAG GCTTAGAAAT TCATTTGTCA
ACTCAAGCCT CTTCCACGAA CTATGAAACT TTTGAATTTT GGAAAGAGAT GGGGCTTACT
CGTGTCGTAT TAGCACGTGA GGTTACTATG GCAGAGTTGG CTGAAATCAG GAAGAGGACA
GATGTTGAGA TAGAAGCATT TGTTCATGGC GCGATGTGTA TTTCATACTC AGGACGATGT
GTTCTATCAA ACCATATGAG CCATCGTGAT GCTAATCGTG GCGGTTGCTC TCAGTCATGT
CGTTGGAAAT ATGACCTCTA CGATATGCCA TTTGGACAAG AACGTCAATC GTTAAAAGGC
GAGATTCCAG AACCTTTCTC AATGTCAGCT GTGGATATGT GTATGATTGA GCATATTCCA
GATATGATTG AAAATGGTGT AGATAGTTTA AAAATAGAAG GACGTATGAA ATCCATTCAT
TATGTTTCTA CAGTAACTAA TTGCTATAAA GCTGCTGTAG ATGCCTATAT GGAAAGTCCA
GAAGCTTTTG AAGCTATTAA AGAAGACTTG ATTGATGAAC TTTGGAAGGT TGCACAACGC
GAATTAGCAA CAGGTTTCTA CTACCATACA CCAACTGAAA ATGAACAACT CTTTGGAGCT
CGTCGTAAAA TTCCTCAATA CAAATTTGTT GGGGAAGTGG TTTCATTTGA CAATGCTAAA
ATGGAGGCTA CAATTCGTCA GCGTAATGTT ATTATGGAAG GAGATCGCGT AGAATTCTAT
GGTCCTGGCT TCCGTCACTT TGAATGTTTT ATTGATGGTC TGCGTGATGC TGAAGGAAAT
AAAATAGACC GTGCTCCAAA TCCGATGGAA TTATTAACCA TAACATTACC AAATCCAGTA
AAAAAAGGGG ATATGATTCG TGCTTGTAAA GAAGGATTAG TGAACCTTTA TCAAAATGAT
GGTACTAGCA AGACTGTAAG AGCTTAG
 
Protein sequence
MSNVKKRPEV LSPAGTLEKL KVAIDYGADA VFVGGQAYGL RSRAGNFSME ELQEGINYAH 
ARDAKVYVAA NMVTHEGNEL GAGPWFRELR DMGLDAVIVS DPALIVICAT EAPGLEIHLS
TQASSTNYET FEFWKEMGLT RVVLAREVTM AELAEIRKRT DVEIEAFVHG AMCISYSGRC
VLSNHMSHRD ANRGGCSQSC RWKYDLYDMP FGQERQSLKG EIPEPFSMSA VDMCMIEHIP
DMIENGVDSL KIEGRMKSIH YVSTVTNCYK AAVDAYMESP EAFEAIKEDL IDELWKVAQR
ELATGFYYHT PTENEQLFGA RRKIPQYKFV GEVVSFDNAK MEATIRQRNV IMEGDRVEFY
GPGFRHFECF IDGLRDAEGN KIDRAPNPME LLTITLPNPV KKGDMIRACK EGLVNLYQND
GTSKTVRA