Gene SAG1757 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSAG1757 
Symbol 
ID1014566 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptococcus agalactiae 2603V/R 
KingdomBacteria 
Replicon accessionNC_004116 
Strand
Start bp1752387 
End bp1753397 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content39% 
IMG OID637316925 
Productputative DNA-binding/iron metalloprotein/AP endonuclease 
Protein accessionNP_688747 
Protein GI22537896 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0533] Metal-dependent proteases with possible chaperone activity 
TIGRFAM ID[TIGR00329] metallohydrolase, glycoprotease/Kae1 family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0418302 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGATA GATATATTTT AGCAGTGGAA AGTTCATGTG ATGAAACTAG TGTTGCTATT 
TTAAAAAATG ATAAAGAGTT ACTAGCTAAT ATTATTGCAA GTCAAGTTGA AAGTCACAAA
CGTTTTGGTG GTGTTGTTCC TGAAGTGGCA AGCCGTCATC ACGTTGAAGT AGTAACAACC
TGTTTTGAGG ATGCTCTTCA AGAAGCAGGT ATTGTTGCTA GCGATTTGGA TGCTGTTGCT
GTAACATATG GTCCGGGATT AGTAGGAGCC TTATTGGTAG GTATGGCTGC AGCAAAAGCT
TTCGCTTGGG CAAATAAATT ACCTCTAATT CCTATAAACC ACATGGCAGG TCATTTAATG
GCAGCACGTG ACGTTAAGGA ACTTCAATAC CCATTGTTAG CTTTGCTTGT CAGTGGGGGA
CATACAGAAT TAGTATATGT TTCTGAACCG GGAGATTACA AAATAGTAGG AGAAACTCGG
GATGATGCTG TTGGAGAAGC TTATGATAAA GTAGGCCGTG TTATGGGCTT AACTTATCCA
GCAGGTCGCG AGATTGATCA GTTAGCTCAT AAGGGTCAAG ATACTTACCA TTTTCCTAGA
GCGATGATCA AAGAAGATCA TCTTGAATTT TCTTTTTCTG GATTAAAATC TGCATTTATC
AATTTACATC ATAATGCAGA ACAAAAGGGT GAAGCATTGG TTCTTGAAGA TTTATGTGCT
TCCTTTCAGG CGGCTGTTTT GGATATTTTA TTGGCCAAAA CTCAAAAAGC TTTGCTAAAG
TATCCAGTGA AAACTTTAGT CGTTGCTGGT GGAGTTGCAG CTAATCAAGG ACTTCGGGAA
CGCTTGGCTA CTGATATTTC TCCTGATATT GATGTGGTTA TTCCTCCTCT TAGATTATGT
GGGGATAATG CAGGAATGAT TGCATTAGCA GCAGCGATAG AGTTTGAAAA AGAGAATTTT
GCTTCTTTAA AATTGAATGC CAAACCTAGT TTAGCTTTTG AGAGTTTATA G
 
Protein sequence
MKDRYILAVE SSCDETSVAI LKNDKELLAN IIASQVESHK RFGGVVPEVA SRHHVEVVTT 
CFEDALQEAG IVASDLDAVA VTYGPGLVGA LLVGMAAAKA FAWANKLPLI PINHMAGHLM
AARDVKELQY PLLALLVSGG HTELVYVSEP GDYKIVGETR DDAVGEAYDK VGRVMGLTYP
AGREIDQLAH KGQDTYHFPR AMIKEDHLEF SFSGLKSAFI NLHHNAEQKG EALVLEDLCA
SFQAAVLDIL LAKTQKALLK YPVKTLVVAG GVAANQGLRE RLATDISPDI DVVIPPLRLC
GDNAGMIALA AAIEFEKENF ASLKLNAKPS LAFESL