Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SAG1757 |
Symbol | |
ID | 1014566 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptococcus agalactiae 2603V/R |
Kingdom | Bacteria |
Replicon accession | NC_004116 |
Strand | - |
Start bp | 1752387 |
End bp | 1753397 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 637316925 |
Product | putative DNA-binding/iron metalloprotein/AP endonuclease |
Protein accession | NP_688747 |
Protein GI | 22537896 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0533] Metal-dependent proteases with possible chaperone activity |
TIGRFAM ID | [TIGR00329] metallohydrolase, glycoprotease/Kae1 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0418302 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGATA GATATATTTT AGCAGTGGAA AGTTCATGTG ATGAAACTAG TGTTGCTATT TTAAAAAATG ATAAAGAGTT ACTAGCTAAT ATTATTGCAA GTCAAGTTGA AAGTCACAAA CGTTTTGGTG GTGTTGTTCC TGAAGTGGCA AGCCGTCATC ACGTTGAAGT AGTAACAACC TGTTTTGAGG ATGCTCTTCA AGAAGCAGGT ATTGTTGCTA GCGATTTGGA TGCTGTTGCT GTAACATATG GTCCGGGATT AGTAGGAGCC TTATTGGTAG GTATGGCTGC AGCAAAAGCT TTCGCTTGGG CAAATAAATT ACCTCTAATT CCTATAAACC ACATGGCAGG TCATTTAATG GCAGCACGTG ACGTTAAGGA ACTTCAATAC CCATTGTTAG CTTTGCTTGT CAGTGGGGGA CATACAGAAT TAGTATATGT TTCTGAACCG GGAGATTACA AAATAGTAGG AGAAACTCGG GATGATGCTG TTGGAGAAGC TTATGATAAA GTAGGCCGTG TTATGGGCTT AACTTATCCA GCAGGTCGCG AGATTGATCA GTTAGCTCAT AAGGGTCAAG ATACTTACCA TTTTCCTAGA GCGATGATCA AAGAAGATCA TCTTGAATTT TCTTTTTCTG GATTAAAATC TGCATTTATC AATTTACATC ATAATGCAGA ACAAAAGGGT GAAGCATTGG TTCTTGAAGA TTTATGTGCT TCCTTTCAGG CGGCTGTTTT GGATATTTTA TTGGCCAAAA CTCAAAAAGC TTTGCTAAAG TATCCAGTGA AAACTTTAGT CGTTGCTGGT GGAGTTGCAG CTAATCAAGG ACTTCGGGAA CGCTTGGCTA CTGATATTTC TCCTGATATT GATGTGGTTA TTCCTCCTCT TAGATTATGT GGGGATAATG CAGGAATGAT TGCATTAGCA GCAGCGATAG AGTTTGAAAA AGAGAATTTT GCTTCTTTAA AATTGAATGC CAAACCTAGT TTAGCTTTTG AGAGTTTATA G
|
Protein sequence | MKDRYILAVE SSCDETSVAI LKNDKELLAN IIASQVESHK RFGGVVPEVA SRHHVEVVTT CFEDALQEAG IVASDLDAVA VTYGPGLVGA LLVGMAAAKA FAWANKLPLI PINHMAGHLM AARDVKELQY PLLALLVSGG HTELVYVSEP GDYKIVGETR DDAVGEAYDK VGRVMGLTYP AGREIDQLAH KGQDTYHFPR AMIKEDHLEF SFSGLKSAFI NLHHNAEQKG EALVLEDLCA SFQAAVLDIL LAKTQKALLK YPVKTLVVAG GVAANQGLRE RLATDISPDI DVVIPPLRLC GDNAGMIALA AAIEFEKENF ASLKLNAKPS LAFESL
|
| |