Gene SAG1297 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSAG1297 
Symbol 
ID1014104 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptococcus agalactiae 2603V/R 
KingdomBacteria 
Replicon accessionNC_004116 
Strand
Start bp1308706 
End bp1310061 
Gene Length1356 bp 
Protein Length451 aa 
Translation table11 
GC content44% 
IMG OID637316471 
ProductC-5 cytosine-specific DNA methylase 
Protein accessionNP_688295 
Protein GI22537444 
COG category[L] Replication, recombination and repair 
COG ID[COG0270] Site-specific DNA methylase 
TIGRFAM ID[TIGR00675] DNA-methyltransferase (dcm) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000219997 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAATTTT TAGATTTATT TGCTGGGATA GGCGGTTTTA GGCTAGGGAT GGAATCACAG 
GGTCATAAAT GCCTGGGCTT TTGTGAAATT GATAAATTCG CTAGAACATC TTATAAAGCC
ATGTTTAACA CAGAAGGGGA AATAGAATAC CATGACATTA AAGAGGTCAC AGACCATGAC
TTTAGACAAT TTAGAGGGCA AGTGGACATC ATCTGCGGGG GATTTCCTTG CCAAGCATTT
TCACTCGCAG GCAGACGATT GGGATTTGAA GATACTCGAG GGACTCTCTT TTTTGAGATT
GCTCGAGCGG CCAAACAAAT CCAACCACGT TTTCTATTTT TGGAAAACGT CAAAGGCCTA
CTCAATCACG ACGAGGGACG GACGTTCGCC ACAATCCTCT CCACGCTGGA TGAATTGGGG
TATGATGTCG AATGGCAGGT GCTTAACAGC AAGGACTTCC AAGTCCCGCA AAACAGAGAA
CGGGTCTTTA TTATCGGACA TTCTAGAAGA TACCGTTCCA GATTCATATT TCCTCTCAGA
AGAGAAGACA GCCCAGCTCA TCTTGAAAGG CTAGGAAATA TCAATCCCTC TAAACATGGT
TTGAATGGTG AAGTCTATCT GACGAGTGGA CTTGCTCCTA CACTAACAAG AGGTAAAGGA
GAGGGTGCAA AAATCGCCAT TCCAGTCTTA ACACCAGATA GACTAGAAAA ACGCCAACAT
GGTCGTCGAT TTAAGGACAA TCAAGACCCT ATGTTTACTT TGACCAGTCA AGACAAACAC
GGAGTTGTTG TCGCAGGAAA TCTGCCGACT AGCTTTGACC AGACCGGTAG AGTATTTGAC
ATTTCTGGCT TGTCACCGAC CTTGACCACC ATGCAAGGTG GAGACAAGGT GCCAAAGATT
TTGCTGAGGG AGGAGCTGCC ATTTCTGAAA ATCAAGGAAG CCACTAAAAC AGGGTACGCA
AAGGCAACTC TTGGAGACTC TGTCAATCTG GCTTATCCAG ACTCAACCAA ACGTAGGGGA
CGTGTGGGAA AGGGAATATC CAATACTCTG ACGACTTCAG ACAATATGGG AGTAGTAGTT
GCTGCTCTGG AATATCGACA GGATAAGTGG TATGAAGTCA CAGGCATTGT CTTAGAGGGG
AAACTTTATC GCCTGAGAAT AAGACGACTG ACACCAAGAG AGTGCTTCAG ACTTCAAGGC
TTTCCTGATT GGGCTTATGA AAGAGCAGAG AGTGTTTCTA GTAAGAGCCA ACTATACAAA
CAGGCCGGCA ATAGCGTGAC TGTCACAGTT ATTGAAGCCA TTGCCAGAGA ATTTAGAAGA
ACGGAAGAGG AAGAAAAACA TGAACTTACT ACATAA
 
Protein sequence
MKFLDLFAGI GGFRLGMESQ GHKCLGFCEI DKFARTSYKA MFNTEGEIEY HDIKEVTDHD 
FRQFRGQVDI ICGGFPCQAF SLAGRRLGFE DTRGTLFFEI ARAAKQIQPR FLFLENVKGL
LNHDEGRTFA TILSTLDELG YDVEWQVLNS KDFQVPQNRE RVFIIGHSRR YRSRFIFPLR
REDSPAHLER LGNINPSKHG LNGEVYLTSG LAPTLTRGKG EGAKIAIPVL TPDRLEKRQH
GRRFKDNQDP MFTLTSQDKH GVVVAGNLPT SFDQTGRVFD ISGLSPTLTT MQGGDKVPKI
LLREELPFLK IKEATKTGYA KATLGDSVNL AYPDSTKRRG RVGKGISNTL TTSDNMGVVV
AALEYRQDKW YEVTGIVLEG KLYRLRIRRL TPRECFRLQG FPDWAYERAE SVSSKSQLYK
QAGNSVTVTV IEAIAREFRR TEEEEKHELT T