Gene SAG1399 details

Gene Information

Locus tag: SAG1399
Symbol:
ID: 1014208
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptococcus agalactiae 2603V/R
Kingdom: Bacteria
Replicon accession: NC_004116
Strand:
Start bp: 1407357
End bp: 1408691
Gene Length: 1335 bp
Protein Length: 444 aa
Translation table: 11
GC content: 36%
IMG OID: 637316575
Product: CBS domain-containing protein
Protein accession: NP_688397
Protein GI: 22537546
COG category: [R] General function prediction only
COG ID: [COG1253] Hemolysins and related proteins containing CBS domains
TIGRFAM ID:
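
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of this unspliced CDS is a stop codon; the variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1407357
end_bp = 1408691

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: unspliced CDS whose last codon is a stop codon,
# so the protein has one residue fewer than the number of codons.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # 1335 444
```

Under these assumptions the result agrees with the listed Gene Length (1335 bp) and Protein Length (444 aa).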


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAAGACC CTGGCAGTCA GAGTTTGTTA CTACAATTTG TTATTTTATT AATATTAACC 
CTATTTAACG CCTTCTTTTC AGCATCTGAA ATGGCTTTAG TATCGCTAAA CCGCTCTAAA
GTAGAACAGA AAGCTGAAGA AGGGGACAAA AGGTATCGAC GTCTTTTGGA CGTTCTTGAA
AATCCCAATA ATTTTTTATC AACAATTCAA GTTGGTATTA CTTTTATTAG CCTTTTGCAA
GGGGCTAGTT TATCTGCCTC ACTAGGACAT GTGATTTCAG GGTGGCTAGG AAATTCAGCA
ACAGCCCGTA CAGCAGGGAG TATTATTGCG CTAATCTTTT TAACATATGT CTCTATTGTT
CTAGGAGAAT TATATCCTAA GCGTATTGCT ATGAATTTGA AAGATCGTCT AGCAATTGTT
TCAGCTCCCA TTATTATCTT TCTAGGGAAA ATCGTAAGCC CCTTTGTATG GTTGCTCTCA
GCTTCAACTA ACTTATTGAG TAGAATCACT CCGATGACTT TTGATGATGC TGACGAGAAG
ATGACACGTG ATGAAATTGA ATATATGTTG ACGAATAGTG AAGAAACTTT GGAAGCTGAA
GAAATTGAGA TGCTGCAAGG GATTTTCTCG CTAGATGAAA TGATGGCGCG TGAAGTTATG
GTTCCGCGCA CTGATGCTTT CATGATTGAC ATCAACAATG ATGCACAATC AAATATTGAA
GGAATTTTAT CACAAAACTT TTCTCGTGTC CCTGTTTTTG ATGACGATAA AGATAGAGTT
GTTGGTGTTT TACATACCAA GCGCTTATTG GAAGCGGGCT TTAAAACTGG TTTTGATACT
ATCGACTTGC GAAAAATACT TCAGGAACCT CTTTTTGTTC CAGAAACGAT TTTTGTAGAT
GACCTTTTAA AAGCATTACG CAACACTCAA AATCAAATGG CCATTTTACT AGATGAATAT
GGTGGCGTTG CTGGTTTAGT AACTCTAGAG GATTTATTGG AAGAAATTGT CGGCGAAATT
GATGATGAAA CAGATACAGC TGAACAATTT GTCAGGGAAA TCGATGAGAA TATCTATATT
GTTCTAGGAA CTATGACGCT TAATGAATTT AATGATTATT TTGAGACAGA GTTAGAAAGT
GATGATGTTG ATACAATTGC GGGTTATTAT TTAACTGGTG TGGGTAGTAT TCCAAACCAA
GAGGAAAAAG TAGCTTACGA AGTAGACAGC AAAGATAAAC ACATCACTTT GATTAATGAT
AAAGTTAAAG ATGGTCGTAT AACAAAATTG AAGGTGCTAT TGTCTGATAT AGAACAGAAT
ATTGAAAAAG ACTAA
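
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be joined before it can be measured. The sketch below shows one way to do that and to compute a GC percentage comparable to the GC content field. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the last two printed lines of the sequence are used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a sequence printed in blocks."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Sample input: the last two printed lines of the gene sequence.
tail = """
AAAGTTAAAG ATGGTCGTAT AACAAAATTG AAGGTGCTAT TGTCTGATAT AGAACAGAAT
ATTGAAAAAG ACTAA
"""

seq = clean_sequence(tail)
print(len(seq), seq[-3:])  # 75 TAA
```

Pasting the full gene sequence into `clean_sequence` and passing the result to `gc_percent` should give a value that can be compared with the listed GC content, and the cleaned length can be compared with the listed Gene Length.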
 
Protein sequence
MQDPGSQSLL LQFVILLILT LFNAFFSASE MALVSLNRSK VEQKAEEGDK RYRRLLDVLE 
NPNNFLSTIQ VGITFISLLQ GASLSASLGH VISGWLGNSA TARTAGSIIA LIFLTYVSIV
LGELYPKRIA MNLKDRLAIV SAPIIIFLGK IVSPFVWLLS ASTNLLSRIT PMTFDDADEK
MTRDEIEYML TNSEETLEAE EIEMLQGIFS LDEMMAREVM VPRTDAFMID INNDAQSNIE
GILSQNFSRV PVFDDDKDRV VGVLHTKRLL EAGFKTGFDT IDLRKILQEP LFVPETIFVD
DLLKALRNTQ NQMAILLDEY GGVAGLVTLE DLLEEIVGEI DDETDTAEQF VREIDENIYI
VLGTMTLNEF NDYFETELES DDVDTIAGYY LTGVGSIPNQ EEKVAYEVDS KDKHITLIND
KVKDGRITKL KVLLSDIEQN IEKD