Gene SAG0099 details


Gene Information

Locus tag: SAG0099
Symbol:
ID: 1012867
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptococcus agalactiae 2603V/R
Kingdom: Bacteria
Replicon accession: NC_004116
Strand:
Start bp: 108709
End bp: 109956
Gene Length: 1248 bp
Protein Length: 415 aa
Translation table: 11
GC content: 34%
IMG OID: 637315272
Product: GntR family transcriptional regulator
Protein accession: NP_687135
Protein GI: 22536284
COG category: [E] Amino acid transport and metabolism; [K] Transcription
COG ID: [COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs
TIGRFAM ID:
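
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (the record does not state this, but the numbers are consistent with it). The variable names are illustrative and not part of the record.

```python
# Values copied from the record above; variable names are illustrative.
start_bp = 108709
end_bp = 109956

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Each codon is 3 bp; the final codon is the stop codon and adds no residue.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1248 0 415
```

Under that assumption the result agrees with the listed Gene Length (1248 bp) and Protein Length (415 aa).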


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTAACCA AAGTTGAAGA GATTCGCTCA TATTTGATAG CTTCTATACA AAATGGTAAG 
TTGGCTCCAG GAGATCGCCT ACCATCTATA AGACAGTTAG CTAATCAATT TTCCTGTAAC
AAAGATACAG TCCAACGAGT TTTGATGGAA TTGCGTTTTG ATAATTATAT CTATGCAAAG
CCTAGGTCAG GCTATTACGT CTTTGATTCT CATCAAGAGG AAGTTGAAGA AGGGGTTAGT
TTACCAAACT CTGAGATTGC AAATATAGCT TATGATGATT TTAGATTGTG TTTGAATGAG
ACCCTTATTG GTAGGGAAGA TTACCTTTTC AATTATTACT ATCGTCAAGA AGGTCTTCTT
GATTTAAGTA AAGCAGTGGC TAAATTAATG GAAGAAACAG GGGTCTATGT TCCCCTTGAT
GATATTGTTA TTACGGCTGG TACTCAACAG GCATTATTTA TTTTGACACA GGTTACCTTT
CCAAATCGAA AATCTCGAGT TTTAATAGAA GAACCGACCT ATCCTCGTAT GATTGAACTA
ATCAAAACAC AAAATTTACC CTATGAAACT ATTTCTCGAG GTACTCATGG AATTGATTTT
CAGCGTTTAG AGGAGATTTT CCAGACACAA TCAATTAAGT TTTTTTATGT TATACCTCGC
ATGCATAATC CTTTGGGAAC ATCCTATAAT CCGGTAGAGA TGAAAAGATT AATAGAGATG
GCAGAGAAGT ATGATGTTTA TATTGTGGAA GATGACTATA TGTCTGATTT TGCAAGTCAG
TCACCATTAC ATTATTATGA TACTCACGGG CGTGTTATTT ATCTAAAATC TTTTTCAAAG
GCTATTTTCC CTGCTTTAAG ATTAGCTGCG ATTTGTTTAC CACAAGCTTT AAAATCAACA
TTTATGGCTT ACAAGAAGTT GATGGATTAT GATACTAATC TGATTTTACA AAAAGCATTA
GCGCTTTATA TTGAAAATGG CCTTTATGCT AAGAATAGTC AATATTTGAA ATATCGTTAT
CAGAAAGACC TTGCAAATTC AAAATCTATT TTAGCTGATC ACCCTAATCT ACCCTCATAT
AGTTTACATC ACGATAGTGT ATTATTTGAT TGTTCGAAAC TCGATAACTT TAAAATATTA
CGGCAATACG GCGATACTTT GGAAAATTAT TTTTGTCAAA AATCGCATCA ATCTCTCTTA
CAAGTAAAAA ATGATTCCTG CTTAAAGCAG TTCTTGGGAT CGTTGTAG
 
Protein sequence
MVTKVEEIRS YLIASIQNGK LAPGDRLPSI RQLANQFSCN KDTVQRVLME LRFDNYIYAK 
PRSGYYVFDS HQEEVEEGVS LPNSEIANIA YDDFRLCLNE TLIGREDYLF NYYYRQEGLL
DLSKAVAKLM EETGVYVPLD DIVITAGTQQ ALFILTQVTF PNRKSRVLIE EPTYPRMIEL
IKTQNLPYET ISRGTHGIDF QRLEEIFQTQ SIKFFYVIPR MHNPLGTSYN PVEMKRLIEM
AEKYDVYIVE DDYMSDFASQ SPLHYYDTHG RVIYLKSFSK AIFPALRLAA ICLPQALKST
FMAYKKLMDY DTNLILQKAL ALYIENGLYA KNSQYLKYRY QKDLANSKSI LADHPNLPSY
SLHHDSVLFD CSKLDNFKIL RQYGDTLENY FCQKSHQSLL QVKNDSCLKQ FLGSL
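
The relationship between the gene sequence and the protein sequence listed above can be illustrated with a short translation routine. This is a rough sketch, not the database's own method: it uses the standard codon assignments, which translation table 11 shares for internal codons, and it does not handle the alternative start codons that table 11 permits (this gene begins with ATG, so that case does not arise here). The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_text):
    """Remove the spaces and line breaks used in the 10-base block layout."""
    return "".join(seq_text.split()).upper()


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of bases that are G or C."""
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence, copied from the listing above.
first_line = "ATGGTAACCA AAGTTGAAGA GATTCGCTCA TATTTGATAG CTTCTATACA AAATGGTAAG"
print(translate(clean(first_line)))  # MVTKVEEIRSYLIASIQNGK
```

The printed residues match the first two blocks of the protein sequence. Passing the full cleaned gene sequence to `gc_percent` would give a value to compare against the listed GC content.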