Gene SAG0644 details


Gene Information

Locus tag: SAG0644
Symbol:
ID: 1013448
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptococcus agalactiae 2603V/R
Kingdom: Bacteria
Replicon accession: NC_004116
Strand:
Start bp: 630361
End bp: 631569
Gene Length: 1209 bp
Protein Length: 402 aa
Translation table: 11
GC content: 35%
IMG OID: 637315837
Product: AraC family transcriptional regulator
Protein accession: NP_687664
Protein GI: 22536813
COG category: [K] Transcription
COG ID: [COG2207] AraC-type DNA-binding domain-containing proteins
TIGRFAM ID:
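
The coordinates and lengths in the table above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, although it agrees with the reported gene length) and that the final codon of the CDS is a stop codon. The function names are illustrative and not part of any database tool.

```python
# Values copied from the Gene Information table.
start_bp = 630361
end_bp = 631569
reported_gene_length = 1209      # bp
reported_protein_length = 402    # aa


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


computed_gene_length = gene_length(start_bp, end_bp)
computed_protein_length = protein_length(computed_gene_length)

# Compare the computed values with the ones reported in the record.
print(computed_gene_length == reported_gene_length)
print(computed_protein_length == reported_protein_length)
```

Both comparisons hold for this record: 631569 - 630361 + 1 = 1209 bp, and 1209 / 3 - 1 = 402 aa.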


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTTACAT TCGATTTTAA ACATGTTCAA ACATTGCATA CCATATCTCA ACTACCTATT 
TCAGTCATGT CACGAGACAA GGAACTCGTT CAGTTATATG GTAACGAAGA CTATCTGTTG
CCTTACTATC AGTTTTTAAA ACATTTAGCT ATCCCTTATA ACCAAGATAT CACTGTTTAT
GAGGGTCTTT TTGAAGAATC ATTCTTAATT TTTCCTGTCT GTCAATATCT TATTGCTATT
GGACCATTTT ATCCCTATAG CCCTGATATC AATAGGCAAG AACAGTTATC CAGTCGCTTT
CTAGAACAAT TTTCTCATCG TAATAAAAAA GAAATCTTAT CCTATATAAA GCTTGTCCCT
TGTTTTCCTA CTACCAGTAT ACGTAGTCTT CTTGTGTCTA TCGATGCCTT TTTCCAAACA
CAGTTTGAGG CTAGTTGCCA ACAAGTCATC AATCATTTAT TAGAAGAGTC AGAACAGATT
GTTGCGGATC CTGATATTGT TCTTCACCTA AAACATACTA AGAAAAACTC TTTTCAGTTA
CCCACTGTTT TAAACCATCT CAATCACATT ATTGATCTCG TTAAGCTGGG TAACACTCAA
CTGTTAAAGC AAGAAATTAA TCGCCTCCCA TCATCCAGTG TTACCTCATC TTCAATCCCT
GCCCTAAGAG CTGAAAAGAA CTTAACTGTT GTTTACTTAA CAAAATTACT AGAATTAAGT
TTCGAGGAAA ATACTGATGT GGCTAAAAGT TATGCGCTGG TAAAGCACTA TATGGCTTTA
AACGAAGAGG CTCCTGATCT TATTGATGTT TCAAGAATTC GCTGTGCAGC TCTTATTGAT
TTTTCAGAAT CCTTAACCAA TAAGAGCATC TCTGACAAGC AGCAAATGTA CAATAGTATT
CTCCATTATG TGGACAACCA CCTCTACTCC AAACTCAAAG TATCTGATAT TGCCAACTAC
CTATATATCT CAGATTCCCA CTTACGCTCA GTTTTTAAAA AATACTCTGA CATTTCCTTG
CAAAGTTATA TTCTAAAGGC AAAAATTAAG GAGGGACAAT TACTACTGCA AAGAGGGGTA
CCGATTGGGG AAGTAGCGAA ATTATTACAT TTTTACGACA CCACACATTT TCTTAAAACC
TTTAAAAAAT ACGTGGGAAT ATCTTCAAAC GAATATCTTA CTAAATATCG TGAGACCTCA
TGCCAGTAA
 
Protein sequence
MVTFDFKHVQ TLHTISQLPI SVMSRDKELV QLYGNEDYLL PYYQFLKHLA IPYNQDITVY 
EGLFEESFLI FPVCQYLIAI GPFYPYSPDI NRQEQLSSRF LEQFSHRNKK EILSYIKLVP
CFPTTSIRSL LVSIDAFFQT QFEASCQQVI NHLLEESEQI VADPDIVLHL KHTKKNSFQL
PTVLNHLNHI IDLVKLGNTQ LLKQEINRLP SSSVTSSSIP ALRAEKNLTV VYLTKLLELS
FEENTDVAKS YALVKHYMAL NEEAPDLIDV SRIRCAALID FSESLTNKSI SDKQQMYNSI
LHYVDNHLYS KLKVSDIANY LYISDSHLRS VFKKYSDISL QSYILKAKIK EGQLLLQRGV
PIGEVAKLLH FYDTTHFLKT FKKYVGISSN EYLTKYRETS CQ
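
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. It is a minimal example under stated assumptions, not the pipeline used to produce this record: the helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, the codon table is built by hand rather than taken from a library, and only unambiguous bases (A, C, G, T) are handled. Translation table 11 assigns the same amino acids to codons as the standard genetic code, so one table serves for both.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
# Table 11 differs from the standard table only in its permitted start codons.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq_text: str) -> str:
    """Join a sequence printed in space-separated blocks over several lines."""
    return "".join(seq_text.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Raises KeyError on ambiguous bases such as N (not handled in this sketch).
    """
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence, used as a short demonstration.
first_line = "ATGGTTACAT TCGATTTTAA ACATGTTCAA ACATTGCATA CCATATCTCA ACTACCTATT"
last_line = "TGCCAGTAA"

print(translate(clean(first_line)))
print(translate(clean(last_line)))
```

The first call should reproduce the first 20 residues of the protein sequence (MVTFDFKHVQ TLHTISQLPI), and the second its last two residues (CQ) before the TAA stop codon. Passing the full cleaned gene sequence to `gc_fraction` gives a value that can be compared with the GC content reported for the gene.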