Gene SAG0432 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSAG0432 
Symbol 
ID1013234 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptococcus agalactiae 2603V/R 
KingdomBacteria 
Replicon accessionNC_004116 
Strand
Start bp446066 
End bp447259 
Gene Length1194 bp 
Protein Length397 aa 
Translation table11 
GC content31% 
IMG OID637315637 
ProductAraC family transcriptional regulator 
Protein accessionNP_687466 
Protein GI22536615 
COG category[K] Transcription 
COG ID[COG2207] AraC-type DNA-binding domain-containing proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTGATT TTCTACTGTT GAAATCATTA CACAGTCTTC TAGGTCTAAC TATTACTGTT 
TGTGATCAAA ATTTTTCTGT TATCAGAGAG TATAAATCCG AGAAAACTAT TTCACTTTTT
TACAATCATT ACCTTATTTT AAGTAACTTT AGTAAAACTC AACATGATTT TTTATTTCAT
TATGGCTCTT TAGGAGAACT TTTTCTTGTC CACCATATTC AACAATATTA TATTATCATC
GGTCCCTGGC GTAGCAATGT TATTGACCCT TTACTTCTTA AGAAAAAGCT GACTGAAACA
CAAATCAATG CTAGTGAACA AGACTACTTT ATTGATAGAC TATCCCAATT ACCTTTCTTC
TCACTTAGTC AAATCCGAGA ATTACTAATA GTAACCAATT ATTGTCTAAC TGGGGTTGTG
AAAGATAAAT TATCAGAGCC CTTACATTAC TACACGAAAG GGTGGAGCAA CTCCTTTGAT
CTAGATAAAA TTAAGCAATT TTCTAAACAG AATATGAGTT CTTACAAATA TCAGTATCAT
TTTGAAAATA ATATTCTAAA AGCTGTTAAA TCAGGTAGTG AATTTCTTTT GAAAGAAACA
GTGGAACAAT TTAGCAATTC TATCGTACCT ATAATCAGTG GAGATGAATT GCGATCTGAA
AAAAACTACT CCATCATGAT TTATGATCGT CTCTCACAGG CTACCATCCA AGCAGGACTT
GATATCGAAA CAGCTTATCG AGCACGAGAT CGTTTTATAA AAGAAAACGA GTCAACTATA
AGTCTAAATG AAGTTTTAAA ATTACGTGAT ACTGCTATCT TATTCTATAC TCAACAAGTT
CATTCTTTAA AAAAACATCT CGAAACCCCT CATTCTCAAA CTATTGTCGC AGTGATTCGA
TATCTTGAAA ATAATCTAAA TCGCTTTATT AAAACAGAAG AAATAGCTAA AGAATGTCAC
ATGAGTGAAT CGAAGTTACG AAAACTATTT AAGCAAGAAA AACACATCAC TATTCAACAA
TATTTTCTAA ACTTAAAAAT CGAAGCTGCT AAGCAATTAC TAGATGAAAA TAAGAAAGTA
GAAGAAGTTT CCAATTTACT TGGATTTTCC ACTTCTTCTA ATTTTTCAAG GACATTTAAA
AAAATAGTGG GAATTAGCCC ACTAGAATAT AAGCAAAAGC CTAAGACAAT ATAG
 
Protein sequence
MIDFLLLKSL HSLLGLTITV CDQNFSVIRE YKSEKTISLF YNHYLILSNF SKTQHDFLFH 
YGSLGELFLV HHIQQYYIII GPWRSNVIDP LLLKKKLTET QINASEQDYF IDRLSQLPFF
SLSQIRELLI VTNYCLTGVV KDKLSEPLHY YTKGWSNSFD LDKIKQFSKQ NMSSYKYQYH
FENNILKAVK SGSEFLLKET VEQFSNSIVP IISGDELRSE KNYSIMIYDR LSQATIQAGL
DIETAYRARD RFIKENESTI SLNEVLKLRD TAILFYTQQV HSLKKHLETP HSQTIVAVIR
YLENNLNRFI KTEEIAKECH MSESKLRKLF KQEKHITIQQ YFLNLKIEAA KQLLDENKKV
EEVSNLLGFS TSSNFSRTFK KIVGISPLEY KQKPKTI