Gene SAG1528 details

Gene Information

Locus tag: SAG1528
Symbol:
ID: 1014337
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptococcus agalactiae 2603V/R
Kingdom: Bacteria
Replicon accession: NC_004116
Strand:
Start bp: 1540800
End bp: 1542515
Gene Length: 1716 bp
Protein Length: 571 aa
Translation table: 11
GC content: 35%
IMG OID: 637316700
Product: chorismate binding enzyme
Protein accession: NP_688522
Protein GI: 22537671
COG category: [E] Amino acid transport and metabolism
              [H] Coenzyme transport and metabolism
COG ID: [COG0115] Branched-chain amino acid aminotransferase/4-amino-4-deoxychorismate lyase
        [COG0147] Anthranilate/para-aminobenzoate synthases component I
TIGRFAM ID: [TIGR00553] aminodeoxychorismate synthase, component I, bacterial clade
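
The reported gene and protein lengths can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions (the record does not state the convention, but this reading reproduces the reported 1716 bp) and that the final codon is a stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 1540800
end_bp = 1542515

# Assumption: coordinates are 1-based and inclusive, so the span
# includes both endpoints.
gene_length_bp = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
codon_count, leftover_bases = divmod(gene_length_bp, 3)

# Assumption: the last codon is a stop codon and is not counted
# in the protein length.
protein_length_aa = codon_count - 1

print(gene_length_bp, leftover_bases, protein_length_aa)
# prints: 1716 0 571
```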


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0770675
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCATATTG AGACTGTTAT TGATTTCAAA GAATTAGGAA AAAGATATCG TTTTAAAAAT 
CCTACAAAAG AATTAATAGC TGATACTTTA GAACAAGTCT TAGAAGTGAT AAAAGAAGTT
GATTATTATC AATCTCAAAA TTATTATGTT GTTGGTTATT TATCTTATGA AGCATCTGCT
GCTTTTGATT CACATTTTAA AGTTTCTCAA CAGAAGTTGG CTGGAGAACA TCTAGCTTAT
TTTACAGTAC ATAAAGATTG TGAGAACGAA GCTTTTCCTT TAAGTTATGA AAATGTTAGA
TTAGCAGATA ATTGGACTGC TAATGTTTCT GAGCAAGAAT ATCAAGAGGC AATTGCTAAT
ATTAAAGGAC AAATTAGACA AGGAAATACT TATCAAGTAA ATTATACACT AGAGCTTAGC
CAACAATTAT GCTCGGATCC CTTTAGTGTT TATGAACGCC TAATGGTAGA ACAAGGAGCA
GGATATAATG CTTATATTGC CTACGACGAT AAAAGGATTT TATCTGTAAG CCCGGAATTG
TTCTTTAAGA AGAAGGATGA AGTCTTAACA ACTAGACCAA TGAAGGGAAC AAGCGCTAGA
AAGCCTACTT ATCAAGAAGA TGTGGCGGAG CGAGATTGGT TGGCAAATGA TCCTAAAAAT
CGTTCTGAAA ATATGATGAT TGTTGATTTA TTAAGAAACG ATATGGGCCG TATCTGTGAT
GTTGGAACAG TTAAGGTAAA AAAACTATGT CAAGTGGAGC AGTATGCAAC TGTGTGGCAA
ATGACATCAA CTATTGAAGG AGTTTTATCG CCAGAAGTGA CACTTATGTC TATTTTTCAA
GCTCTATATC CTTGTGGATC TATCACTGGA GCTCCTAAGA TATCAACAAT GGCTATTATT
AATGAGTTGG AAAAACGGCC AAGAGGTATT TATTGTGGGA CGATAGGACT TTGCATGCCA
GACGGACAAG CTATTTTTAA CGTTCCTATT CGCACAGTAC AAATGAAAGG TCAACAAGCC
TATTACGGCG TTGGTGGAGG TATTACGTGG GAGAGCCAGA CAGATTCTGA ATATGAAGAA
ACCCGTCAAA AATCAGCTGT TTTAACACGT GTCAATCCAA AATTTCAATT AATAACTACA
GGAAGAGTGA CTGAAAATAA ACTTCTTTTC TCTCAACAAC ATGTCGAGAG ATTAGTTGAA
TCAGCGTCTT ATTTTGCTTA TTCTTTTGAT AAAAGTAAAT TCGAAAGGGA ACTGAAAAAA
TACCTTCATC AGCTAGATGA GAAGGATTAC CGCTTGAAAA TAATGCTTGA TAAAACTGGT
AAGGTAACGT TTGAGGTAAA ACAACTAGTG AATTTATCAA AAAAGTTTTT AACGGCAGAG
GTGGTCGTAC AAGACTACCC TATTAAATTA TCCCCGTTTA CTTATTTTAA AACTTCTTAT
CGCCCACATA TTATTGAAGG TCAGAATGAA AAGATATTTG TATCTCCTGA GGGGTTGCTA
TTGGAAACAA GTATTGGGAA TATTGTTTTA GAAAAAAATG GAAGGTTTTT AACCCCAGAT
TTATCAGAAG GAGGGTTGAA TGGGATTTAT CGTCGTCATC TCCTTAAAAA CCAAAAAGTA
ATTGAAGCAC CACTAACTTT AAAAGATTTA GAATCAGCCG ATGCTATATA CGCCTGTAAT
GCTGTTAGAG GGCTTTATCC TCTAAACCTA AAGTAA
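
The gene sequence is printed in space-separated groups of ten bases, so the whitespace has to be stripped before any computation. As a minimal sketch, the hypothetical helper `gc_percent` below computes a GC percentage of the kind reported in the Gene Information table. It is shown on the first row of the sequence only; pasting in the full sequence would give the whole-gene value.

```python
def gc_percent(grouped_sequence: str) -> float:
    """Return the percentage of G and C bases in a DNA string.

    Whitespace (spaces and line breaks between the ten-base groups)
    is ignored. This helper is illustrative, not part of the record.
    """
    bases = "".join(grouped_sequence.split()).upper()
    if not bases:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First row of the gene sequence, copied as displayed.
first_row = "ATGCATATTG AGACTGTTAT TGATTTCAAA GAATTAGGAA AAAGATATCG TTTTAAAAAT"
print(round(gc_percent(first_row), 1))
```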
 
Protein sequence
MHIETVIDFK ELGKRYRFKN PTKELIADTL EQVLEVIKEV DYYQSQNYYV VGYLSYEASA 
AFDSHFKVSQ QKLAGEHLAY FTVHKDCENE AFPLSYENVR LADNWTANVS EQEYQEAIAN
IKGQIRQGNT YQVNYTLELS QQLCSDPFSV YERLMVEQGA GYNAYIAYDD KRILSVSPEL
FFKKKDEVLT TRPMKGTSAR KPTYQEDVAE RDWLANDPKN RSENMMIVDL LRNDMGRICD
VGTVKVKKLC QVEQYATVWQ MTSTIEGVLS PEVTLMSIFQ ALYPCGSITG APKISTMAII
NELEKRPRGI YCGTIGLCMP DGQAIFNVPI RTVQMKGQQA YYGVGGGITW ESQTDSEYEE
TRQKSAVLTR VNPKFQLITT GRVTENKLLF SQQHVERLVE SASYFAYSFD KSKFERELKK
YLHQLDEKDY RLKIMLDKTG KVTFEVKQLV NLSKKFLTAE VVVQDYPIKL SPFTYFKTSY
RPHIIEGQNE KIFVSPEGLL LETSIGNIVL EKNGRFLTPD LSEGGLNGIY RRHLLKNQKV
IEAPLTLKDL ESADAIYACN AVRGLYPLNL K
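
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table by hand rather than relying on a library. Translation table 11 (bacterial, archaeal and plant plastid code) assigns the same amino acids to codons as the standard code and differs only in the permitted start codons, so the standard assignments are used here. The names `CODON_TABLE` and `translate` are illustrative, and only the first three groups and the last six bases of the gene sequence are translated.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order ('*' marks a stop codon).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(grouped_dna: str) -> str:
    """Translate DNA from its first base, stopping at the first stop codon.

    Whitespace between the ten-base groups is ignored; trailing bases
    that do not form a full codon are dropped.
    """
    dna = "".join(grouped_dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three groups of the gene sequence, copied as displayed.
print(translate("ATGCATATTG AGACTGTTAT TGATTTCAAA"))
# prints: MHIETVIDFK

# Last six bases of the gene sequence: one lysine codon, then a stop.
print(translate("AAGTAA"))
# prints: K
```

Both results match the first group and the final residue of the protein sequence shown above.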