Gene SAG1175 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSAG1175 
SymbolcpsA 
ID1013982 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptococcus agalactiae 2603V/R 
KingdomBacteria 
Replicon accessionNC_004116 
Strand
Start bp1178207 
End bp1179664 
Gene Length1458 bp 
Protein Length485 aa 
Translation table11 
GC content32% 
IMG OID637316360 
Productcapsular polysaccharide biosynthesis protein CpsA 
Protein accessionNP_688184 
Protein GI22537333 
COG category[K] Transcription 
COG ID[COG1316] Transcriptional regulator 
TIGRFAM ID[TIGR00350] cell envelope-related function transcriptional attenuator common domain 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTAATC ATTCGCGCCG TCACCAAAAG AAACACTCAC ATACACCTCT ACGGGTGATT 
AATTTATTTC TTTTGGTGAT TTTTATTTTG TTAAGTGTAG TCTCATTATT TCTTATGTAT
CGTCACCATT TTTTGGCATT TAGACACCTG AACGTCATTT ATGGAGTTGT AATTGTTTTA
ATCATTTTAG TAAGTTTATT TCTTTGTATT AAGAATAAAG CTAGAATTTT TACAACTATA
ATTTTAGTAT TAGCTTCTAT TTTCGTTGCT ACTACTTTAT ATGGATTTAA GTCAACCATT
GATTTGACAA ATAATCTAAA TAAAACTGCT TCATACTCTG AAATTGAGAT GAGTGTAGTT
GTACCAAAAG ATTCTAAAAT AACCAATATA GAAGCTGTCA GCAAATTAGC CGCACCAGTT
AAAAACGATA CTTCAAATAT TACTGATTTG ATAGAACATA TAAAATCAGA AAAAGGAATC
TCTATTACAC CACAAAAAAC AGATTCTTAC CAGGATGCAT ACAATAGAAT TAAAAGTGGT
GATAGTCAGG CTATGGTTTT AAATAATGCT TATGTTAGCT TAATTGAACT TAGCACCCCT
GATTTTAAAT CGCAGATAAA AACGATTTAT ACTTACAAAA TTAAGAAAAA AATTAATCGT
AAAAATACTA ATCATAAAGA AGGGGTATTT AATATCTATA TTAGCGGTAT TGATACTTTT
GGCTCTATAT CAACAGTATC AAGATCTGAT GTAAATATTA TTATGACGGT TAATACCAAT
ACCCACAAAG TATTGTTAAC GACAACACCA CGAGATGCCT ATGTAAAAAT TCCAGATGGT
GGGGGCAATC AATATGATAA ATTAACCCAT GCAGGTTTGT ATGGCGTTGA GACATCAATG
AAAACACTTG AAAACCTTTA CGACATCAAC CTTGATTATT ATGCTAGAAT TAATTTTTCA
TCATTTTTAA AATTAATAGA CCTCTTGGGA GGAGTGACAG TTTATAACGA TCAAGCTTTT
ACAAGTAAAC ATGGTAATTT TGACTTCCCT GTTGGTCAAG TAACATTGAA TTCTGAGCAG
GCTTTGGGCT TTGTTAGAGA ACGTTATTCT CTACAAGGAG GCGATAACGA TAGAGGTAGA
AATCAAGAAA AAGTGATTGC AGCTATTATA AATAAGTTAG CTTCTAGTCA GTCAGTAACA
AAATTAAATA GCATTACCTC ACAGCTCCAA ACGTCCGTTC AAACTAATAT GACTATTGAT
AACATTAATG ATTTGATTAA CAATCAATTG TCAACTGGAC AACGCTTCAC TGTCGAGTCA
CAAGCATTAA CTGGTCATGG TTCAACGGGT GAACTCCCTT CATATGCAAT GCCAGGAGCT
CAACTTTATA TGATGTCAAT TGATCAATCT AGCTTATCTA ATGCAAAATC AAAAATTAAG
AACACAATGG AGGAATAA
 
Protein sequence
MSNHSRRHQK KHSHTPLRVI NLFLLVIFIL LSVVSLFLMY RHHFLAFRHL NVIYGVVIVL 
IILVSLFLCI KNKARIFTTI ILVLASIFVA TTLYGFKSTI DLTNNLNKTA SYSEIEMSVV
VPKDSKITNI EAVSKLAAPV KNDTSNITDL IEHIKSEKGI SITPQKTDSY QDAYNRIKSG
DSQAMVLNNA YVSLIELSTP DFKSQIKTIY TYKIKKKINR KNTNHKEGVF NIYISGIDTF
GSISTVSRSD VNIIMTVNTN THKVLLTTTP RDAYVKIPDG GGNQYDKLTH AGLYGVETSM
KTLENLYDIN LDYYARINFS SFLKLIDLLG GVTVYNDQAF TSKHGNFDFP VGQVTLNSEQ
ALGFVRERYS LQGGDNDRGR NQEKVIAAII NKLASSQSVT KLNSITSQLQ TSVQTNMTID
NINDLINNQL STGQRFTVES QALTGHGSTG ELPSYAMPGA QLYMMSIDQS SLSNAKSKIK
NTMEE