Gene SAG1171 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSAG1171 
SymbolcpsE 
ID1013978 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptococcus agalactiae 2603V/R 
KingdomBacteria 
Replicon accessionNC_004116 
Strand
Start bp1174668 
End bp1176056 
Gene Length1389 bp 
Protein Length462 aa 
Translation table11 
GC content30% 
IMG OID637316356 
Productglycosyl transferase CpsE 
Protein accessionNP_688180 
Protein GI22537329 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG2148] Sugar transferases involved in lipopolysaccharide synthesis 
TIGRFAM ID[TIGR03025] exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAAAGAAA AAGAAAATAT ACAAAAGATT ATTATAGCGA TGATTCAAAC AGTTGTGGTT 
TATTTTTCTG CAAGTTTGAC ATTAACATTA ATTACTCCCA ATTTTAAAAG CAATAAAGAT
TTATTGTTTG TTCTATTGAT ACATTATATT GTCTTTTATC TTTCTGATTT TTACAGAGAC
TTTTGGAGTC GTGGCTATCT TGAAGAGTTT AAAATGGTAT TGAAATACAG CTTTTACTAT
ATTTTCATAT CAAGTTCATT ATTTTTTATT TTTAAAAACT CTTTTACAAC GACACGACTT
TCCTTTTTTA CTTTTATTGC TATGAATTCG ATTTTATTAT ATCTATTGAA TTCATTTTTA
AAATATTATC GAAAATATTC TTACGCTAAG TTTTCACGAG ATACCAAAGT TGTTTTGATA
ACGAATAAGG ATTCTTTATC AAAAATGACC TTTAGGAATA AATACGACCA TAATTATATC
GCTGTCTGTA TCTTGGATTC CTCTGAAAAG GATTGTTATG ATTTGAAACA TAACTCGTTA
AGGATAATAA ACAAAGATGC TCTTACTTCA GAGTTAACCT GCTTAACTGT TGATCAAGCT
TTTATTAACA TACCCATTGA ATTATTTGGT AAATACCAAA TACAAGATAT TATTAATGAC
ATTGAAGCAA TGGGAGTGAT TGTCAATGTT AATGTAGAGG CACTTAGCTT TGATAATATA
GGAGAAAAGC GAATCCAAAC TTTTGAAGGA TATAGTGTTA TTACATATTC TATGAAATTC
TATAAATATA GTCACCTTAT AGCAAAACGA TTTTTGGATA TCACGGGTGC TATTATAGGT
TTGCTCATAT GTGGCATTGT GGCAATTTTT CTAGTTCCAC AAATCAGAAA AGATGGTGGA
CCGGCTATCT TTTCTCAAAA TAGAGTAGGT CGTAATGGTA GGATTTTTAG ATTCTATAAA
TTCAGATCAA TGCGAGTAGA TGCAGAACAA ATTAAGAAAG ATTTATTAGT TCACAATCAA
ATGACAGGGC TAATGTTTAA GTTAGACGAT GATCCTAGAA TTACTAAAAT AGGAAAATTT
ATTCGAAAAA CAAGCATAGA TGAGTTGCCT CAATTCTATA ATGTTTTAAA GGGTGATATG
AGTTTAGTAG GAACACGCCC TCCCACAGTT GATGAATATG AAAAGTATAA TTCAACGCAG
AAGCGACGCC TTAGTTTTAA GCCAGGAATC ACTGGTTTGT GGCAAATATC TGGTAGAAAT
AATATTACTG ATTTTGATGA AATCGTAAAG TTAGATGTTC AATATATCAA TGAATGGTCT
ATTTGGTCAG ATATTAAGAT TATTCTCCTA ACACTAAAGG TAGTTTTACT CGGGACAGGA
GCTAAGTAA
 
Protein sequence
MKEKENIQKI IIAMIQTVVV YFSASLTLTL ITPNFKSNKD LLFVLLIHYI VFYLSDFYRD 
FWSRGYLEEF KMVLKYSFYY IFISSSLFFI FKNSFTTTRL SFFTFIAMNS ILLYLLNSFL
KYYRKYSYAK FSRDTKVVLI TNKDSLSKMT FRNKYDHNYI AVCILDSSEK DCYDLKHNSL
RIINKDALTS ELTCLTVDQA FINIPIELFG KYQIQDIIND IEAMGVIVNV NVEALSFDNI
GEKRIQTFEG YSVITYSMKF YKYSHLIAKR FLDITGAIIG LLICGIVAIF LVPQIRKDGG
PAIFSQNRVG RNGRIFRFYK FRSMRVDAEQ IKKDLLVHNQ MTGLMFKLDD DPRITKIGKF
IRKTSIDELP QFYNVLKGDM SLVGTRPPTV DEYEKYNSTQ KRRLSFKPGI TGLWQISGRN
NITDFDEIVK LDVQYINEWS IWSDIKIILL TLKVVLLGTG AK