Gene SAG1412 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSAG1412 
Symbol 
ID1014221 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptococcus agalactiae 2603V/R 
KingdomBacteria 
Replicon accessionNC_004116 
Strand
Start bp1423553 
End bp1424977 
Gene Length1425 bp 
Protein Length474 aa 
Translation table11 
GC content34% 
IMG OID637316587 
Productpolysaccharide biosynthesis protein 
Protein accessionNP_688409 
Protein GI22537558 
COG category[R] General function prediction only 
COG ID[COG2244] Membrane protein involved in the export of O-antigen and teichoic acid 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACTTT TGAAAAATAT GTTTTATAAT ACCTCATACC AGTTATTGAC TTTGTTGTTA 
CCTTTAGTAA CAGTTCCTTA TGTTTCCCGT GTACTATCTC CTCAAGGCAT CGGTATTAAT
GCCTATACGA GCTCCATTGT CATGTACTTT ACTTTATTTG GTGCTTTAGG GATATCTTTG
TACGGTAATC GTGAGATAGC TTTTGTGCAG TCAAATAAGT ATAAGAGAAG TAAGATATTT
TGGGAATTAG TTGTATTAAA ACTTGCCTCT GTATCTATAG CAACGTTATT ATTCTTTGGA
TTTGTCTTAC TTACAAACGA ATGGCAATTA TTCTATTTAA TTCAAGGTAT TAATTTATTA
GCGACTGCAA CTGATATTTC GTGGTATTTT ATTGGAGTTG AAGACTTTAA GATAATTGTC
ATTCGTAATA CCATTGTTAA GCTTATTACA GTTGTTTTGA CCTTCTTAGT TGTGAAGACG
CCAGATGACC TAGCACTTTA TATGTTTTTA ATTGCATTTG CATCATTACT CGGAAATTTA
ACAGTCTGGC ATCATTTAAA GCATGAAATC ATCAAGATAC CATTTAGTAG ATTAGATATC
TTAATACATC TAAGACCAAC TTTAATGTTA TTTTTACCCC AAATCACCAT GCAGATTTAT
CTCTCTCTTA ACAAGAGTAT GCTGGGAGCA ATGGATAGTG TGGTCAGTGC TGGATATTTT
GATCAATCTG ATAAAATTAT TCGTATTTTA TTTACCATTG TTTCAGCAAT CGGAGGAGTT
TTCTTACCAA GGCTCTCCAG TCTATTTTCT TCAGGAAAAG AAAAGCAAGC TAAGGCTTTA
CTTTTGAAAC TTGTTGATTT AAGCAATGCT ATCTCAATGT TAATGATTGC GGGAGTGGTA
GGAGTTTCGT CGACATTTGC CGTCTTCTTT TTTGGGAAAG GGTATGAAGC TGTCGGTCCG
CTAATGGCTG TTGAGTCTTT GATGATCATT TGCATTTCTT ATGGTAACGC TTTAGGGACA
CAGTATCTAC TGGCTTCAAG ACGTACAAAA GCTTACACAA TGTCTGCAGT TATTGGACTT
GTGGCAAATG TTGTGCTTAA TATACTTTTA ATACCAATTT TAGGTGCTAT GGGAGCTATT
ATCTCTACAG TTATTACTGA ATTTATTGTC TCCCTATATC AAGCTATCTC TTTAAGAGAT
GTATTTACTT TTAAAGAATT AACAAGAGGA ATGCTGCGCT ATTTAATTGC TGCTACTCTT
AGTGGAGCTG TACTCTATTA TATCAATACT CAAATGTCAG TATCATTGGT AAATTATGTC
ATTCAAAGTT TAGTAGCAGT AACTATCTAT GTCGGTATCG TTTTTATAAC CAAAGCACCA
GTCATTCAGT TATTTAGAGA TTTTAGAAAA GAGCATAGAA CATGA
 
Protein sequence
MKLLKNMFYN TSYQLLTLLL PLVTVPYVSR VLSPQGIGIN AYTSSIVMYF TLFGALGISL 
YGNREIAFVQ SNKYKRSKIF WELVVLKLAS VSIATLLFFG FVLLTNEWQL FYLIQGINLL
ATATDISWYF IGVEDFKIIV IRNTIVKLIT VVLTFLVVKT PDDLALYMFL IAFASLLGNL
TVWHHLKHEI IKIPFSRLDI LIHLRPTLML FLPQITMQIY LSLNKSMLGA MDSVVSAGYF
DQSDKIIRIL FTIVSAIGGV FLPRLSSLFS SGKEKQAKAL LLKLVDLSNA ISMLMIAGVV
GVSSTFAVFF FGKGYEAVGP LMAVESLMII CISYGNALGT QYLLASRRTK AYTMSAVIGL
VANVVLNILL IPILGAMGAI ISTVITEFIV SLYQAISLRD VFTFKELTRG MLRYLIAATL
SGAVLYYINT QMSVSLVNYV IQSLVAVTIY VGIVFITKAP VIQLFRDFRK EHRT