Gene SAG1448 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSAG1448 
Symbol 
ID1014257 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptococcus agalactiae 2603V/R 
KingdomBacteria 
Replicon accessionNC_004116 
Strand
Start bp1461074 
End bp1462582 
Gene Length1509 bp 
Protein Length502 aa 
Translation table11 
GC content34% 
IMG OID637316622 
Productglycosyl transferase, group 1 family protein 
Protein accessionNP_688444 
Protein GI22537593 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID[TIGR02918] accessory Sec system glycosylation protein GtfA 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.622489 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAGTTT ATAATATAAA CCATGGAATT GGTTGGGCAA GCAGCGGTGT TGAGTACGCT 
CAAGCATACC GAGGTTCTGT TTTGAGAAAA CTTGGAATTG ATGCTAAGTT TATTTTTACA
GATTTTTTTT CAGCTGATAA TATTATTAGC CTCACGCGAA ATATAGGTTT TCAGGATAAA
GAAATTATTT GGTTATATAA TTTTTTTTCA GATATTGAGC TTGCCCCGAC AACTTTTAGT
ATAGAAGATT TGCAAAAACA ATATCTTGGA AAACTTGTTC GAAAAGAAGA TAAAGGTCGG
GTAATCAAGT TTTTCTACGA AGATGAAAAT ATCTATTTAA CAGTTTACCT TGATAATTTT
AACAAGGATA AAGTGCATCG TGTAGAAATC GTTTCAAATA ACAACCTTAT CAGAAAAGAT
TATTATAGTT ATACACGTAC TTTCTCAGAG TATTACTATC CTAAAGATGG TGTAGCTCAC
CTTTATCAAC GTAGATTCTA CAATGAAGAT GAAAGTACAG CCTTTTTGGA GATGGTAGAA
AATAGCTCAA GTCGTTTTAT TATCAATGGC AGACTTTTAC CATCCAAAGT AGCTTTTTTT
GATTATTTTT TGGAATCAAT GACTTTCACT TCAAAAGATA TTATTCTTTT GGATAGAGGA
ACTGATACCG CTCAGAGTCT TTTGCGCCAT GGTAAACCCG CTAAGCTAGG GACTGTTGTT
CATGCGGAAC ATTTTAGTGA GAATGCTGTG ACTGCTGACA CTATTTTATG GAATAACTAT
TACGATTATC AGTTTACTAA CGCTAATAGA TTTGATTTTT TTATTACCTC CACTGATAAA
CAAACAGAAC TTTTGGAACA ACAATTTAAA CAATTTACAA ATCATAATCC TAGAATTATA
ACTATCCCGG TAGGCTCAAT TGACAATCTT AAAATGCCAA TGGACAATCG CCGTCCGTAC
TCTATTTTGA CAGCTTCACG CCTAGCTAGT GAAAAACATG TAGATTGGTT AGTACGTGCA
GTTATTAGGA TAAGAGAAAT TCTTCCTGAA GTGACCTTTG ATATCTATGG ATCAGGTGGA
GAAGAAGAAA AAATTAGAAA TATTATAAAT GCAGCCAATG CAACGGAATA CATTCGATTG
ATGGGACATA AAAATCTCTC GAATGTTTAT CAAAATTATG AGTTATATTT GACAGCTTCT
AAAAGTGAGG GGTTTGGCTT AACTTTACTT GAAGCTATTG GCGCAGGACT TCCTTTGATT
GGGTTTGATG TTCGTTATGG TAATCAAACT TTTATCAAAG ACGGAGAAAA TGGTTATCTA
ATTCCTCGAT TTGATATGGA TGATGAGGAA GCTATTGTAG AAGCTTTTAA AGAGAAAGTG
TTACAATTAT TCCAACAGGA TCAAAAGGCT TTACGAGAAG CTTCTTACGC CATTGCCGAA
GGATTCTTAA CAAATGAAGT AGAAGGAAAA TGGTATAACT TAGTTAAGGA GTTGGTACAA
GATGATTAA
 
Protein sequence
MTVYNINHGI GWASSGVEYA QAYRGSVLRK LGIDAKFIFT DFFSADNIIS LTRNIGFQDK 
EIIWLYNFFS DIELAPTTFS IEDLQKQYLG KLVRKEDKGR VIKFFYEDEN IYLTVYLDNF
NKDKVHRVEI VSNNNLIRKD YYSYTRTFSE YYYPKDGVAH LYQRRFYNED ESTAFLEMVE
NSSSRFIING RLLPSKVAFF DYFLESMTFT SKDIILLDRG TDTAQSLLRH GKPAKLGTVV
HAEHFSENAV TADTILWNNY YDYQFTNANR FDFFITSTDK QTELLEQQFK QFTNHNPRII
TIPVGSIDNL KMPMDNRRPY SILTASRLAS EKHVDWLVRA VIRIREILPE VTFDIYGSGG
EEEKIRNIIN AANATEYIRL MGHKNLSNVY QNYELYLTAS KSEGFGLTLL EAIGAGLPLI
GFDVRYGNQT FIKDGENGYL IPRFDMDDEE AIVEAFKEKV LQLFQQDQKA LREASYAIAE
GFLTNEVEGK WYNLVKELVQ DD