Gene SAG1410 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSAG1410 
Symbol 
ID1014219 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptococcus agalactiae 2603V/R 
KingdomBacteria 
Replicon accessionNC_004116 
Strand
Start bp1421579 
End bp1422718 
Gene Length1140 bp 
Protein Length379 aa 
Translation table11 
GC content34% 
IMG OID637316585 
Productglycosyl transferase, group 1 family protein 
Protein accessionNP_688407 
Protein GI22537556 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAAATA AGGTAAAAAC AGTTGCTGTT TTCTCAGGAT ATTATCTACC CTTCCTCGGA 
GGTATTGAAC GTTATACAGA TAAAATGACA GCAGATTTGG TTAAGCGTGG CTACCGTGTT
GTTATTGTAA CAACTAATCA TGGTGATTTA CCGATTATAG ATGAAGACAA GGGTAGAAAA
ATTTACCGTT TACCGACCAA AAATATTGTC AAGCAGCGCT ATCCTATTAT TAACAAAAAT
CGAGAATACA ACACGCTGAT GAAATATGTT TCAGATGAAA ATATTGATTT TGTGATTTGT
AATACACGTT TTCAGTTAAC AACGCTAGAA GGGCTATCTT TTGCAAAGAA TCACCATTTA
CCTAGTATTG TTTTAGACCA TGGGTCAAGT CATTTTTCTG TCAATAATCG TTTTTTAGAT
TTCTTTGGAG CTATCTATGA ACATCTTCTC ACAGCGCGTG TTAAGCATTA TCGTCCAGAT
TTTTATGCGG TATCCAAGAG AAGCGTTGAA TGGTTAAAAC ATTTTAATAT AGAAGCTAAA
GGAGTTATTT ATAATTCGGT ATCTGAAAGT CTAGGCTCTG ATTTTGCTGG TACGGCTTAC
CTTGAAAAAT CTGCTGATGA TATTTTCATC ACTTATGCTG GTAGAATTAT TAAAGAAAAA
GGCATTGAAT TGCTCTTAGA AGCATTTTCG ATGTCACAGT ACTCTGAAAA TGTTTATTTA
CAGATTGCAG GAGACGGACC TGAATTAGCA CATTTGAAAG AGAAATATCA GTCCAAACAG
ATTAACTTTT TGGGGAAATT AAACTTTGAA CAGACTATGT CTTTAATGGC ACAGACTGAT
ATCTTTGTTT ACCCTTCAAT GTATCCAGAG GGATTACCAA CCTCTATTTT AGAAGCTGGT
CTACTTTCAA GTGCTATTAT TGCAACAGAT CGCGGTGGAA CAGTTGAAGT TATTGATAGC
CCTGAGTTAG GAATCATCAT GGAAGAAAAT ACTCAGTCAC TCCATGAATC TCTTGATTTA
TTAGTAAAAG ATAAGGCATT AAGAGAGAAA CTACAACAGA ATATTGCTAA AAGAATTAAA
GAACATTTCA CATGGGAAAA AACAGTTGAA AAGTTGGATT ATATTATTCA AAAAAATTAA
 
Protein sequence
MENKVKTVAV FSGYYLPFLG GIERYTDKMT ADLVKRGYRV VIVTTNHGDL PIIDEDKGRK 
IYRLPTKNIV KQRYPIINKN REYNTLMKYV SDENIDFVIC NTRFQLTTLE GLSFAKNHHL
PSIVLDHGSS HFSVNNRFLD FFGAIYEHLL TARVKHYRPD FYAVSKRSVE WLKHFNIEAK
GVIYNSVSES LGSDFAGTAY LEKSADDIFI TYAGRIIKEK GIELLLEAFS MSQYSENVYL
QIAGDGPELA HLKEKYQSKQ INFLGKLNFE QTMSLMAQTD IFVYPSMYPE GLPTSILEAG
LLSSAIIATD RGGTVEVIDS PELGIIMEEN TQSLHESLDL LVKDKALREK LQQNIAKRIK
EHFTWEKTVE KLDYIIQKN