Gene SAG2061 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSAG2061 
Symbol 
ID1014872 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptococcus agalactiae 2603V/R 
KingdomBacteria 
Replicon accessionNC_004116 
Strand
Start bp2041160 
End bp2042365 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content32% 
IMG OID637317227 
Productglycosyl transferase family protein 
Protein accessionNP_689047 
Protein GI22538196 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1442] Lipopolysaccharide biosynthesis proteins, LPS:glycosyltransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAAAAG CAGTTGCACT TGCAGTTGAT TCAAACTACT TGGATAAAGC CTTAGTAACA 
ATAAAGTCTA TTTGTGTTTA TAATAGAAAT ATAACTTTTT ATTTATTCAA TCAAGATACC
CCAGTTGAAT GGGTACGTAA TATAAACAGG AAACTAGAGC CTCTAGGATC AAAACTGATT
AATGTTAAAA TATATAACTA TGATATTGCT CATCTAACGA CTTTTCTAAC TGTTAGTACA
TGGTTTAGAT TATTTTTAGC AGATTATATA CCTAGTTCAC GTGTACTTTA TTTAGATTCA
GATATTATCG TTAACACTAA TCTTGATTAC TTATTTGAAC TAGATTTTAA AGGTTATTAC
TTAGCAGCCG TCAAAGATCC CCATAAAAAT GAAGAAGGAG GGTTTAATGC TGGCATGCTT
TTAGCTAATC TAGAACTATG GCGGGAAGAT GGGCTCACTA AAACATTACT AAAAACAGCT
GAAGAACTCC ACCGAGTTGT CAAAACAGGG GATCAAAGTA TCTTGAACAT TGTTTGCCAT
AATCGTTGGT TATCTCTGAA CAAAACATGG AACTTTCAAA CTTATGATGT CGTTAGCCGC
TATAATCATC GATCTTATTT ATATCTAAAC ATAGAAAATA GAACTCCTAA TATTATACAT
TTTTTAACTA GTGACAAACC TTGGAATGAA AATAGCGTTG CAAGGTTTAG AGAACTATGG
TGGTATTACT TCCAACTTGA TTTTTGCCAA TTAACCGGCA AGCAAAGAAA AGTGATTTCT
TACGAAAAGT CCATGGAATT GCTTTCTGTT TCAGATATTC ATCTTTTCAC TCTTACATCT
TCCGATAATT TAGAACACAT TGAATCGCTA ATTTGTAGAT GTCCTACTGT TCAATTCCAT
ATTGGTGCCT ACACAACAGT GTCAAATAAA CTTAGCAAAC TAGAACAATA TCCAAATGTC
CTAGTTTACC CTGAATTAAT TGAGGCAAGA ATTGAAAAAT TAATAACATT AGCCACTGCA
TACCTTGATA TTAATCATGG TCCAGAAGTA GGTAATATTT TACAAAGAGT TCATTTGAAA
CAAAAGCCAA TTTATTCTTT CAACAACACG TCTCATCAAG AGAATATAAC CAAACATATC
GTTAATCATA ATTCTATTGA TAATATGGTT GTTTTAATTA ACGAATTAAA CCTAAATCAA
TTATAA
 
Protein sequence
MEKAVALAVD SNYLDKALVT IKSICVYNRN ITFYLFNQDT PVEWVRNINR KLEPLGSKLI 
NVKIYNYDIA HLTTFLTVST WFRLFLADYI PSSRVLYLDS DIIVNTNLDY LFELDFKGYY
LAAVKDPHKN EEGGFNAGML LANLELWRED GLTKTLLKTA EELHRVVKTG DQSILNIVCH
NRWLSLNKTW NFQTYDVVSR YNHRSYLYLN IENRTPNIIH FLTSDKPWNE NSVARFRELW
WYYFQLDFCQ LTGKQRKVIS YEKSMELLSV SDIHLFTLTS SDNLEHIESL ICRCPTVQFH
IGAYTTVSNK LSKLEQYPNV LVYPELIEAR IEKLITLATA YLDINHGPEV GNILQRVHLK
QKPIYSFNNT SHQENITKHI VNHNSIDNMV VLINELNLNQ L