Gene SAG1986 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSAG1986 
Symbol 
ID1014797 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptococcus agalactiae 2603V/R 
KingdomBacteria 
Replicon accessionNC_004116 
Strand
Start bp1966825 
End bp1967952 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content32% 
IMG OID637317153 
Productphage integrase family site specific recombinase 
Protein accessionNP_688973 
Protein GI22538122 
COG category[L] Replication, recombination and repair 
COG ID[COG4974] Site-specific recombinase XerD 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAAATG CAAGATATAG AAGGAGAGGA AACCAGAATT TATGGGCTTA TGAAATCCGA 
GAGGAAGGTA AAACAGTTGC TTATAATAGT GGCTTTAAAA CAAAGAAACT AGCAGAGGCA
GAAGCAGAGC CTATTCTCCA AAAATTGAGA ACTGGTAGCA TAATCACAAA AAATATTTCA
TTACCAGAAC TTTATCAAGA ATGGCTAGAT TTAAAAATCA TGCCTAGTAA TAGAAGTGAT
GTGACTAAAA AGAAATATCT TAGTCGGAAG GTAACGCTTG AAAAGCTATT TGGTGATAAA
CCCATTTCTC AAATCAGACC TAGTGAATAT CAACGAATTA TGAATAACTA TGGACAACGA
GTTAGTCGCA ATTTCTTAGG CCGGTTAAAT ACTGGCGTGA AACAGAGTTT GCAAATGGCT
ATTGCTGATA AGGTTATGAT AGAAGATTTT ACTCAAAATG TAGAATTATT CTCAACAGTA
AAAAGTCAGG ATGCAGATAG TAAATACCTT CATAGTGAAA AAGCTTATTT GGATCTCATT
AATGCTGTAA AAGATAAATT CAATTATAAG AAATCGGTAG TACCATATAT TATCTATTTT
TTATTAAAAA CAGGTATGAG ATACGGAGAA TTAATTGCTT TAACTTGGGA AGATATTGAT
TTTGATAAGG GAATCTTTAA AACATATAGA AGATTTAACT CTGAAACAAG TCAGTTTGTT
CCACCTAAAA ATAAAACCTC AATCAGAATT GTTCCAGTTG ATAATGAGTG TCTAGAAATC
TTGAAAAATC TTAAGATTGA ACAAAATCAA TCAAATAAAG AGCTTGGATT ACAAAATACA
AACAATATGG TATTTCAACA TTTTGGTTAT CCTAATAGTG TACCGAGTAC AAACGGTACA
AATAAGGTTT TAAGAGGCAT CGTTCAAGAG TTGAATATAG AACCCATCAT TACAACAAAA
GGTGCTAGAC ACACCTATGG AAGCTTCCTA TGGCATAGAG GATATGATCT AGGAATTATT
GCGAAAATTC TTGGCCATAA AGATATTTCT ATGCTAATTG AAGTCTATGG TCATACTCTT
GAAGAAAAAA TTCAAGAAGA GTACAATGAG ATAAAACAAC TATGGTGA
 
Protein sequence
MANARYRRRG NQNLWAYEIR EEGKTVAYNS GFKTKKLAEA EAEPILQKLR TGSIITKNIS 
LPELYQEWLD LKIMPSNRSD VTKKKYLSRK VTLEKLFGDK PISQIRPSEY QRIMNNYGQR
VSRNFLGRLN TGVKQSLQMA IADKVMIEDF TQNVELFSTV KSQDADSKYL HSEKAYLDLI
NAVKDKFNYK KSVVPYIIYF LLKTGMRYGE LIALTWEDID FDKGIFKTYR RFNSETSQFV
PPKNKTSIRI VPVDNECLEI LKNLKIEQNQ SNKELGLQNT NNMVFQHFGY PNSVPSTNGT
NKVLRGIVQE LNIEPIITTK GARHTYGSFL WHRGYDLGII AKILGHKDIS MLIEVYGHTL
EEKIQEEYNE IKQLW