Gene SAG1098 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSAG1098 
SymboliscS-1 
ID1013902 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptococcus agalactiae 2603V/R 
KingdomBacteria 
Replicon accessionNC_004116 
Strand
Start bp1106099 
End bp1107214 
Gene Length1116 bp 
Protein Length371 aa 
Translation table11 
GC content37% 
IMG OID637316280 
Productcysteine desulphurase 
Protein accessionNP_688107 
Protein GI22537256 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1104] Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTTATT TAGACAATGC TGCTACTACC GCTCTAACCC CATCTGTTAT TGAGAAAATG 
ACCAATGTCA TGACAAGTAA CTATGGTAAT CCATCTAGTA TACATACCTT TGGGCGTCAA
GCAAATCAAC TTTTACGTGA ATGTCGACAA ATTATTGCTG AATATCTAAA TGTTAATTCA
CGTGAAATTA TTTTCACTTC TGGGGGAACT GAGAGCAACA ATACAGCTAT CAAAGGGTAT
GCTCTTGCAA ATCAGCTAAA AGGTAAACAT ATTATTACCT CTGAAATTGA ACATCATTCA
GTCCTACATA CTATGACTTA CTTATCAGAG CGATTTGGTT TTGATATTAC TTACTTAAAA
CCAAATCATG GACAAATAAC TGCAAAAGAC GTGCAAGAAG CTTTACGAGA TGATACTATT
ATGGTATCTC TCATGTTTGT TAATAATGAA ACCGGGGACT TTTTACCAAT TCAAGAGATT
GGTCAGCTTC TCAGGAACCA CCAAGCTGTT TTTCACGTTG ATGCCGTTCA AGTCTTTAGC
AAAATGGAAC TTGATCCTCA TTCTTTAGGA ATTGACTTTT TAGCTGCTTC TGCCCATAAA
TTTCACGGTC CAAAAGGTGT TGGGATACTT TACTGTGCTC CCCATCACTT TGATAGTCTA
CTTCATGGTG GAGACCAAGA GGAAAAAAGA CGTGCTTCAA CTGAAAATAT AATTGGTATT
GCTGGAATGT CTCAAGCTCT TACTGATGCT ACGACTAACA CCCTTAAAAA TTGGACTCAC
ATTAGTCAGC TGAGAACGAC CTTTTTAGAT GCTATTTCAG ACCTTGACTT CTATCTTAAT
AACGGTCAAG ACTGCTTACC TCATGTACTT AATATAGGTT TTCCTGGACA GAATAATGGC
TTGTTATTGA CACAATTAGA TTTAGCTGGA TTCGCAGTTT CAACAGGTTC TGCATGTACT
GCAGGAACAG TCGAACCTAG CCATGTCTTA ACAAGCTTGT ACGGAGCCAA CTCACCACGT
CTAAATGAAT CAATACGTAT TAGTTTTTCA GAACTAAATA CCCAAGAAGA AATTCTTGAA
TTAGCTAAAA CCTTAAGAAA AATTATAGGA GATTAA
 
Protein sequence
MIYLDNAATT ALTPSVIEKM TNVMTSNYGN PSSIHTFGRQ ANQLLRECRQ IIAEYLNVNS 
REIIFTSGGT ESNNTAIKGY ALANQLKGKH IITSEIEHHS VLHTMTYLSE RFGFDITYLK
PNHGQITAKD VQEALRDDTI MVSLMFVNNE TGDFLPIQEI GQLLRNHQAV FHVDAVQVFS
KMELDPHSLG IDFLAASAHK FHGPKGVGIL YCAPHHFDSL LHGGDQEEKR RASTENIIGI
AGMSQALTDA TTNTLKNWTH ISQLRTTFLD AISDLDFYLN NGQDCLPHVL NIGFPGQNNG
LLLTQLDLAG FAVSTGSACT AGTVEPSHVL TSLYGANSPR LNESIRISFS ELNTQEEILE
LAKTLRKIIG D