Gene HMPREF0424_0386 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHMPREF0424_0386 
SymbolsufS 
ID8709207 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGardnerella vaginalis 409-05 
KingdomBacteria 
Replicon accessionNC_013721 
Strand
Start bp415982 
End bp417340 
Gene Length1359 bp 
Protein Length452 aa 
Translation table11 
GC content51% 
IMG OID646482502 
Productcysteine desulfurase, SufS family protein 
Protein accessionYP_003373636 
Protein GI283782882 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0520] Selenocysteine lyase 
TIGRFAM ID[TIGR01979] cysteine desulfurases, SufS subfamily 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.559739 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTACAG TGACTACAGA AAATATTATG CGAGAATTTA GTGATTTGCG CTCGCAGTTT 
CCGATTTTAA GTCGCACGGT TCACGGCACT TCGCTTCACG ATACTGTGCA CGGAAAACCC
TTGATTTACT TGGATTCTGC AGCTACTTCG CAAAAGCCGC AAGTTGTAAT CGACGCCGAA
AAAGAGTTTT ACGAAACAAT TAACGCTGGT GTGCACAGGG GAGCGCACGA GTTGGCGGCA
CTTTCCACGG TTGCGTTCGA AGAAGCGCGT GCGAAAGTTG CGCGTCTCGT TGGAGCTTGC
GCGGATGCTG GCAGCGAGGA AATTATTGTT ACTCCGGGCG CGACCGGCGC GTTTAATCTT
TTCGCAACTG CTATTGGTAA CGCTAGTGCG AAGGCGAGAA TTTGCAAAAA AGACAGCGAT
GATTCTGCTG ATTTCCGCCA AAATTTTGTG CTTCGAGAAG GCGATGTAAT AGTCGTATCT
CGCGCAGATC ATCACTCTGT TTTGCTACCA TTTCAAGAGC TGGCTCTTCG CACTGGAGCG
GAGCTTCGTT GGATTGACGT AACAGAGGAC GGCCGCGTGC GCACGGACGC GGATTATTTG
CGAACAATTA TTGACGAGCG CACGAAAATC GTTGCAGTTA CGCATATTAG CAACGTTACT
GGCGCGATTA CGGATGTTTC GTCGATTTGC AAGCGCGCTC ACGAAGTTGG TGCGATTTTT
GTGCTCGACG CATGCCAGTC AGTGCCTCAT ATTCCGGTTG ATTTTCACAC TTTAAACGTT
GATTTTGCGG CATTTAGTGC GCATAAAATG TATGGTCCTA CAGGAGTTGG ATTCTTGTAT
GGTCGCCGCG AGCTTTTGAA CGCGCTTCCA CCTGCCAATT TTGGCGGCTC AATGGTAGAG
CTTGCGTGGC TTAACAAGCC GGCGCAATAC ATGGATGCGC CTTACCGTTT TGAGGCTGGA
ACTCAGCCGG TTGCGCAAAT GGTTGCGGCT GGCGTTGCAG CGGATTGGTT GCGCGCGGTG
GGCATGAATA AAGTTGCAGA GCATGAGCGG GAAATTGCTG CAGAATTGCT AAAATTGCAA
GACGTTCCGG GAGTTCGCAT TTTGGGGCCG GCTGATTTAG AGAATCGAAT TGGCACGGTT
GCTTTTGAAG TTGAAGGTGT GCATCCGCAC GATGTTGGAC AGTTTTTGGA CGCGCAGGGC
ATTGCGATTC GCGTCGGACA CCATTGTGCT CAGCCGATTC ACCGTCATTT TGGCGTGTAT
GCTTCGAATC GCGCTTCTGT TGGCGTTTAC AACACAGTTG ACGAGGCTCG CGCGTTTGTT
GAAGCAGTCT CACACGTTCG CGCCTACTTT GGCGCATAA
 
Protein sequence
MTTVTTENIM REFSDLRSQF PILSRTVHGT SLHDTVHGKP LIYLDSAATS QKPQVVIDAE 
KEFYETINAG VHRGAHELAA LSTVAFEEAR AKVARLVGAC ADAGSEEIIV TPGATGAFNL
FATAIGNASA KARICKKDSD DSADFRQNFV LREGDVIVVS RADHHSVLLP FQELALRTGA
ELRWIDVTED GRVRTDADYL RTIIDERTKI VAVTHISNVT GAITDVSSIC KRAHEVGAIF
VLDACQSVPH IPVDFHTLNV DFAAFSAHKM YGPTGVGFLY GRRELLNALP PANFGGSMVE
LAWLNKPAQY MDAPYRFEAG TQPVAQMVAA GVAADWLRAV GMNKVAEHER EIAAELLKLQ
DVPGVRILGP ADLENRIGTV AFEVEGVHPH DVGQFLDAQG IAIRVGHHCA QPIHRHFGVY
ASNRASVGVY NTVDEARAFV EAVSHVRAYF GA