Gene Cphy_1684 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphy_1684 
Symbol 
ID5741515 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium phytofermentans ISDg 
KingdomBacteria 
Replicon accessionNC_010001 
Strand
Start bp2066377 
End bp2067711 
Gene Length1335 bp 
Protein Length444 aa 
Translation table11 
GC content40% 
IMG OID641292784 
ProductSufS subfamily cysteine desulfurase 
Protein accessionYP_001558795 
Protein GI160879827 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0520] Selenocysteine lyase 
TIGRFAM ID[TIGR01979] cysteine desulfurases, SufS subfamily 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTATTG AACTTATGCA AACGTATTGT TCATGGAGTG AATATGAAAA TGCAATTATT 
GATGTCTTAA GTGTTTATAA GTCTGAAGGT CATGAATCAC CAATACATGC ATCTACATTT
GATGTGGACG CGCTGCGAAA CGATTTTCCG ATCCTCTCAA CAAAGGTTCA TGGAAAACCC
TTGATTTGGC TTGATAATGC CGCAACAACC CAAAAACCTA AGCCTGTTAT AGATCGCCTT
TCCTATTTCT ATGAACACGA AAATTCAAAT ATCCACAGAG GCGCACACAC CCTTGCTGCT
ACGGCTACAG ATGCTTATGA AGCGGCGAGA AACAAAATTA AACATTTTAT AAACGCTTCT
TCTTACGAAG AAATTATTTT TGTACGAGGT GCTACGGAAG GTATTAATCT GGTAGCAGCA
TCTTATGGGA GACATAACCT AAATAAGGAT GACGAGATTC TCGTTTCCTG TCTCGAACAT
CATGCTAATA TTGTACCCTG GCAAATGCTT TGCGCGGAAA CAGGCGCTGT ACTACGTATT
ATACCTGTCG ATGATACCGG ACAGATAGAT ATGCAGTCAT ATAAGAAGCT GTTGTCACCC
AAAGTAAAAA TTGTGGCCGC TTCCTATGTA TCCAATTCTC TTGGCACAAT AACGCCCATT
AAGGACATTG TTGCTATGGC ACATCAATAT GGAGCAAAAG TTTTGGTCGA CGCTGCTCAG
GCAGTACCTC ATTTTAAAGT GGATGTTAAT GATTTAGACT GTGATTTTCT TGTCTTTTCA
GGTCATAAGC TATTTGGACC CACAGGCATC GGCATATTAT ATGGTAAAAA GGATACGTTA
AATGAAATGT CGCCCTATCA AGGCGGCGGA AATATGATTG ACAATGTAAC ATTCGAAAAA
ACTACTTATC AACTGACACC CCAAAGATTT GAAGCGGGTA CAGGCGATAT AGCAGGCGCT
GTGGGACTAG GTGCTGCCGT GGATTATCTG CAGCATATAG GAATGGATCA TATTGCTAAA
TATGAGCATA GTCTTTTGTC GTATACTTCT GATGCCTTAC GTAATATTCC CGGACTGTCT
GTCATAGGAA CAGCCTCAGA AAAAGCTGGA GTCATCTCTT TTACACTAAA CGGTATTGCC
ACAGAATCCA TCGGACAGAT GTTGGATAAA GAAGGGATTG CAGTCCGTAC AGGCCATCAT
TGTTCTTTAC CAATTTTAAG AAGATTTGGC GTAGAAAGCA CAGCCAGAAT TTCATTAGCT
TTTTATAATA CCTACAGAGA GATCGATGTA CTGATAGATA CTCTATGGAA ACTAATAACA
AAGACGTATA CTTAA
 
Protein sequence
MSIELMQTYC SWSEYENAII DVLSVYKSEG HESPIHASTF DVDALRNDFP ILSTKVHGKP 
LIWLDNAATT QKPKPVIDRL SYFYEHENSN IHRGAHTLAA TATDAYEAAR NKIKHFINAS
SYEEIIFVRG ATEGINLVAA SYGRHNLNKD DEILVSCLEH HANIVPWQML CAETGAVLRI
IPVDDTGQID MQSYKKLLSP KVKIVAASYV SNSLGTITPI KDIVAMAHQY GAKVLVDAAQ
AVPHFKVDVN DLDCDFLVFS GHKLFGPTGI GILYGKKDTL NEMSPYQGGG NMIDNVTFEK
TTYQLTPQRF EAGTGDIAGA VGLGAAVDYL QHIGMDHIAK YEHSLLSYTS DALRNIPGLS
VIGTASEKAG VISFTLNGIA TESIGQMLDK EGIAVRTGHH CSLPILRRFG VESTARISLA
FYNTYREIDV LIDTLWKLIT KTYT