Gene Cphy_2844 details

Gene Information

Locus tag: Cphy_2844
Symbol:
ID: 5742160
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium phytofermentans ISDg
Kingdom: Bacteria
Replicon accession: NC_010001
Strand:
Start bp: 3476824
End bp: 3478206
Gene Length: 1383 bp
Protein Length: 460 aa
Translation table: 11
GC content: 42%
IMG OID: 641293936
Product: cysteine desulfurase
Protein accession: YP_001559943
Protein GI: 160880975
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0520] Selenocysteine lyase
TIGRFAM ID:
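
The start and end coordinates, gene length, and protein length listed above are mutually consistent. The sketch below shows the arithmetic. It assumes 1-based inclusive coordinates and a coding sequence whose final codon is a stop codon; neither is stated in the record, although the numbers fit. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(3476824, 3478206)
print(length)                  # 1383
print(protein_length(length))  # 460
```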


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATAACT TCCAATGTAA TACCGCTATA AGTAATTCAA GCGATAATAT ACGTAATATG 
ATGTTTGGGC TGGACGCGCT GGTTGAGCTT GATAACAATA AAATGGTACC TGCTATTAAT
TTAGATAATG CCGCTACTAC TCCACCTTTT AAGGAGGTCA TTCAAGAAAT AGAGCGACAG
CTTATGTACT ATGGCTCCAT CGGTCGCGGT AAAGGACAAA AGTCTGAAAA TTCGACCGAG
GTTTATACAA ACGGACGAGA TATCGTTAAG GATTTTGTTG GAGCAAACAG CGATATTTAT
ACGGTTTTCT ACATCAATAA CGCGACAGAT GGAATAAATA AACTTGCGTC AGCTTTTATC
GAAAGCCCTG AGGACATCGT TCTCTCAACT CGCATGGAGC ATCACGCAAA TGATTTGCCT
TGGCGCGAGC GTACGAAAAC GGTATATGCT GAAGTAGATA AAAAAGGGCG GTTGATTGTC
GATGATATAA AGAGGCTTCT TAAGGCGTAT AACGGCCGAA TTAAGTACGT TACAGTCACA
GCGGCTTCCA ATGTCACAGG TTATGTGAAT GATGTGCACT ACATCGCTAA ACTCGCTCAT
CAATATGGTG CAAAGATCAT TGTAGATGGC GCACAAATTG TCGCTCATCG AGCGTTTAAC
ATGTTAGGGC AAACACTGGA AGAGAATATT GATTTTTTTG TTTTCTCAGC GCACAAAATG
TACTCGCCTT TCGGCGGCGG TGCAGTGGTA GGGCTTACAG ATGTGTTAAA TAAGCATATA
GCTAAATTTT ATGGTGGTGG TATGGTAGAG GCGGTATGTG ATTATTCAGT ACGCTATTTA
CCAGCACCCG ATCGATATGA AGCGGGTTCA CCGAACTACC CAGGTGTAGT TGGAATGCTG
AGAGCTATGG AAGTTCTTAA GTGTATTGGA TTTGATTATA TTAAAAACCA TGAGCAGATA
CTTCTAAGAA GGGCACTGGA TGGACTTATG AAACTTCCGG GGGTGATACT CTACGGTGAT
AATGAAAATA TTGCTGATAG AGTGGGCATT GCTGTATTTA CCCTTCGTGG CATAAAGAAT
GAAGAGGTAG CAAATTTTCT CGCAGGTTAT CGTGCCATCG CTGTTCGCCA TGCTGCCTTT
TGCGCCCACC CTTATGTTCG CCGTCTGACA GGGGGTTCAG ATACGTCGGG CTCATTTTGC
TACCCCCTCG AAGGAATGGT GCGCATTAGC TTTGGAATAT ATAACAATGA AACTGATGTC
GATACCTTTT TAGCAACGAT TAAAGAATTA CTATATAGTG AATACTTAAG ACACTTCGCA
AGAGTTAAAA ATAATTCTGT TCAGTTATCA GATAGATTGT GCATACCATA TGACCGTGCT
TAA
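
The GC content reported in the gene information can be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted as a string in the 10-base blocks shown above; `gc_content` is a hypothetical helper, not part of any database tool. The example call uses only the first two blocks of the sequence.

```python
def gc_content(dna: str) -> float:
    """Percentage of G and C bases, ignoring whitespace between blocks."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two 10-base blocks of the gene sequence, for illustration only.
print(gc_content("ATGGATAACT TCCAATGTAA"))
```

Passing the full 1383 bp sequence should give a value that can be compared with the percentage listed in the record.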
 
Protein sequence
MDNFQCNTAI SNSSDNIRNM MFGLDALVEL DNNKMVPAIN LDNAATTPPF KEVIQEIERQ 
LMYYGSIGRG KGQKSENSTE VYTNGRDIVK DFVGANSDIY TVFYINNATD GINKLASAFI
ESPEDIVLST RMEHHANDLP WRERTKTVYA EVDKKGRLIV DDIKRLLKAY NGRIKYVTVT
AASNVTGYVN DVHYIAKLAH QYGAKIIVDG AQIVAHRAFN MLGQTLEENI DFFVFSAHKM
YSPFGGGAVV GLTDVLNKHI AKFYGGGMVE AVCDYSVRYL PAPDRYEAGS PNYPGVVGML
RAMEVLKCIG FDYIKNHEQI LLRRALDGLM KLPGVILYGD NENIADRVGI AVFTLRGIKN
EEVANFLAGY RAIAVRHAAF CAHPYVRRLT GGSDTSGSFC YPLEGMVRIS FGIYNNETDV
DTFLATIKEL LYSEYLRHFA RVKNNSVQLS DRLCIPYDRA
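
The protein sequence corresponds to a translation of the gene sequence under translation table 11. A minimal sketch of that translation follows. It relies on the fact that table 11 assigns codons to amino acids in the same way as the standard genetic code (the two differ in which codons may act as start codons). The `translate` function is illustrative and stops at the first stop codon.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, ignoring whitespace between blocks."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGGATAACT TCCAATGTAA TACCGCTATA"))  # MDNFQCNTAI
# Last 24 bases of the gene sequence, including the TAA stop codon.
print(translate("TGCATACCAT ATGACCGTGC TTAA"))        # CIPYDRA
```

The two outputs match the first ten and last seven residues of the protein sequence listed above.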