Gene Sfum_1204 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSfum_1204 
Symbol 
ID4460476 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSyntrophobacter fumaroxidans MPOB 
KingdomBacteria 
Replicon accessionNC_008554 
Strand
Start bp1490411 
End bp1491556 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content63% 
IMG OID639701971 
Productcysteine desulfurase family protein 
Protein accessionYP_845332 
Protein GI116748645 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0520] Selenocysteine lyase 
TIGRFAM ID[TIGR01977] cysteine desulfurase family protein 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.515711 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTCTCT ATCTCGATAA CGCCGCAACG TCGTTTCCCA AGCCGCCGTC GGTGTATGAA 
GCCGTACGGC ACGCCCTGAC GGAAGTCGGA GCGAGTCCCG GCAGGGCCTC TCACCGGCAT
GCCAGGCTGG CCTCTTCGAT GGTGGGGGCC GCACGCGAGA AGGTGGCCTC TTTCCTGGGC
ATCGGAGATG CCGACCGGGT GATTTTCACG AAGAACGCCA CCGAGAGCAT CAATATCGTC
TTGAAGGGGT GGTTGAAAAG GGGGGATCGC GTGCTGATCT CGGCCATGGA ACACAATTCC
GTGGTCCGTC CGCTGAAACG ACTGAGTGAA ATCGGCGTGA GCACCGAAAT CGTTCCCTGC
AGCGGCAGCG GAGCCATCGA CGTGGATGAG CTGCGGCGGA GGCTGGAGTC GCGTCCCCGG
CTGATGGCGA TGACCCACGC TTCCAACGTG AACGGCGCGC TCCTTCCGGC GGAAGCAGTG
GCGCAAATGT GCAGCGAATT CGGCGTCCCG CTTTTGCTCG ATGCGGCCCA AACGGCGGGC
GTTCAGGCCA TAAGGGCCGA TAAATGGCGC CTGGCGATGC TGGCGTGTTC CGCCCATAAG
GGGCTGCTTG GTCCTCCCGG GGTCGGCGTG CTTTTCATCC GTTCGGGGCT GGACGTGGAG
CCCTTGTTGG AGGGCGGAAC GGGGAGCCGG TCGGAGGACG CGATACAGCC CGAAATCTGC
CCGGACCGCT ACGAGAGCGG CACTCCAAAC CTGCCCGGGA TCGCGGGACT TGCCGCGGGC
ATCGATTACA TCCTGAGCAG CGGTCTTGAA ACCATTCGCG ATCACGAACT GGGGCTGGCG
GTTCGCCTTG AAGAGCAATT GCGGGCTATT CCCGGAATTA CTGTCATCAG TCCCGAAGTG
CGGGGAACGG CGACGGTCTC GTTCACGATG GCGGGGATCA ATCCGGCCGA TGCGGGACAC
CTGCTCGACG AAGGATACGA TATTGCGGTG CGGACGGGAT TGCACTGCGC TCCCCTCGCT
CACCGGACAT TCGGGACGTT TCCGGAGGGC ACCGTTCGCG TTTCGCCGGG GTATGCGACG
ACCGCGGCGG ATATGGAGCG GCTTGCCGAG GCGATACGGG ACCTGGCGTT GCTTCGCCGC
CGATGA
 
Protein sequence
MTLYLDNAAT SFPKPPSVYE AVRHALTEVG ASPGRASHRH ARLASSMVGA AREKVASFLG 
IGDADRVIFT KNATESINIV LKGWLKRGDR VLISAMEHNS VVRPLKRLSE IGVSTEIVPC
SGSGAIDVDE LRRRLESRPR LMAMTHASNV NGALLPAEAV AQMCSEFGVP LLLDAAQTAG
VQAIRADKWR LAMLACSAHK GLLGPPGVGV LFIRSGLDVE PLLEGGTGSR SEDAIQPEIC
PDRYESGTPN LPGIAGLAAG IDYILSSGLE TIRDHELGLA VRLEEQLRAI PGITVISPEV
RGTATVSFTM AGINPADAGH LLDEGYDIAV RTGLHCAPLA HRTFGTFPEG TVRVSPGYAT
TAADMERLAE AIRDLALLRR R