Gene Cagg_0958 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_0958 
Symbol 
ID7268031 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp1186102 
End bp1187340 
Gene Length1239 bp 
Protein Length412 aa 
Translation table11 
GC content58% 
IMG OID643565806 
Productcysteine desulfurase, SufS subfamily 
Protein accessionYP_002462312 
Protein GI219847879 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0520] Selenocysteine lyase 
TIGRFAM ID[TIGR01979] cysteine desulfurases, SufS subfamily 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGACTT TGTCTCAATT TGATGTGATC GCTTTGCGGC GCGAGTTTCC GATTTTGCAT 
CAGCAGGTGA ATGGCAAGCC GCTGGCGTTT CTCGATAGTG CGGCTTCATC TCAAAAGCCA
CAGCGCGTTA TCGAGACGCT GGAAGATTAC TACCGGCGCT ATAATGCCAA CGTGCATCGT
GGGATTTACC GGCTGAGCGA GGAGGCGACG TTTGCCTACG AGCGAGCACG TGGTAAGCTG
GCCCGTCTGA TCAATGCACC AAGTCAGCGC GAAGTGATCT TTGTGCGGAA TACCACCGAG
GCGATCAATC TGGTTGCGTA TTCGTGGGGG AGTGCCAATG TTCGGGCCGG TGACCGCATC
TTGCTCACGA TGATGGAACA CCATTCCAAT ATCGTGCCGT GGCAGTTGTT GGCGCAACGC
ACCGGTGCTG AGTTGGTCTA TTTGCCGTTT GACGGTCAGG GGCGGTTGGT ACTCGATGAC
CTCGACCGTT TGCTCGATGA ACGGGTGAAG TTGGTTGCGT TTACGCACCA ATCGAACGTC
TTCGGGACGA TTAACCCGGT TGAACCGATT GTGGCGCGGG CCCGCACGGT TGGCGCACGG
GTACTGCTCG ATGCTGCGCA GAGTGTACCA CATATGCCGG TAGACGTGCA GGCGTTAGGG
GTCGATTTTC TTGCCTTTAG TGGACATAAG ATGTGTGGTC CGACCGGAAG CGGCGTGCTA
TGGGGGCGGC GTGAGCTGCT CAACGCGATG CCGCCGTTTC TCGGTGGTGG CTCGATGATC
GACCTGGTTG AACTGGATCA CAGCACGTTT GCCGCCGCGC CGACTCGGTT TGAGGCGGGC
ACGCCGGCGA TTGGTGAGGC AATTGCGCTC GGTGAAGCAG CCGATTACCT GCAAGAGGTC
GGTTTGACGG CGATCCACCA CTACGAGCAA GAATTGACGG CATATGCCCT CGAACGTTTG
GCCGAGGTAC CGGGCCTGAC CGTCTATGGG CCACCGGCAG GGGCGGATCG GGGTGGTGCA
GTGAGTTTCT CGCTCGAAGG AGTGCATCCG CACGACGTAG CCGCTATCCT CGATCAAGAA
GGGGTGGCGG TGCGGGCCGG CCATCATTGT ACGCAGCCAC TCCATCGGGT GCTTGGCGTA
CCGGCAACGA CCCGGGCTAG CTTCTATCTC TACAATTTGC CCGAAGAGAT CGATCGGTTG
GTGGCGGCAT TGCATAAGGC ACGTCACATC TTTGCCTAG
 
Protein sequence
MATLSQFDVI ALRREFPILH QQVNGKPLAF LDSAASSQKP QRVIETLEDY YRRYNANVHR 
GIYRLSEEAT FAYERARGKL ARLINAPSQR EVIFVRNTTE AINLVAYSWG SANVRAGDRI
LLTMMEHHSN IVPWQLLAQR TGAELVYLPF DGQGRLVLDD LDRLLDERVK LVAFTHQSNV
FGTINPVEPI VARARTVGAR VLLDAAQSVP HMPVDVQALG VDFLAFSGHK MCGPTGSGVL
WGRRELLNAM PPFLGGGSMI DLVELDHSTF AAAPTRFEAG TPAIGEAIAL GEAADYLQEV
GLTAIHHYEQ ELTAYALERL AEVPGLTVYG PPAGADRGGA VSFSLEGVHP HDVAAILDQE
GVAVRAGHHC TQPLHRVLGV PATTRASFYL YNLPEEIDRL VAALHKARHI FA