Gene Ccur_12890 details

Gene Information

Locus tag: Ccur_12890
Symbol:
ID: 8375494
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cryptobacterium curtum DSM 15641
Kingdom: Bacteria
Replicon accession: NC_013170
Strand:
Start bp: 1455393
End bp: 1456661
Gene Length: 1269 bp
Protein Length: 422 aa
Translation table: 11
GC content: 53%
IMG OID: 644994206
Product: cysteine desulfurase
Protein accession: YP_003151649
Protein GI: 256827690
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0520] Selenocysteine lyase
TIGRFAM ID: [TIGR01979] cysteine desulfurases, SufS subfamily
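
The coordinate and length fields above can be cross-checked with a short calculation. The sketch below is illustrative only: the helper name `cds_lengths` is hypothetical, and it assumes that the start and end coordinates are 1-based and inclusive, and that the gene length includes the terminal stop codon while the protein length does not.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and that the last codon is a
    stop codon, which is not counted in the protein length.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


# Values taken from the record above.
print(cds_lengths(1455393, 1456661))  # (1269, 422)
```

Under these assumptions the result agrees with the listed Gene Length (1269 bp) and Protein Length (422 aa).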


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 159
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCACCTT TCGCTCCTCA GGCCACGCAA GAAGCTATTG ACGCGGTCGA TATACAAGAA 
AACCCCTACA AGAAGGATTT TCCCCTTCTT GCAAACGCGC AAGATCTTAC GTTTTTGGAT
AGTGCAGCTA CGGCACAGCG CGCTGCCGGT GTCCTTGATG CTCAACGCGA TTTTTATGAG
CGCCTCAACA GCAATCCGCT TCGGGGGCTC TATCGTCTTT CTATCGAAGC AACTGAGGCA
ATCGATAAGG TACGCCGCAA GGTCGCTGAC TTTATTGGTG CTCCGCTCTC TCAGGAAATT
GTCTTTACCC GTAACGCAAC AGAGTCGCTC AATATTCTGG CGGCATCGCT CGGGCGCACC
ATTTTGGGGC CGGGCGATGA AGTGTGCATT ACCATCATGG AGCATCACAG CAATCTCATT
CCCTGGCAGG AAGTATGCCG TGCCACGGGC GCCAAGTTGG TGTTTATGCG TCCCGACAGT
GAGTGCGTTA TCACACCTGA AGAAATTGAG GCGAAAATCA CGCCGCGCAC GAAGATTGTT
TCAGTCGTGC AGGTAAGCAA CGTACTTGGT GTTGAGAATC CGGTGCGGGA GATTGCTCAA
CGTGCCCATG AAATGGGCGC TCTCTGTTTC GTTGATGGCG CTCAGAGCGC TCCCCACATG
GGTGTCAATG TACAGGAACT TGGCTGCGAT GCATTCGTAT TTTCGGCCCA TAAGATGTTC
GGCCCTATGG GCATTGGTGT ACTTTGGGCA AAGAAAAGCC TTCTCGAAAA AATGCCACCA
TTCCTGACGG GTGGTGAAAT GATCGATTGG GTAACCGAAG AATCGTATGG TTGCGCTGCT
GTTCCGCAGA AGTTTGAAGC AGGCACTCAA GATGCTGCTG GCATTGTTGG ACTGGGGGCA
GCGGTCGACT ATATGCGTGC GGTCAAGCGT GAAAATATTG AGGCGCGCGA GCGGGCTTTA
GTGATTGAAT GCACGCGTCG CTTATGTGAA TTACCCTACA TTCGCCTGAT CGGCCCGTCC
GATCCAGCAT CACATGTTGG GGTGGTGTCA TTCAATGTTT GTGGGGTGCA TCCCCATGAT
GTCGCAAGCA TTCTCGATTC TCAGGGCGTA GCTATTCGCG CTGGCCATCA TTGCGCGCAG
CCCCTTCTTC GCTGGATGGG TGTTGAAAGT TGCTGCCGCG CTTCAGTGGC TCTTTATAAC
GATCAGTCTG ACATCGATGC ATTGGTGTCA GGTATTGCAA AGGTGCGGGA GATATTCCAT
GGCGCTTAA
 
Protein sequence
MAPFAPQATQ EAIDAVDIQE NPYKKDFPLL ANAQDLTFLD SAATAQRAAG VLDAQRDFYE 
RLNSNPLRGL YRLSIEATEA IDKVRRKVAD FIGAPLSQEI VFTRNATESL NILAASLGRT
ILGPGDEVCI TIMEHHSNLI PWQEVCRATG AKLVFMRPDS ECVITPEEIE AKITPRTKIV
SVVQVSNVLG VENPVREIAQ RAHEMGALCF VDGAQSAPHM GVNVQELGCD AFVFSAHKMF
GPMGIGVLWA KKSLLEKMPP FLTGGEMIDW VTEESYGCAA VPQKFEAGTQ DAAGIVGLGA
AVDYMRAVKR ENIEARERAL VIECTRRLCE LPYIRLIGPS DPASHVGVVS FNVCGVHPHD
VASILDSQGV AIRAGHHCAQ PLLRWMGVES CCRASVALYN DQSDIDALVS GIAKVREIFH
GA