Gene Caul_2593 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_2593 
Symbol 
ID5900048 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp2816392 
End bp2817618 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content67% 
IMG OID641563084 
ProductSufS subfamily cysteine desulfurase 
Protein accessionYP_001684218 
Protein GI167646555 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0520] Selenocysteine lyase 
TIGRFAM ID[TIGR01979] cysteine desulfurases, SufS subfamily 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.266337 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCTTTCG ACGCGCCCTT CGACGTCGAG GCGATCCGCG CCCAGTTCCC GATCCTGGCC 
CGCCAGGTGC ATGGCAAGCC GCTGGTCTAT CTGGACAGCG CCGCCAGCGC CCAGAAGCCC
GAGGCGGTTC TGAACGCCAT GACCGGCCTG GCCCGCACCA GCTACGCCAA TGTCCATCGC
GGCCTGCATA CGCTCGCCAA TGAAACGACG CAGGCCTATG AAAACGCCCG GGAAACCGTG
GCGAAGTTCA TCAACGCAGA ACCATCGGAA ATCGTCTGGA CCAAGAGCGC CACCGAGGCG
GTCAACCTGG TCGCCGACAC CTTCGGCCGC TCGCTGAAGG CGGGCGACGA GATCGTCATC
TCGGAAATGG AGCACCACGC CAACATCGTG CCGTGGCACT TCCTGCGCGA GCGCTACGGC
GTGGTGTTGA AGTTCGTGCC GGTCACGGAC GACGGCCGCC TGGACATGGA GGCCTACAAG
GCCCTGTTGT CCGAGCGCAC CAGGATGGTG GCCATCAGCC ACATGTCGAA CGTGCTGGGC
ACCATCAACG ACGCCGCCGA GATCGTCCGC CTGGCCCACG CGGCCGGCGC GCCGGTGCTG
CTGGACGGCT GCCAGGCCAT CGTCCACAGC AAGGTCGATG TGAAGGCCCT CGACGTCGAT
TTCTACGTCT TCTCCGGCCA TAAACTCTAC GGACCGACCG GGATCGGCGT GCTGTACGGC
AAGGCCGAGC GACTGGTCGC CCTGCCGCCG TACCAGGGCG GCGGCGAGAT GATCGGCTCG
GTCTCGATGG ACCGCATCAC TTATACCGAC CCGCCGCACC GCTTCGAGGC CGGCACGCCG
GCGATCCTCG AGGCCGTCGG CCTGGGCGCG GCGATCGACT GGCTGAACGG CATCGACCGC
GACGCGGGCC TGGCCCACGA GCACGCCCTC TACCAGCGGG TCGTCGATCA GCTGGACGGC
GTCAACGGCG TGCGGATCCT CGGCACGGCC CCCGGCAAGG GCGCGGTGCT GAGCTTCGTG
GTCGAGGGCG CCCATGCCCA TGACGTGGCC CAGGTGCTGG ACCGTTACGG CGTGGCCGTC
CGGGCCGGCA CGCACTGCGC CGAGCCCTTG ATGAAAAGGT TCGGCGTCAC GTCGAGCGCC
CGCGCCTCGT TCGCCCTATA TAATACCCTG GCCGAGGCCG ACGCGTTCGT GAGCGCGCTG
GCCAAGGCCC GCAGTTTCTT CAGTTGA
 
Protein sequence
MAFDAPFDVE AIRAQFPILA RQVHGKPLVY LDSAASAQKP EAVLNAMTGL ARTSYANVHR 
GLHTLANETT QAYENARETV AKFINAEPSE IVWTKSATEA VNLVADTFGR SLKAGDEIVI
SEMEHHANIV PWHFLRERYG VVLKFVPVTD DGRLDMEAYK ALLSERTRMV AISHMSNVLG
TINDAAEIVR LAHAAGAPVL LDGCQAIVHS KVDVKALDVD FYVFSGHKLY GPTGIGVLYG
KAERLVALPP YQGGGEMIGS VSMDRITYTD PPHRFEAGTP AILEAVGLGA AIDWLNGIDR
DAGLAHEHAL YQRVVDQLDG VNGVRILGTA PGKGAVLSFV VEGAHAHDVA QVLDRYGVAV
RAGTHCAEPL MKRFGVTSSA RASFALYNTL AEADAFVSAL AKARSFFS