Gene Information |
Locus tag | Caul_2593 |
Symbol | |
ID | 5900048 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 2816392 |
End bp | 2817618 |
Gene Length | 1227 bp |
Protein Length | 408 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641563084 |
Product | SufS subfamily cysteine desulfurase |
Protein accession | YP_001684218 |
Protein GI | 167646555 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0520] Selenocysteine lyase |
TIGRFAM ID | [TIGR01979] cysteine desulfurases, SufS subfamily |
|
|
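The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the Start bp and End bp values are 1-based and inclusive (the record does not state the convention, but this reading reproduces the listed 1227 bp) and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Caul_2593.
length = gene_length_bp(2816392, 2817618)
print(length)                     # 1227
print(protein_length_aa(length))  # 408
```

Both results agree with the Gene Length and Protein Length rows of the record.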
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.266337 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCTTTCG ACGCGCCCTT CGACGTCGAG GCGATCCGCG CCCAGTTCCC GATCCTGGCC CGCCAGGTGC ATGGCAAGCC GCTGGTCTAT CTGGACAGCG CCGCCAGCGC CCAGAAGCCC GAGGCGGTTC TGAACGCCAT GACCGGCCTG GCCCGCACCA GCTACGCCAA TGTCCATCGC GGCCTGCATA CGCTCGCCAA TGAAACGACG CAGGCCTATG AAAACGCCCG GGAAACCGTG GCGAAGTTCA TCAACGCAGA ACCATCGGAA ATCGTCTGGA CCAAGAGCGC CACCGAGGCG GTCAACCTGG TCGCCGACAC CTTCGGCCGC TCGCTGAAGG CGGGCGACGA GATCGTCATC TCGGAAATGG AGCACCACGC CAACATCGTG CCGTGGCACT TCCTGCGCGA GCGCTACGGC GTGGTGTTGA AGTTCGTGCC GGTCACGGAC GACGGCCGCC TGGACATGGA GGCCTACAAG GCCCTGTTGT CCGAGCGCAC CAGGATGGTG GCCATCAGCC ACATGTCGAA CGTGCTGGGC ACCATCAACG ACGCCGCCGA GATCGTCCGC CTGGCCCACG CGGCCGGCGC GCCGGTGCTG CTGGACGGCT GCCAGGCCAT CGTCCACAGC AAGGTCGATG TGAAGGCCCT CGACGTCGAT TTCTACGTCT TCTCCGGCCA TAAACTCTAC GGACCGACCG GGATCGGCGT GCTGTACGGC AAGGCCGAGC GACTGGTCGC CCTGCCGCCG TACCAGGGCG GCGGCGAGAT GATCGGCTCG GTCTCGATGG ACCGCATCAC TTATACCGAC CCGCCGCACC GCTTCGAGGC CGGCACGCCG GCGATCCTCG AGGCCGTCGG CCTGGGCGCG GCGATCGACT GGCTGAACGG CATCGACCGC GACGCGGGCC TGGCCCACGA GCACGCCCTC TACCAGCGGG TCGTCGATCA GCTGGACGGC GTCAACGGCG TGCGGATCCT CGGCACGGCC CCCGGCAAGG GCGCGGTGCT GAGCTTCGTG GTCGAGGGCG CCCATGCCCA TGACGTGGCC CAGGTGCTGG ACCGTTACGG CGTGGCCGTC CGGGCCGGCA CGCACTGCGC CGAGCCCTTG ATGAAAAGGT TCGGCGTCAC GTCGAGCGCC CGCGCCTCGT TCGCCCTATA TAATACCCTG GCCGAGGCCG ACGCGTTCGT GAGCGCGCTG GCCAAGGCCC GCAGTTTCTT CAGTTGA
|
Protein sequence | MAFDAPFDVE AIRAQFPILA RQVHGKPLVY LDSAASAQKP EAVLNAMTGL ARTSYANVHR GLHTLANETT QAYENARETV AKFINAEPSE IVWTKSATEA VNLVADTFGR SLKAGDEIVI SEMEHHANIV PWHFLRERYG VVLKFVPVTD DGRLDMEAYK ALLSERTRMV AISHMSNVLG TINDAAEIVR LAHAAGAPVL LDGCQAIVHS KVDVKALDVD FYVFSGHKLY GPTGIGVLYG KAERLVALPP YQGGGEMIGS VSMDRITYTD PPHRFEAGTP AILEAVGLGA AIDWLNGIDR DAGLAHEHAL YQRVVDQLDG VNGVRILGTA PGKGAVLSFV VEGAHAHDVA QVLDRYGVAV RAGTHCAEPL MKRFGVTSSA RASFALYNTL AEADAFVSAL AKARSFFS
|
| |
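The GC content and protein sequence can be recomputed from the gene sequence with a minimal sketch like the one below. It is not the database's own pipeline. It assumes the 10-base groups in the record are separated only by whitespace, and it uses the NCBI translation table 11 amino acid assignments together with that table's alternative initiation codons, so that the GTG at the start of this gene is read as methionine. All function and variable names are hypothetical.

```python
from itertools import product

# Codon table in the NCBI layout (bases ordered T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiation codons of translation table 11; as the first codon they encode Met.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base groups and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate a CDS from its first codon up to (not including) the first stop."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        codon = s[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # alternative start codons are read as Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base groups of the gene sequence in the record.
print(translate("GTGGCTTTCG ACGCGCCCTT CGACGTCGAG"))  # MAFDAPFDVE
```

Passing the full Gene sequence field to `translate` and `gc_percent` gives values that can be compared against the Protein sequence and GC content fields.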