Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_2656 |
Symbol | |
ID | 4444777 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 2980556 |
End bp | 2981968 |
Gene Length | 1413 bp |
Protein Length | 470 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 639690476 |
Product | cysteine desulfurase |
Protein accession | YP_832135 |
Protein GI | 116671202 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0520] Selenocysteine lyase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTACTG CCACTTTCCC CGCAAACACC GCAACCACGT CCACTGCAGG CCTTCTGGAC CCGCGGTTCA CCGTCGCCCA GCGCCCCCTC TCCGCCGTTA CCGGCGCGGA GATCCAGGCT CCGCTGATCC AGGGCGGCCA CGTCCGCTAC GCGAACCTGG ACTACGGCGC ATCGGCCCCG GCGCTGTCCG TTGTCTCGGC CTACCTCAAC GAGATCCTGC CGTACTACGC CAGCGTGCAC CGCGGCGCGG GCTATGCCTC GCAGATCAGC ACGTCGGTGT ACGAAAATGC CCGGGACATC GTCCGCGGGT TCGTGGGCGG CAGGGCGGAC GATTCCGTCA TCTTCACCCG GAACACCACG GACTCACTGA ACCTGCTGGC CGGATGCCTG CCGGTGGCCG ACGGCCGCCA CATCGGCGAA GTGCTGTACC TCGACATCGA ACACCACGCC AACCTGCTCC CCTGGCAGGG CGTCCCGCAC CGGAGTGTTG TGGCCGCCCC GACGCTCAGG GCCACCCTGG AGAACCTGCG TTCCGAGCTC CGGCACGGTG ACATCAGCCT GCTGGCCGTT ACGGGCGCCT CCAACGTCAC CGGCGAAATC CTCCCCATCC GCGAGCTGGC CGCACTGGCT CACGAACACG GCGCAAGGAT CGTTGTGGAC GCGGCGCAGC TGGCCCCGCA CCGCCGCATC GACATCGCCG CGGACGACGT CGACTACCTC GCCTTCTCAG GCCACAAGCT GTACGCGCCG TTTGGCAGCG GAGTCCTCGT GGGCCGGCCG GACTGGCTCG ACGCCGGGAC GCCCCACCTC GCAGGCGGCG GAGCCGTACG TGAGGCGCGG CTGGACTCCG TTAGCTGGAC CACCGGGCCG GCACGTCATG AAGGCGGCTC ACCGAACGTC CTCGGGGCCG CCACCCTGGC CCGGGCCACC CAGGTTATCG GCGCCCTGGA CCAGGAACTG TGGCACGCCC ATGAGACGGC CATCCGGTCC TTCCTCGTTG AGGGACTGCG GAAGATCGAC GGCGTCGCTG TCCACCAGAT CTTCAGCGAC ACGGATGACA CCATCGGGGT GGTCAACTTC TCCGTCGCCG GCTACGACGC CGGCCTCGTC GCGGCCTACC TGTCCGCCGA ACATGGCGTG GGCCTCCGGG ACGGCCGCTT CTGCGCCCAC CCGCTGCTGA AGCGGCTGGG ACTGCCGTCA GGCTCATTGC GGGCAAGCTT CGGCGTCGGC TCCCGCTTGG AGGATGCCCA GCGCCTGCTC GCCGGGATCG AGGAACTTCG CCGGACCGGC CTCGGCTGGG ACTATGTCGT GGACTCGGGC CGCTGGGTGC CTGCCAACGA CACCCGCAGC TACCCGGACT GGGCACCCAA CACTCCAGGC ACCGCCGGCG CAGCGCCCTG CACCGAAGAC TGA
|
Protein sequence | MTTATFPANT ATTSTAGLLD PRFTVAQRPL SAVTGAEIQA PLIQGGHVRY ANLDYGASAP ALSVVSAYLN EILPYYASVH RGAGYASQIS TSVYENARDI VRGFVGGRAD DSVIFTRNTT DSLNLLAGCL PVADGRHIGE VLYLDIEHHA NLLPWQGVPH RSVVAAPTLR ATLENLRSEL RHGDISLLAV TGASNVTGEI LPIRELAALA HEHGARIVVD AAQLAPHRRI DIAADDVDYL AFSGHKLYAP FGSGVLVGRP DWLDAGTPHL AGGGAVREAR LDSVSWTTGP ARHEGGSPNV LGAATLARAT QVIGALDQEL WHAHETAIRS FLVEGLRKID GVAVHQIFSD TDDTIGVVNF SVAGYDAGLV AAYLSAEHGV GLRDGRFCAH PLLKRLGLPS GSLRASFGVG SRLEDAQRLL AGIEELRRTG LGWDYVVDSG RWVPANDTRS YPDWAPNTPG TAGAAPCTED
|
| |