Gene Information |
Locus tag | Gobs_2068 |
Symbol | |
ID | 8753739 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geodermatophilus obscurus DSM 43160 |
Kingdom | Bacteria |
Replicon accession | NC_013757 |
Strand | + |
Start bp | 2151489 |
End bp | 2152820 |
Gene Length | 1332 bp |
Protein Length | 443 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | cysteine desulfurase, SufS subfamily |
Protein accession | YP_003409127 |
Protein GI | 284990573 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.512945 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCCAGA CCGTCTCGCG TCCCGCAGCC GGGGCGCAGA AGGCGCCCCT GCCCCTGGAC GTCGAGGCGA TCCGGGCGGA CTTCCCGATC CTGTCCCGCA CCGTGCGGGA CGGGAAGCGG CTGGTCTACC TGGACTCCGG GGCCACCTCG CAGAAGCCGC TGGGCGTGCT CGACGCGGAG CGCGACTTCT ACCTGACCTG CAACGCAGCG CCGCACCGGG GCGCGCACCA GCTCGCCGAA GAGGCGACCG CCCGCTACGA GGCGGCCCGC GCCACCATCG GCGCGTTCAT CGGTGCGCCG GCCGAGGAGG TCGTGTTCAC CCGCAACAGC ACCGAGGCGA TCAACCTGGT CGCCTACGCG CTGTCGAACG CGGCCACGGC CAAGGAGCCG GAGTTCCGGC GGTACGCCGT CGGCCAGGGC GACGAGATCG TCGTCACCGA GATGGAGCAC CACGCCAACC TCGTGCCCTG GCAGCAGCTC TGCGAGCGCA CCGGCGCCAC CCTGCGCTGG CTCGGGCTGA CCGACGACGG CCGGCTGGAG CTGTCCGACC TCGGCACGGT GGTCAACGAG CGCACCAAGC TGGTCGCGGT CACCCAGCAG TCGAACATCC TCGGCACGAT CAACCCGCTC GGGGAGATCT CCGCGCGGGC CCGCGAGGTC GGCGCCCTGC TGCTGGTCGA CGGCGCGCAG TCGGTGCCGC ACCAGCCGGT CGACGTCACC ACGCTGGGTG CCGACTTCCT GGTGTTCTCC GGCCACAAGA TGCTCGGGCC CACGGGCGTC GGCGTCCTGT GGGGCCGCTA CGAGGTGCTC GACGCGCTGC CGCCCTTCCT CACCGGCGGC TCGATGATCG AGGTCGTGCG CATGGAGGGC AGCACCTTCA TGCCGCCGCC GCAGCGCTTC GAGGCCGGCG TCCCGATGAC CGCCCAGGTG ATCGGGCTCG CCGCCGCCGT CGAGTACCTG CAGCGGCTCG GCATGGACCG CGTGCAGGCG CACGAGGAGG CGCTCACCGG CTACGCGCTG GCGAAGCTGG CCGAGATCCC CGGCGTCACG GTGATCGGCC CGCCCGACAC CGTCGCCCGC GGCGGCGCGG TCTCCTTCAC CGTCGAGGGC ATCCACCCGC ACGACGTCGG CCAGGTCCTC GACGACCTCG GCGTCGAGGT GCGGGTCGGC CACCACTGCG CCTGGCCGGT GGTGCGCCGC TACGGCGTCC CGGCCACGAC GCGGGCCACC TTCTACGTGC ACACCGGCTA CGACGACATC GACGCGCTCG CCGACGCCAT CCGGGAAGCC CAGCGCTTCT TCGGTGTCGC ACCAACCGCG GGGGTCGCCT GA
Protein sequence | MTQTVSRPAA GAQKAPLPLD VEAIRADFPI LSRTVRDGKR LVYLDSGATS QKPLGVLDAE RDFYLTCNAA PHRGAHQLAE EATARYEAAR ATIGAFIGAP AEEVVFTRNS TEAINLVAYA LSNAATAKEP EFRRYAVGQG DEIVVTEMEH HANLVPWQQL CERTGATLRW LGLTDDGRLE LSDLGTVVNE RTKLVAVTQQ SNILGTINPL GEISARAREV GALLLVDGAQ SVPHQPVDVT TLGADFLVFS GHKMLGPTGV GVLWGRYEVL DALPPFLTGG SMIEVVRMEG STFMPPPQRF EAGVPMTAQV IGLAAAVEYL QRLGMDRVQA HEEALTGYAL AKLAEIPGVT VIGPPDTVAR GGAVSFTVEG IHPHDVGQVL DDLGVEVRVG HHCAWPVVRR YGVPATTRAT FYVHTGYDDI DALADAIREA QRFFGVAPTA GVA
| |
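The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only and is not part of the source record. It assumes that Start bp and End bp are 1-based, inclusive coordinates and that the CDS length includes the terminal stop codon (the gene sequence above ends in TGA). The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming its length includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """Percent G+C in a nucleotide string; spaces and other symbols are ignored."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotides found")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


if __name__ == "__main__":
    length = gene_length(2151489, 2152820)
    print(length)                  # matches the Gene Length field (1332)
    print(protein_length(length))  # matches the Protein Length field (443)
    # GC content of the first 10-base block of the gene sequence only
    print(gc_percent("ATGACCCAGA"))
```

Passing the full gene sequence to `gc_percent` gives a value that can be compared against the GC content field. The spaces between the 10-base blocks are skipped by the function.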