Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_0199 |
Symbol | |
ID | 3916187 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 205411 |
End bp | 206622 |
Gene Length | 1212 bp |
Protein Length | 403 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640442925 |
Product | SufS subfamily cysteine desulfurase |
Protein accession | YP_495482 |
Protein GI | 87198225 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0520] Selenocysteine lyase |
TIGRFAM ID | [TIGR01979] cysteine desulfurases, SufS subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 42 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGCGTGG ACACGATCAC GCGCGGCGCG CAATGGCCGG GCCTGCTGAA TCCCGACGGC AGCCGCTGGC ACTATCTCGA CACCGCAGCC ACCGCGCAGA AGCCGCAGGT GGTGATCGAC GCGGTAACTC GCGCCCTCGG CGCCGACTAC GCCACCGTCC ATCGCGGCGT CTATGCCCGC TCGGCCGACA TGACGCTCGC CTTCGAGGCC GCGCGCCGCA AGGTGGCGGG GCTGGTCAAC GGCGATGAAG GCGAGATCGT CTTCACGCGC GGCGCAACCG AGGCGATCAA CCTCGTCGCG CAGACCTGGG GCCAGGCAAA CCTCAAGGCG GGCGACCGCA TCCTGCTCTC CACGCTCGAG CATCATTCGA ACATCGTACC TTGGCAGTTG CTGCGCGACC GGACCGGAGT CGAGATCGAC GTCTGCCCGT TGACCGAGGA CGGCCGCATC GACCTTGCCG CGGCGGAGCG CATCCTGACC CCCGCGCACA AGCTTGTCGC TCTTGCCCAT GTGTCGAACG TGCTCGGTTC TGTGCTCGAC GTGGCGCAGG CGGTCCGGCT GGCGCGTGCG GTCGGGGCGA AGATCCTGCT CGACGGCTGC CAGGCGGTGC CGCGCCTCGC TGTCGACGTG AAGGCGATGG ACGCTGATTT CTACGTCTTT TCCGCCCACA AGCTCTATGG CCCGACCGGC ATAGGCGCGC TTTGGGCCAA GGCCGCGATT CTCGACGCCA TGCCGCCGTG GCAGGGCGGC GGGGCGATGA TCGACCGCGT CACTTTCGAG CGCACGACCT ATGCCCCCGC GCCGCAGCGT TTCGAAGCCG GCACCCCGGC CATCGTCGAG GCCATCGGCT TCGGCGCGGC GGTGGACTTC GTGCAGGCAC AGGGCCTCGA TGCGATCCAC GCCCATGAAG TCGCGCTCGT GGCCAAGGCC CGCGAGGCGC TCGGGCGGAT GAACTCCGTC CGCCTGTTCG GGCCCGAGGA CAGCGCCGGC ATCGTCAGCT TCGCCATCGA GGGCGTGCAC CCGCACGATC TCGGCACGAT CCTCGACGAG GAAGGCGTGG CGATCCGTGC CGGGCACCAC TGCGCGCAGC CATTGATGGA CCACCTTGGC GTTCCCGCCA CGGCCCGGGC CAGCTTCGGC ATCTACAGCG ATGAAAGCGA TATCGCCGCC CTCGTGCGCG GCATCGAAAG GACCAAGAGG ATATTCGGAT GA
|
Protein sequence | MSVDTITRGA QWPGLLNPDG SRWHYLDTAA TAQKPQVVID AVTRALGADY ATVHRGVYAR SADMTLAFEA ARRKVAGLVN GDEGEIVFTR GATEAINLVA QTWGQANLKA GDRILLSTLE HHSNIVPWQL LRDRTGVEID VCPLTEDGRI DLAAAERILT PAHKLVALAH VSNVLGSVLD VAQAVRLARA VGAKILLDGC QAVPRLAVDV KAMDADFYVF SAHKLYGPTG IGALWAKAAI LDAMPPWQGG GAMIDRVTFE RTTYAPAPQR FEAGTPAIVE AIGFGAAVDF VQAQGLDAIH AHEVALVAKA REALGRMNSV RLFGPEDSAG IVSFAIEGVH PHDLGTILDE EGVAIRAGHH CAQPLMDHLG VPATARASFG IYSDESDIAA LVRGIERTKR IFG
|
| |