Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPD_3227 |
Symbol | |
ID | 4023734 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | - |
Start bp | 3581450 |
End bp | 3583438 |
Gene Length | 1989 bp |
Protein Length | 662 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 637963429 |
Product | SufS subfamily cysteine desulfurase |
Protein accession | YP_570353 |
Protein GI | 91977694 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0520] Selenocysteine lyase |
TIGRFAM ID | [TIGR01979] cysteine desulfurases, SufS subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.545837 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGCGC CTGAGTCCCC GGCGCCGCAT CCCGGCTCGG CGGTGGCGGG CTCGCAGTTC GGCCTGCCGG ACGCCGGGAC GCTCTCGCCT CCCGCGACTG CCGCTCCGCT CGCGCCGGAT GTCGCCGTGA TCGCGCAACT CGCCAATGCG TTCTTTGCGG CTTTGCCGAA CGGCCCGGCG CCGGAGCCGG GCGCGGCGCT CGGCTCGGCG CCGCTGTTCG TGGCCGAACC GTTGCAAAAC GCCATTCCGG GCACGCCGGC GCTGACGCCG TCCTACAAGC CGAACCACAA TCCCGGCGCG GCCTCGCCGA TTCCCACGGT CGGCGCGGCG CGCCCCTCTG CGCCGATCTT CGCGCTCGAC CCCGGGTTGA CGCCGGAGCC CGCTGCGGCC CGCGCGCCTG CTCAGCCGCA GGGTCTGACC GTGCCGCTGG TGACGACTCC GGTCGCGCCG CCGGCGCAGC CATCGCCGCC GTCGGCGCCG TCATTTACGC GTGACACCGA CCTTGCGGCC TTGCCCGGCC GGCTCGATGA CACGCGCAGC CTGGCGCCGC GCTACGACGC GCCAAACGCC GGACCGTTCG GCGGCAGCTT CGCCGGCGCG TCGGAGCCGC TGTATTTTCT TGCCGGCAAT CCGGCGCTGG CGCGCGCGAC GCCGGCCGTG GCGCCGCCAT CGCCGCCGCG CGTGGAGACC CTCGATCTCG GCGCGTTGGC TGCGCAGCAT CGCACCGACG TCGCCGTCGG CGATCTGCCC GGCCTGCGCG CGTTCGACGC TAATCTGTTT CGTCGCGATT TCCCGATCCT GCGGGAAACC GTCAACGGCC GGCCGCTGAT CTGGCTCGAC AACGGCGCGA CGACGCAGAA GCCGCAATGC GTGATCGATC GCCTTGCGTA TTTCTACGCC CACGAAAATT CCAACATCCA TCGCGCTGCG CATACGCTCG CGGCGCGTTC CACCGACGCC TATGAGGCGG CGCGCGACAA GGTCCGCGCC TTCATCAACG CGCCGCAGGT CGCCGACATC GTGTTCGTGC GCGGCGCCAC CGAGGCGATC AATCTGGTTG CTCAGGCCTG GGGCCGGCGC AATGTCAGTG AGGGCGACGA GATCGTCGTC AGCCATCTCG AGCACCACGC CAATATCGTG CCGTGGCAGC AGCTCGCGGC CGAGAAGGGC GCGCGGCTGC GCGTCGCCCC GGTCGACGAT CACGGCCAGA TCATCCTTGA AGAGTACGAG AAGCTTCTCA ATCCGCGCAC CCGCATCGTC GCCTTCACGC AAGTATCGAA TGCGCTCGGC ACGGTGACGC CGGTCGCCGA GATGACGGCG CTGGCGCATC GCCACGGCGC CAAGGTGCTG GTCGACGGCG CCCAGGGCGT CTGTCATATG CCGGTCGACG TGCAGGCGCT GGACGTCGAT TTCTACGCCT TCTCCGGCCA CAAGATGTTC GCGCCGACCG GCATCGGCGT GCTGTACGGC AAGGCCGATG TGCTGGAAGC GATGCCGCCG TGGCAGGGCG GCGGCAATAT GATCGCCGAC GTCACCTTCG AGAAGACGGT GTTTCAAGGG GCCCCGGACC GGTTCGAGGC CGGCACCGGC AACATCGCCG ACGCCGTCGG CCTCGGCGCC GCGATCGACT ATCTCAGCCG CATCGGCATG GCGAACATCG CCGCGCATGA GCACGAGCTG CTGGCCTACG GCACCCAGGC GCTGCTCGCC GTGCCGGGCC TGAAGCTGAT CGGCACCGCG CGCGAGAAAG CCGGCATCCT GTCATTCGTG CTCGATGGCT GCCGCAGTGA AGATGTCGGC CGTGCGCTCG ATCGCGAAGG CATCGCGGTG CGGGCCGGGC ATCATTGCGC CCAGCCGATC CTGCGCCGGT TCGGCCTCGA GAGCACGGTG CGGCCTTCGC TCGCGCTCTA CAACACCACC GCGGACATCG ATGCTCTGGT CGATGCGCTG AAGCGTCTGC AGAGCGGTCG GGGCGTTCAC TGGAGCTGA
|
Protein sequence | MSAPESPAPH PGSAVAGSQF GLPDAGTLSP PATAAPLAPD VAVIAQLANA FFAALPNGPA PEPGAALGSA PLFVAEPLQN AIPGTPALTP SYKPNHNPGA ASPIPTVGAA RPSAPIFALD PGLTPEPAAA RAPAQPQGLT VPLVTTPVAP PAQPSPPSAP SFTRDTDLAA LPGRLDDTRS LAPRYDAPNA GPFGGSFAGA SEPLYFLAGN PALARATPAV APPSPPRVET LDLGALAAQH RTDVAVGDLP GLRAFDANLF RRDFPILRET VNGRPLIWLD NGATTQKPQC VIDRLAYFYA HENSNIHRAA HTLAARSTDA YEAARDKVRA FINAPQVADI VFVRGATEAI NLVAQAWGRR NVSEGDEIVV SHLEHHANIV PWQQLAAEKG ARLRVAPVDD HGQIILEEYE KLLNPRTRIV AFTQVSNALG TVTPVAEMTA LAHRHGAKVL VDGAQGVCHM PVDVQALDVD FYAFSGHKMF APTGIGVLYG KADVLEAMPP WQGGGNMIAD VTFEKTVFQG APDRFEAGTG NIADAVGLGA AIDYLSRIGM ANIAAHEHEL LAYGTQALLA VPGLKLIGTA REKAGILSFV LDGCRSEDVG RALDREGIAV RAGHHCAQPI LRRFGLESTV RPSLALYNTT ADIDALVDAL KRLQSGRGVH WS
|
| |