Gene Information |
Locus tag | RPB_2990 |
Symbol | |
ID | 3910789 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 3403472 |
End bp | 3404758 |
Gene Length | 1287 bp |
Protein Length | 428 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637884896 |
Product | SufS subfamily cysteine desulfurase |
Protein accession | YP_486603 |
Protein GI | 86750107 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0520] Selenocysteine lyase |
TIGRFAM ID | [TIGR01979] cysteine desulfurases, SufS subfamily |
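The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers introduced only for illustration.

```python
# Values copied from the record above.
START_BP = 3403472
END_BP = 3404758


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    n = gene_length(START_BP, END_BP)
    print(n, protein_length(n))  # 1287 428
```

Under these assumptions the result agrees with the Gene Length (1287 bp) and Protein Length (428 aa) fields listed in the record.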
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.398941 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.104458 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGCACT GCTGCGATCG CTCTCGAGTT GGAAGGCCTG ACATGGCGCA TCCCGCGGTT TCGAATGGGT CCTACGACGT CGCCAAAGTC CGCGAGGATT TTCCGGCGCT GGCGCTGAAG GTTTACGGCA AGGATCTGGT GTATCTCGAC AACGCCGCCT CGGCGCAGAA GCCGCGCGCC GTGCTGGAGC GGATGACCAA GGCGTATGAG AGCGAATACG CCAATGTGCA TCGCGGGCTG CATTATCTCG CCAACGCGGC GACCGAAGCC TATGAGGGCG GTCGCACCCG CGTGCAGCAT TTCCTCAACG CCAAGCGGCC GGAAGAGATC ATCTTCACCC GCAACGCCAC CGAGGCGATC AATCTGGTGG CGTCGTCGTT CGGCGCGCCG AATATCGGCG AGGGCGACGA GATCGTGCTC TCGATCATGG AGCACCATTC CAACATCGTG CCGTGGCACT TCTTGCGCGA ACGTCAGGGT GCTGTTCTCA AATGGGCGCC GGTCGACGAC GACGGCAATT TCCTGATCGA CGAATTCGAG AAGCTGCTGT CGCCGAAGAC CAAGCTGGTC GCGATCACGC AGATGTCGAA CGCGCTCGGC ACCATCGTGC CGGTGAAAGA GGTGGTGAAG CTGGCGCACG ACCGCGGCAT TCCGGTGCTG GTCGACGGCA GCCAGGGCGC GGTGCATCTC ACCATCGACG TCCAGGACAT CGACTGCGAT TTCTACATCA TGACCGGCCA CAAGCTGTAC GGCCCGACCG GGATCGGCGT GCTGTACGGC AAATACGACG TCCTCGCCAA GATGCGGCCG TTCAACGGCG GCGGCGAGAT GATTCGCGAA GTGGCGCAGG ACTGGGTGAC CTACGGCGAC CCGCCGCACC GGTTCGAGGC CGGCACCCCG GCGATCGTCG AGGCGGTCGG GCTCGGGGCG GCGATCGACT ACGTCAATTC GATCGGCAAG GAGCGCATCG CCGCGCACGA ACACGATCTT TTGACGTATG CGGAACAGCG ATTGCGCGAG ATCAATTCGC TGCGCATCAT CGGCACCGCC AAGGGCAAGG GGCCGGTGAT TTCCTTCGAG ATGAAGGGCG CGCACCCGCA CGACATCGCC ACCGTGATCG ACCGCCAGGG CATCGCGGTG CGGGCGGGAA CCCATTGCGT GATGCCGTTG CTGGAGCGGT TCCAGGTCAC GGCGACGTGC CGGGCGTCGT TCGGCATGTA TAATACCCGT GAGGAAGTCG ACCAACTCGC TAATGCGCTG ATCAAGGCGC GGGACCTGTT CGCATGA
|
Protein sequence | MRHCCDRSRV GRPDMAHPAV SNGSYDVAKV REDFPALALK VYGKDLVYLD NAASAQKPRA VLERMTKAYE SEYANVHRGL HYLANAATEA YEGGRTRVQH FLNAKRPEEI IFTRNATEAI NLVASSFGAP NIGEGDEIVL SIMEHHSNIV PWHFLRERQG AVLKWAPVDD DGNFLIDEFE KLLSPKTKLV AITQMSNALG TIVPVKEVVK LAHDRGIPVL VDGSQGAVHL TIDVQDIDCD FYIMTGHKLY GPTGIGVLYG KYDVLAKMRP FNGGGEMIRE VAQDWVTYGD PPHRFEAGTP AIVEAVGLGA AIDYVNSIGK ERIAAHEHDL LTYAEQRLRE INSLRIIGTA KGKGPVISFE MKGAHPHDIA TVIDRQGIAV RAGTHCVMPL LERFQVTATC RASFGMYNTR EEVDQLANAL IKARDLFA
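The sequence fields are printed in space-separated blocks of ten characters. As a rough sketch of how they might be processed, the code below strips the spacing, computes a GC fraction, and translates codons up to the first stop. It assumes the gene sequence is already given in coding orientation (it begins with ATG and ends with TGA, even though the gene lies on the minus strand). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical. The codon assignments are those of the standard genetic code, which translation table 11 shares; table 11 differs in which start codons it permits, and that is not modeled here.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard code; table 11
# uses the same codon-to-amino-acid assignments).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_field: str) -> str:
    """Remove the block spacing used in the record and upper-case."""
    return "".join(seq_field.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA string."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate complete codons from position 0, stopping at a stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three 10-base blocks of the gene sequence field above.
    prefix = clean("ATGCGGCACT GCTGCGATCG CTCTCGAGTT")
    print(translate(prefix))  # MRHCCDRSRV
```

The translated prefix matches the first ten residues of the protein sequence field. Passing the full cleaned gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.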