Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_2837 |
Symbol | |
ID | 6410504 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | - |
Start bp | 3088919 |
End bp | 3089956 |
Gene Length | 1038 bp |
Protein Length | 345 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 642712715 |
Product | cysteine synthase A |
Protein accession | YP_001991820 |
Protein GI | 192291215 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0031] Cysteine synthase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.953672 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCATCA AGAACGACGT CGTCGACGCC ATCGGCAACA CGCCCCTGAT CAAATTGAAG CGCGCGTCGG AGGCGACCGG CTGTACCATT CTCGGCAAGG CCGAGTTCAT GAATCCGGGC CAGTCGGTGA AGGATCGGGC TGCGCTGTTC ATCATCCAGG ATGCGGTGAA GCGCGGGACG CTGCGTCCCG GCGGCGTCGT GGTCGAAGGC ACCGCCGGCA ATACCGGCAT CGGGCTGGCG CTGGTCGCCA ATGCACTCGG TTTCCGCACC GTGATCGTGA TCCCGAACAC GCAGAGCCAG GAAAAGAAGG ACATGCTGCG GCTGTGCGGC GCCGAGCTGA TCGAGGTGCC CGCCGTCCCC TACGCCAATC CCAACAACTA CGTGAAGCTG TCTGGCCGTC TCGCCGCGCA GCTTGCCGAA ACCGAACCGA ACGGAGCGAT CTGGGCCAAT CAGTTCGACA ACGTCGCCAA TCGCCAAGCC CATATCGAGA CGACCGCACC GGAAATCTGG AATCAGACCG ACGGCAAGGT CGACGGCTTC GTCGCCGCGG TCGGCTCCGG CGGCACGCTG GCCGGCGTGT CGATCGGCCT CAAGCAGTTC AATCCGAAAG TCCGCGCCGT GCTCGCCGAC CCGTTGGGCT CGGCGCTGTA CAATTACTAC AAGAACGGTG CGCTGAAGTC GGAAGGCTCC TCGATTACCG AAGGCATCGG CCAGGGCCGG GTCACCGCCA ATCTGGAAGG CGCGCAGATC GACGACGCCT ATCAGATCCC CGACGATGAA GCGGTGCCGT TGATCTACGA TCTGCTGGAA CACGAAGGCC TGTGCCTCGG CGGCTCGAGC GGCATCAACG TCGCCGGCGC GATCCGTCTC GCCAAGGATC TCGGCCCCGG CCATACCATT GTGACCATCC TGTGCGACTA CGGCAGCCGC TATCAGTCCA AGCTGTTCAA CCCGGACTTC ATGCGCAGCA AGAACCTGCC GGTGCCGGAC TGGATGGAGA CCAAGAGCAC GATTCAGGTG CCGTTCGAGC AGGCCTAA
|
Protein sequence | MSIKNDVVDA IGNTPLIKLK RASEATGCTI LGKAEFMNPG QSVKDRAALF IIQDAVKRGT LRPGGVVVEG TAGNTGIGLA LVANALGFRT VIVIPNTQSQ EKKDMLRLCG AELIEVPAVP YANPNNYVKL SGRLAAQLAE TEPNGAIWAN QFDNVANRQA HIETTAPEIW NQTDGKVDGF VAAVGSGGTL AGVSIGLKQF NPKVRAVLAD PLGSALYNYY KNGALKSEGS SITEGIGQGR VTANLEGAQI DDAYQIPDDE AVPLIYDLLE HEGLCLGGSS GINVAGAIRL AKDLGPGHTI VTILCDYGSR YQSKLFNPDF MRSKNLPVPD WMETKSTIQV PFEQA
|
| |