Gene Information |
Locus tag | RPC_3639 |
Symbol | |
ID | 3970654 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | - |
Start bp | 4047710 |
End bp | 4048687 |
Gene Length | 978 bp |
Protein Length | 325 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637926747 |
Product | cysteine synthase A |
Protein accession | YP_533493 |
Protein GI | 90425123 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0031] Cysteine synthase |
TIGRFAM ID | [TIGR01136] cysteine synthases; [TIGR01139] cysteine synthase A |
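The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Protein length in aa, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Gene Information table for RPC_3639.
length_bp = gene_length(4047710, 4048687)
print(length_bp, protein_length(length_bp))
```

Under these assumptions the computed values agree with the Gene Length (978 bp) and Protein Length (325 aa) rows.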
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0668301 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.194292 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCACGCT GGTTCGAAGA TAACGCGCAG TCGATCGGCC ACACCCCGCT GATCCGTCTC AACCGCATCA CCGACGGTGC GCCGGCCACC GTGCTGGCCA AGATCGAAGG CCGCAACCCG GCCTATTCGG TGAAATGCCG GATCGGCGCG GCGATGATCG AGGACGCCGA GAAGCGTGGC CTGCTCGGCC CCGGCAAGGA GATCGTCGAG CCGACCTCCG GCAACACCGG CATCGCGCTG GCCTTCGTCG CCGCCGCCAA GGGCATTCCG TTGACCCTGA CCATGCCGGC GACGATGAGT CTGGAGCGGC GCAAGCTGTT GATCGCGTTC GGCGCCAAGC TGGTGCTGAC CGAAGGCCCG AAGGGCATGG CCGGCGCGGT CGCCAAGGCG GAAGAAATCG TCGCCTCCGA TCCGAACCGC TACGTGCTGC TGCAGCAGTT CAAGAACCCG GCCAATCCGG CGATCCACGA AAAGACCACC GGCCCGGAAA TTTGGAACGA TACCGACGGT GCGGTCGATA TCTTCGTCGC CGGCGTCGGC ACCGGCGGCA CCATCACCGG GGTGTCGCGC TACATCAAGA CCACCAAGGG CAAGCCGATC CTGTCGGTCG CGGTGGAGCC CTCGGCCAGC CCGATCCTCA GCCAGAAGGT CGCCGGCGAG GCGCTGAAGC CCGGCCCGCA CAAGATCCAG GGCATCGGTG CCGGCTTCGT GCCGGACGTG CTCGATCTGT CGTTGATCGA TGCCATCGAG CAGGTCGCGA ACGACGAAGC CGTGGAGTAT GCCCGCCGGC TGGCCAGCGA GGAGGGCATC CTGTCCGGGA TCTCCAGCGG GGCTGCAGTG GCGGCCGCGG TGCGGCTCGC CAAGAAACCG GAAAACGCCG GCAAGACCAT CGTGGTGATC CTGCCCGACT CCGGCGAGCG CTATCTGTCG TCGGTGCTGT TCGAGGGCCT GTTCGACGCC CAGGTGCTGG CTGCATGA
Protein sequence | MSRWFEDNAQ SIGHTPLIRL NRITDGAPAT VLAKIEGRNP AYSVKCRIGA AMIEDAEKRG LLGPGKEIVE PTSGNTGIAL AFVAAAKGIP LTLTMPATMS LERRKLLIAF GAKLVLTEGP KGMAGAVAKA EEIVASDPNR YVLLQQFKNP ANPAIHEKTT GPEIWNDTDG AVDIFVAGVG TGGTITGVSR YIKTTKGKPI LSVAVEPSAS PILSQKVAGE ALKPGPHKIQ GIGAGFVPDV LDLSLIDAIE QVANDEAVEY ARRLASEEGI LSGISSGAAV AAAVRLAKKP ENAGKTIVVI LPDSGERYLS SVLFEGLFDA QVLAA
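The relationship between the two sequence rows can be sketched as a codon-by-codon translation. This is a minimal illustration, not the database's own pipeline: the helper name `translate` is hypothetical, and the codon table is written out by hand. It relies on translation table 11 assigning the same amino acids to codons as the standard code (the two differ in permitted start codons). Although the gene lies on the minus strand, the gene sequence shown starts with ATG and ends with TGA, so the sketch assumes it is already given in coding orientation and needs no reverse complement.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# Excerpts copied from the Gene sequence row: the first 30 and last 18 bases.
print(translate("ATGTCACGCT GGTTCGAAGA TAACGCGCAG"))
print(translate("CAGGTGCTGG CTGCATGA"))
```

The two excerpts correspond to the first ten and last five residues of the Protein sequence row.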