Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPD_0343 |
Symbol | |
ID | 4020803 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | + |
Start bp | 396439 |
End bp | 397437 |
Gene Length | 999 bp |
Protein Length | 332 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637960522 |
Product | cysteine synthase A |
Protein accession | YP_567482 |
Protein GI | 91974823 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0031] Cysteine synthase |
TIGRFAM ID | [TIGR01136] cysteine synthases [TIGR01139] cysteine synthase A |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.516891 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.476462 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCAGCG CAGCAACCGC ATCGATCAAG TCCAGCGCGG CCGCGCCCGC CCAACAGCCC GGCCGCGGCC GGGTCTATGA TTCGATCGCC GACGCCTATG GCGATACGCC GCTGGTGCGG CTGAACCGGC TGCCGGAGCA GAACGGCGTC AAGGCGACGA TTCTCGCCAA GCTCGAATAT TTCAACCCGG CCTCCAGCGT GAAGGATCGC ATCGGCGCGG CGATGATCGC CGCGATGGAG CGCGAGGGCA TCATCAAGCC CGACACCATC CTGATCGAGC CGACCTCGGG CAACACCGGA ATCGCGCTGG CTTTCGTCGC CGCCGCAAAG GGCTACCGGC TGAAGCTGGT GATGCCGGAA TCGATGTCGA TCGAGCGCCG CAAGATGCTG GCGTTCCTCG GCGCCGAGCT GGTGCTGACG GAAGCCGCCA AGGGGATGAA GGGCGCAATC GCCAAGGCCG AGGAGCTGAT CGCCTCGACC CCGAATGCGG TGATGCCGCA GCAGTTCAAG AATCTCGCCA ACCCGGAAGT CCACCGCCGC ACCACCGCGG AGGAAATCTG GAACGACACC CATGGCGGGA TCGACATCTT CGTCGCCGGC GTCGGCACCG GCGGGACCAT CACGGGCGTC GGCCAGGTGC TGAAGCCGCG CAAGCCGTCG CTGAAGATCG TCGCGGTCGA GCCGGAGGAG AGTCCGGTGC TGTCCGGCGG CGCGCCCGGT CCGCACAAGA TCCAGGGTAT CGGCGCCGGC TTCGTGCCGG ACATTCTCGA CCGCGCGGTG ATCGACGAGG TGATCAAGAT CGCCGGCCCG ACCGCGATCG CGACCTCGCG CGCGCTGGCG CGGCACGAAG GCATCGCCGG CGGGATCTCG TCAGGCGCCG CGATCGCCGC CGCGATCGAA CTCGGCAAAC GCCCGGAAAA CGCCGGCAAG ACCATCGTGG CGATCGTGCC GTCGTTCTCG GAGCGCTATC TGTCGACCGC GCTGTTCGAA GGGGTCTGA
|
Protein sequence | MSSAATASIK SSAAAPAQQP GRGRVYDSIA DAYGDTPLVR LNRLPEQNGV KATILAKLEY FNPASSVKDR IGAAMIAAME REGIIKPDTI LIEPTSGNTG IALAFVAAAK GYRLKLVMPE SMSIERRKML AFLGAELVLT EAAKGMKGAI AKAEELIAST PNAVMPQQFK NLANPEVHRR TTAEEIWNDT HGGIDIFVAG VGTGGTITGV GQVLKPRKPS LKIVAVEPEE SPVLSGGAPG PHKIQGIGAG FVPDILDRAV IDEVIKIAGP TAIATSRALA RHEGIAGGIS SGAAIAAAIE LGKRPENAGK TIVAIVPSFS ERYLSTALFE GV
|
| |