Gene Information |
Locus tag | RPB_0477 |
Symbol | |
ID | 3909822 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 525292 |
End bp | 526290 |
Gene Length | 999 bp |
Protein Length | 332 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637882364 |
Product | cysteine synthase |
Protein accession | YP_484099 |
Protein GI | 86747603 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0031] Cysteine synthase |
TIGRFAM ID | [TIGR01136] cysteine synthases; [TIGR01139] cysteine synthase A |
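The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the gene length includes the stop codon. The record does not state either convention. The function names are hypothetical and used only for illustration.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    return end - start + 1


def expected_protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming the stop codon is
    counted in the gene length but not in the protein."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(525292, 526290)
print(length)                              # 999
print(expected_protein_length_aa(length))  # 332
```

Under these assumptions the computed values agree with the Gene Length (999 bp) and Protein Length (332 aa) rows.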
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCAGCG CAGCAACCGC ATCGCTCAAA TCCACCGCCG CAGCGCCCGC CCATCAGCCC GGCCGCGGCC GGGTGTATGA TTCGGTCGCC GATGCCTATG GCGACACCCC GTTGGTGCGG CTGAACCGGC TGCCGGGACT GAACGGCGTC AACGCGACGA TTCTCGCCAA GCTGGAATAT TTCAACCCGG CCTCCAGCGT GAAGGACCGC ATCGGCGCCG CGATGATCGC CGCGATGGAG CGCGACGGCA TCATCAAGCC CGGCACCATC CTGATCGAGC CGACCTCCGG CAACACCGGC ATCGCGCTGG CCTATGTGGC CGCCGCCAAG GGCTATCGGC TCAAGCTGGT GATGCCGGAA TCGATGTCGA TCGAGCGCCG CAAGATGCTG GCCTTCCTCG GCGCCGAGCT GGTGCTGACC GAAGCCGCCA AGGGCATGAA GGGCGCCATC GCCAAGGCCG AGGAGCTGAT CGCCTCGACG CCGAACGCGG TGATGCCGCA GCAGTTCAAG AACCTCGCCA ACCCCGAGGT TCACCGCCGC ACCACCGCCG AGGAGATCTG GAACGACACC AACGGCGCGA TCGACATTTT CGTCGCCGGC GTCGGCACCG GCGGCACCAT CACCGGCGTC GGCCAGGTGC TGAAGCCGCG CAAGCCGTCG GTCAGGATCG TGGCGGTCGA GCCGGAGGAA AGCCCGGTGC TGTCCGGCGG CGCACCCGGC CCGCACAAGA TCCAGGGCAT CGGCGCCGGC TTCGTGCCGG ACATTCTCGA CCGCTCGGTG ATCGACGAAA TCATCAAGGT GGCGGGACCG GTTGCGATCG AGACTTCGCG GGCGCTGGCG CGGCACGAAG GCATTCCGGG CGGCATCTCG TCGGGTGCCG CGATTGCGGC TGCGATCGAA CTCGGCAAGC GCCCGGAAAA CGCCGGCAAG ACCATCGTGG CGATCGTGCC GTCGTTCTCG GAGCGCTATC TGTCGACCGC GTTGTTCGAG GGCGTGTAA
|
Protein sequence | MSSAATASLK STAAAPAHQP GRGRVYDSVA DAYGDTPLVR LNRLPGLNGV NATILAKLEY FNPASSVKDR IGAAMIAAME RDGIIKPGTI LIEPTSGNTG IALAYVAAAK GYRLKLVMPE SMSIERRKML AFLGAELVLT EAAKGMKGAI AKAEELIAST PNAVMPQQFK NLANPEVHRR TTAEEIWNDT NGAIDIFVAG VGTGGTITGV GQVLKPRKPS VRIVAVEPEE SPVLSGGAPG PHKIQGIGAG FVPDILDRSV IDEIIKVAGP VAIETSRALA RHEGIPGGIS SGAAIAAAIE LGKRPENAGK TIVAIVPSFS ERYLSTALFE GV
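The displayed sequences are grouped in blocks of ten characters, so they need cleaning before use. The following is a minimal sketch of how the record could be checked for internal consistency: strip the spacing, compute the GC fraction, and translate the gene sequence. It assumes the gene sequence is already given in coding orientation, because it begins with ATG and ends with TAA even though the gene is on the minus strand. It also uses the standard codon assignments, which translation table 11 shares with table 1 for elongation codons. All function names are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI "TCAG" order; translation table 11
# uses the same amino-acid assignments (it differs in permitted start codons).
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(_BASES, repeat=3), _AMINO)}


def clean_sequence(raw: str) -> str:
    """Remove the block spacing used in the record and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    A trailing incomplete codon is ignored; the stop is not emitted.
    """
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First two blocks of the gene sequence shown above.
fragment = clean_sequence("ATGTCCAGCG CAGCAACCGC")
print(translate(fragment))  # MSSAAT
```

Applied to the full gene sequence, the output of `translate` could be compared with the cleaned protein sequence, and `gc_fraction` with the reported GC content of 67%.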