Gene RPD_0343 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_0343 
Symbol 
ID4020803 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp396439 
End bp397437 
Gene Length999 bp 
Protein Length332 aa 
Translation table11 
GC content67% 
IMG OID637960522 
Productcysteine synthase A 
Protein accessionYP_567482 
Protein GI91974823 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0031] Cysteine synthase 
TIGRFAM ID[TIGR01136] cysteine synthases
[TIGR01139] cysteine synthase A 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.516891 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.476462 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCAGCG CAGCAACCGC ATCGATCAAG TCCAGCGCGG CCGCGCCCGC CCAACAGCCC 
GGCCGCGGCC GGGTCTATGA TTCGATCGCC GACGCCTATG GCGATACGCC GCTGGTGCGG
CTGAACCGGC TGCCGGAGCA GAACGGCGTC AAGGCGACGA TTCTCGCCAA GCTCGAATAT
TTCAACCCGG CCTCCAGCGT GAAGGATCGC ATCGGCGCGG CGATGATCGC CGCGATGGAG
CGCGAGGGCA TCATCAAGCC CGACACCATC CTGATCGAGC CGACCTCGGG CAACACCGGA
ATCGCGCTGG CTTTCGTCGC CGCCGCAAAG GGCTACCGGC TGAAGCTGGT GATGCCGGAA
TCGATGTCGA TCGAGCGCCG CAAGATGCTG GCGTTCCTCG GCGCCGAGCT GGTGCTGACG
GAAGCCGCCA AGGGGATGAA GGGCGCAATC GCCAAGGCCG AGGAGCTGAT CGCCTCGACC
CCGAATGCGG TGATGCCGCA GCAGTTCAAG AATCTCGCCA ACCCGGAAGT CCACCGCCGC
ACCACCGCGG AGGAAATCTG GAACGACACC CATGGCGGGA TCGACATCTT CGTCGCCGGC
GTCGGCACCG GCGGGACCAT CACGGGCGTC GGCCAGGTGC TGAAGCCGCG CAAGCCGTCG
CTGAAGATCG TCGCGGTCGA GCCGGAGGAG AGTCCGGTGC TGTCCGGCGG CGCGCCCGGT
CCGCACAAGA TCCAGGGTAT CGGCGCCGGC TTCGTGCCGG ACATTCTCGA CCGCGCGGTG
ATCGACGAGG TGATCAAGAT CGCCGGCCCG ACCGCGATCG CGACCTCGCG CGCGCTGGCG
CGGCACGAAG GCATCGCCGG CGGGATCTCG TCAGGCGCCG CGATCGCCGC CGCGATCGAA
CTCGGCAAAC GCCCGGAAAA CGCCGGCAAG ACCATCGTGG CGATCGTGCC GTCGTTCTCG
GAGCGCTATC TGTCGACCGC GCTGTTCGAA GGGGTCTGA
 
Protein sequence
MSSAATASIK SSAAAPAQQP GRGRVYDSIA DAYGDTPLVR LNRLPEQNGV KATILAKLEY 
FNPASSVKDR IGAAMIAAME REGIIKPDTI LIEPTSGNTG IALAFVAAAK GYRLKLVMPE
SMSIERRKML AFLGAELVLT EAAKGMKGAI AKAEELIAST PNAVMPQQFK NLANPEVHRR
TTAEEIWNDT HGGIDIFVAG VGTGGTITGV GQVLKPRKPS LKIVAVEPEE SPVLSGGAPG
PHKIQGIGAG FVPDILDRAV IDEVIKIAGP TAIATSRALA RHEGIAGGIS SGAAIAAAIE
LGKRPENAGK TIVAIVPSFS ERYLSTALFE GV