Gene RPB_0477 details


Gene Information

Locus tag: RPB_0477
Symbol:
ID: 3909822
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 525292
End bp: 526290
Gene Length: 999 bp
Protein Length: 332 aa
Translation table: 11
GC content: 67%
IMG OID: 637882364
Product: cysteine synthase
Protein accession: YP_484099
Protein GI: 86747603
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0031] Cysteine synthase
TIGRFAM ID: [TIGR01136] cysteine synthases; [TIGR01139] cysteine synthase A
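
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database record: the function name `cds_lengths` is hypothetical, and it assumes the start and end coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumptions (not stated in the record): coordinates are 1-based and
    inclusive, and the last codon is a stop codon.
    """
    gene_length_bp = end_bp - start_bp + 1
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Subtract one codon for the stop codon, which encodes no residue.
    protein_length_aa = gene_length_bp // 3 - 1
    return gene_length_bp, protein_length_aa


# Values copied from the record: Start bp 525292, End bp 526290.
print(cds_lengths(525292, 526290))  # (999, 332)
```

Under these assumptions the result agrees with the listed Gene Length (999 bp) and Protein Length (332 aa).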


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCAGCG CAGCAACCGC ATCGCTCAAA TCCACCGCCG CAGCGCCCGC CCATCAGCCC 
GGCCGCGGCC GGGTGTATGA TTCGGTCGCC GATGCCTATG GCGACACCCC GTTGGTGCGG
CTGAACCGGC TGCCGGGACT GAACGGCGTC AACGCGACGA TTCTCGCCAA GCTGGAATAT
TTCAACCCGG CCTCCAGCGT GAAGGACCGC ATCGGCGCCG CGATGATCGC CGCGATGGAG
CGCGACGGCA TCATCAAGCC CGGCACCATC CTGATCGAGC CGACCTCCGG CAACACCGGC
ATCGCGCTGG CCTATGTGGC CGCCGCCAAG GGCTATCGGC TCAAGCTGGT GATGCCGGAA
TCGATGTCGA TCGAGCGCCG CAAGATGCTG GCCTTCCTCG GCGCCGAGCT GGTGCTGACC
GAAGCCGCCA AGGGCATGAA GGGCGCCATC GCCAAGGCCG AGGAGCTGAT CGCCTCGACG
CCGAACGCGG TGATGCCGCA GCAGTTCAAG AACCTCGCCA ACCCCGAGGT TCACCGCCGC
ACCACCGCCG AGGAGATCTG GAACGACACC AACGGCGCGA TCGACATTTT CGTCGCCGGC
GTCGGCACCG GCGGCACCAT CACCGGCGTC GGCCAGGTGC TGAAGCCGCG CAAGCCGTCG
GTCAGGATCG TGGCGGTCGA GCCGGAGGAA AGCCCGGTGC TGTCCGGCGG CGCACCCGGC
CCGCACAAGA TCCAGGGCAT CGGCGCCGGC TTCGTGCCGG ACATTCTCGA CCGCTCGGTG
ATCGACGAAA TCATCAAGGT GGCGGGACCG GTTGCGATCG AGACTTCGCG GGCGCTGGCG
CGGCACGAAG GCATTCCGGG CGGCATCTCG TCGGGTGCCG CGATTGCGGC TGCGATCGAA
CTCGGCAAGC GCCCGGAAAA CGCCGGCAAG ACCATCGTGG CGATCGTGCC GTCGTTCTCG
GAGCGCTATC TGTCGACCGC GTTGTTCGAG GGCGTGTAA
 
Protein sequence
MSSAATASLK STAAAPAHQP GRGRVYDSVA DAYGDTPLVR LNRLPGLNGV NATILAKLEY 
FNPASSVKDR IGAAMIAAME RDGIIKPGTI LIEPTSGNTG IALAYVAAAK GYRLKLVMPE
SMSIERRKML AFLGAELVLT EAAKGMKGAI AKAEELIAST PNAVMPQQFK NLANPEVHRR
TTAEEIWNDT NGAIDIFVAG VGTGGTITGV GQVLKPRKPS VRIVAVEPEE SPVLSGGAPG
PHKIQGIGAG FVPDILDRSV IDEIIKVAGP VAIETSRALA RHEGIPGGIS SGAAIAAAIE
LGKRPENAGK TIVAIVPSFS ERYLSTALFE GV
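
The relationship between the gene sequence and the protein sequence can be reproduced with a small translation routine. This is a minimal sketch, not the database's own tooling: the helper names `clean`, `gc_content`, and `translate` are hypothetical. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons), and it assumes the sequence is given on the coding strand, starting at the first base of the start codon.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
# Amino-acid assignments for translation table 11 match the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon.

    Assumption: the start codon here is ATG, so no special handling of
    alternative start codons is needed.
    """
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above.
first_line = "ATGTCCAGCG CAGCAACCGC ATCGCTCAAA TCCACCGCCG CAGCGCCCGC CCATCAGCCC"
print(translate(first_line))  # MSSAATASLKSTAAAPAHQP
```

The 20 residues printed correspond to the first two blocks of the protein sequence. Passing the full gene sequence to `gc_content` gives a value that can be compared with the GC content listed in the record.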