Gene Rpal_2837 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_2837 
Symbol 
ID6410504 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp3088919 
End bp3089956 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content63% 
IMG OID642712715 
Productcysteine synthase A 
Protein accessionYP_001991820 
Protein GI192291215 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0031] Cysteine synthase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.953672 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCATCA AGAACGACGT CGTCGACGCC ATCGGCAACA CGCCCCTGAT CAAATTGAAG 
CGCGCGTCGG AGGCGACCGG CTGTACCATT CTCGGCAAGG CCGAGTTCAT GAATCCGGGC
CAGTCGGTGA AGGATCGGGC TGCGCTGTTC ATCATCCAGG ATGCGGTGAA GCGCGGGACG
CTGCGTCCCG GCGGCGTCGT GGTCGAAGGC ACCGCCGGCA ATACCGGCAT CGGGCTGGCG
CTGGTCGCCA ATGCACTCGG TTTCCGCACC GTGATCGTGA TCCCGAACAC GCAGAGCCAG
GAAAAGAAGG ACATGCTGCG GCTGTGCGGC GCCGAGCTGA TCGAGGTGCC CGCCGTCCCC
TACGCCAATC CCAACAACTA CGTGAAGCTG TCTGGCCGTC TCGCCGCGCA GCTTGCCGAA
ACCGAACCGA ACGGAGCGAT CTGGGCCAAT CAGTTCGACA ACGTCGCCAA TCGCCAAGCC
CATATCGAGA CGACCGCACC GGAAATCTGG AATCAGACCG ACGGCAAGGT CGACGGCTTC
GTCGCCGCGG TCGGCTCCGG CGGCACGCTG GCCGGCGTGT CGATCGGCCT CAAGCAGTTC
AATCCGAAAG TCCGCGCCGT GCTCGCCGAC CCGTTGGGCT CGGCGCTGTA CAATTACTAC
AAGAACGGTG CGCTGAAGTC GGAAGGCTCC TCGATTACCG AAGGCATCGG CCAGGGCCGG
GTCACCGCCA ATCTGGAAGG CGCGCAGATC GACGACGCCT ATCAGATCCC CGACGATGAA
GCGGTGCCGT TGATCTACGA TCTGCTGGAA CACGAAGGCC TGTGCCTCGG CGGCTCGAGC
GGCATCAACG TCGCCGGCGC GATCCGTCTC GCCAAGGATC TCGGCCCCGG CCATACCATT
GTGACCATCC TGTGCGACTA CGGCAGCCGC TATCAGTCCA AGCTGTTCAA CCCGGACTTC
ATGCGCAGCA AGAACCTGCC GGTGCCGGAC TGGATGGAGA CCAAGAGCAC GATTCAGGTG
CCGTTCGAGC AGGCCTAA
 
Protein sequence
MSIKNDVVDA IGNTPLIKLK RASEATGCTI LGKAEFMNPG QSVKDRAALF IIQDAVKRGT 
LRPGGVVVEG TAGNTGIGLA LVANALGFRT VIVIPNTQSQ EKKDMLRLCG AELIEVPAVP
YANPNNYVKL SGRLAAQLAE TEPNGAIWAN QFDNVANRQA HIETTAPEIW NQTDGKVDGF
VAAVGSGGTL AGVSIGLKQF NPKVRAVLAD PLGSALYNYY KNGALKSEGS SITEGIGQGR
VTANLEGAQI DDAYQIPDDE AVPLIYDLLE HEGLCLGGSS GINVAGAIRL AKDLGPGHTI
VTILCDYGSR YQSKLFNPDF MRSKNLPVPD WMETKSTIQV PFEQA