Gene Rsph17029_2771 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_2771 
Symbol 
ID4897881 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp2913406 
End bp2914482 
Gene Length1077 bp 
Protein Length358 aa 
Translation table11 
GC content69% 
IMG OID640113373 
Productcysteine synthase A 
Protein accessionYP_001044645 
Protein GI126463531 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0031] Cysteine synthase 
TIGRFAM ID[TIGR01136] cysteine synthases
[TIGR01139] cysteine synthase A 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATGCTC AGAAGATCCG GACGACCGAG GGCCGCGGCA GGCTCTACGA CAGCGTGCTC 
GACACGGTGG GCAACACGCC CGTCATCCGC ATCAACAACC TCTCGCCCGA AGGCGTGACG
ATCTACGTCA AGGCCGAGTT CTTCAACCCG GCGGCCTCGG TCAAGGACCG GCTCGCGCTG
AACATCATCG AGGCGGCGGA ACGGTCGGGC AAGCTCAAGC CCGGCATGAC CGTCGTCGAG
GCGACCTCGG GCAACACCGG CATCGGGCTC GCCATGGTCT GCGCCCAGAA GGGCTATCCG
CTGGTCATCA CCATGTCCGA GGCCTTCTCG GTCGAGCGGC GGCGGCTGAT GCGGCTTCTG
GGCGCGAAGG TCGTCCTGAC CCCGCGCGGC GGCAAGGGCT TCGGCATGTA TCGCAAGGCG
CAGGAGCTGG CCGAGGCGAA CGGCTGGTTC CTCGCGAGCC AGTTCGAGAC CGACGCCAAT
GCCGACATCC ACGAGGCCAC CACCGCGCGC GAGATCGTGG CGGATTTCGC GGGCGAGCGG
CTCGACTGGT TCGTGACCGG CTACGGCACC GGGGGCACGG TCACCGGCGT CGCGCGGGTG
CTGCGCCGCG AGCGACCGGA GGTGAAGATC GTGCTCTCCG AGCCTGCGAA TGCGCAGCTC
GTGGCCTCGG GCGTGCCGCA GGACCGCAAC GCCGACGGCA CCGCAGCCTC GGGCCACCCG
GCCTTCGAGG CGCATCCGAT CCAGGGCTGG ACGCCCGACT TCATCCCGAA GGTGCTTCAG
GAGGGGCTCG ACGCCGGGGC CTATGACGAG CTGATCCCGG TTGCGGGCGA GGACGGGATG
AAATGGGCGC GCGAGCTGGC GGCCAAAGAG GGCATCCTCA CCGGCGTCTC GGGCGGCTCG
ACCTTCGCGG TGGCGCGGCA GGTGGCCGAA CGGGCGCCGA AGGGCTCGGT GATCCTCGCG
ATGCTGCCCG ACACGGGCGA GCGCTACATG TCGACCCCGC TCTTCCAGGC CATCGGCGAG
GACATGAACG AGGAGGAGAA GGCGCTCTCG GCCTCGACGC CGAGCTTCCA GCTCTGA
 
Protein sequence
MDAQKIRTTE GRGRLYDSVL DTVGNTPVIR INNLSPEGVT IYVKAEFFNP AASVKDRLAL 
NIIEAAERSG KLKPGMTVVE ATSGNTGIGL AMVCAQKGYP LVITMSEAFS VERRRLMRLL
GAKVVLTPRG GKGFGMYRKA QELAEANGWF LASQFETDAN ADIHEATTAR EIVADFAGER
LDWFVTGYGT GGTVTGVARV LRRERPEVKI VLSEPANAQL VASGVPQDRN ADGTAASGHP
AFEAHPIQGW TPDFIPKVLQ EGLDAGAYDE LIPVAGEDGM KWARELAAKE GILTGVSGGS
TFAVARQVAE RAPKGSVILA MLPDTGERYM STPLFQAIGE DMNEEEKALS ASTPSFQL