Gene Rsph17029_3089 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_3089 
Symbol 
ID4898190 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009050 
Strand
Start bp104202 
End bp105386 
Gene Length1185 bp 
Protein Length394 aa 
Translation table11 
GC content74% 
IMG OID640113691 
Productcreatinase 
Protein accessionYP_001044961 
Protein GI126463848 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0006] Xaa-Pro aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.430642 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.871573 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGACAC TCCCGGCAGA CACCGCGGCC GACCCGCTCC GCGGCCGTGA CGCGGTTTTC 
CCGGCCGCCG AATTCGCAGG CCGTATGGCG CGGCTGCAGG CGGCCACGGC GGATCTCGGC
GCCGATGCGC TGGTGCTGCT CGGGCCCGAG AACATCTTCT GGGCCACGGG GCGGCAGACG
GCGGGCTATT TCGCCTTTCA GGCCCTCGTG GTGCCGGTCG AGGGGGCGCC CGTGCTCCTC
GTGCGCCAGC TCGAGACGAC GGGCGCGCGT GCTTCGACCT GGCTCGCCGA CATCCGCGCC
TGGCAGGACG GCGAGGACCC CGCGGCGGCG CTCGGCGCCC TCGTGCGGGA TCTGGGCCTC
GGGCGCATCG CCATGGAGCG CGGCGCCTGG TTCATCGGTC AGGACCTGTC CGAGCGCATC
GCCCAGGCCC TTGCCGGCGT GGCGCTGATC GACGGGTCGG GTGTGGCGGA GCGGCTGCGC
GCGGTGAAAT CGCCGGCCGA GCTGTCCGCG ATCCGCAAGG CCGCGGGCTA TGCCGAGGCC
GCCATCGCCG CCTCGATCGA GGCCTGCCGG GCGGGCGTCA GCGAGAACGA GGTGGCCGCC
GCCATGATGG GGGCCGCGAT CCGCGCGGGC TCCGAAGCCA TGGCGATGGA GCCGCTCGTC
TCCTCGGGGC CGCGCTCGGG CGTGCCCCAT GCGACCTGGC GGCGACGGCT GCTCGAACCC
GGCGACGGGG TGTTCCTCGA ACTGGCGGCG AGCCACGACC GCTATCACGC GGCGCTCATG
CGCAGCGTCT GGATCGGCCC GCCGCCCGCC GAGGCCGCGC GCATGATGGA CACGGCCGAG
CGCGCGCTCG ATGCGGCGCT GGCGGCCCTG CGCCCCGGCG CGCCCTGCGC GGCGCCGCAC
GAGGCCGCGC AGGCGGTGAT CGATGCCGCG GGCTATACGG CCGCCTTCCG CAAGCGCATC
GGCTATTCGA TGGGCGCGGC CTTCGCGCCC GACTGGGGCG AGGGGGCGAT CCTGTCGCTC
TTCACCGGCG TGGATCGCCT GCTGGAGCCC GGCATGGTCT TCCACCTGCC CGCCACGCTG
CGCAGCTACG GCGACTATAC GGTCGGCGCC TCCGAGACGG TGATCCTCAC CGAAACCGGC
ATCGAGGTCT TGTCGACCCT GCCCAGACAG ATGAGGGTGC GCTGA
 
Protein sequence
MSTLPADTAA DPLRGRDAVF PAAEFAGRMA RLQAATADLG ADALVLLGPE NIFWATGRQT 
AGYFAFQALV VPVEGAPVLL VRQLETTGAR ASTWLADIRA WQDGEDPAAA LGALVRDLGL
GRIAMERGAW FIGQDLSERI AQALAGVALI DGSGVAERLR AVKSPAELSA IRKAAGYAEA
AIAASIEACR AGVSENEVAA AMMGAAIRAG SEAMAMEPLV SSGPRSGVPH ATWRRRLLEP
GDGVFLELAA SHDRYHAALM RSVWIGPPPA EAARMMDTAE RALDAALAAL RPGAPCAAPH
EAAQAVIDAA GYTAAFRKRI GYSMGAAFAP DWGEGAILSL FTGVDRLLEP GMVFHLPATL
RSYGDYTVGA SETVILTETG IEVLSTLPRQ MRVR