Gene Rsph17025_4048 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17025_4048 
Symbol 
ID5086221 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17025 
KingdomBacteria 
Replicon accessionNC_009430 
Strand
Start bp85759 
End bp87198 
Gene Length1440 bp 
Protein Length479 aa 
Translation table11 
GC content73% 
IMG OID640485611 
Producthypothetical protein 
Protein accessionYP_001170205 
Protein GI146280048 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.422059 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.0578992 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATTCC GGCTTTCCGT CGCCGCTGCC GTTGCCGGCA GCCTCGCCCT TCCGCTGCCC 
CAGGCCGCAA GGGCGGATTC CGACTTCGGC AAGGCGATTG TCGGGGGCAT GATCCTGTGC
GGCCTCACCA ACTGCCTCGG CGGCCGGCAG ACGCAGGCGG CGCCCCGCAG CGGCGGCGGC
GGAGGGGGCG GTGGAGGCAG CCGGACGCAG GCTCCGGCGG CGGATCCGAC CGTGCGCACC
GACCAGAATG CGCTGAACTA TTTCGGCTTC CCCGCGGGCT CGGCCGACGG ACGGATGGGC
CGGCGCACCC GCGAGGCGAT CAGCGGCTAT CAGAACTACA TGGGCTATCC CGCCACGGGG
CAACTCAGCC AGTATGAGCG CGATTCCCTC TGGAACGCCT ACAACCGCGC CCAGGCCGGG
GGCGGGGCGG CCTATGGCCA TGTCGTGGCC GCCGAGGGCA ACCGGGGGCT GATCAAGGCT
TTCGCCGCCG AGGCCCGGGG CGAGCAGTAC CGCCCGCCGG GCTACGCCAA TCCCGCCCTG
CCGAACCCGC AGGGATATCC CAACCCGCAG GGCTATGGTG CGCCCGGCAT GCCGAACCAG
CAGGCCTACG GCACCGGCCA GCCGCCCTAC GGCGCGAATG TTCCGGCCCC CCCGCCCCTG
CCGCCCGTGC CGCCGCAGGC CACGGGCGAC CTTCCGGTGG TGAGCGGGGC CCCGAATGCC
CTGCCCTCGC TGCCGCGGCT GCCGGTGCCC GCCGCCGCGG GCATCAGCGC CTCGATGGCC
GACCATTGCC GCTCGGCCGA GCTGCTCAGC GACGCCAACG GGGGCATGGC GGTGCCCGGC
CGCATCCCCG ATCCGGCGCA GGCCCTCGAC GAGCAGTTCT GCGCGGCGCG CGGCGATGCG
ATGTCGCGCT CGCAGCAGAT CCTGGCGCAG ATGGCGGGCG TCGGCGATGC CGAGATCCAG
GGCCAGTGCG AGGGGCTGAT CCGCACCATG GCCAGCCAGA CCGGCGCCCT CGAGGGCGAG
GCCGCCGCCA GCCTGGCGCC GAAGGCCGCG GCCTTCGCCG GAACGCTCGG CGCCGCGCCG
CAGGACATGA CCAACATCGG CGAGATCTGC CTCGGCACCG GCTACAAGAC CGACAATGCC
GACATGGCGG TCGCCTCGGC CATGCTGATG GTGGCGGCCG GCCACCAGCC CCATGCCGAG
ATCCTCGGCC ATCATCTGCG CAAGGGGTTC GGCCCCGGCC AGAACCAGGC CCGTGCCGCC
GAATGGTACG ACATGGGGCT CGGCGCGCTC GAGCAGGGGC ATCCCCCCGC CTTCCTTCCC
GCCCAAAGCG CCCAGCGGGT GAGCGTGATG CGCGAGGCGC TCGCCGCGAT GGGCGGCGGA
GCGGCGCAGC CGGTCAACGC CTCGATGCAG CTGCCCAGCT TCCCGGCGCA GGGCAACTGA
 
Protein sequence
MKFRLSVAAA VAGSLALPLP QAARADSDFG KAIVGGMILC GLTNCLGGRQ TQAAPRSGGG 
GGGGGGSRTQ APAADPTVRT DQNALNYFGF PAGSADGRMG RRTREAISGY QNYMGYPATG
QLSQYERDSL WNAYNRAQAG GGAAYGHVVA AEGNRGLIKA FAAEARGEQY RPPGYANPAL
PNPQGYPNPQ GYGAPGMPNQ QAYGTGQPPY GANVPAPPPL PPVPPQATGD LPVVSGAPNA
LPSLPRLPVP AAAGISASMA DHCRSAELLS DANGGMAVPG RIPDPAQALD EQFCAARGDA
MSRSQQILAQ MAGVGDAEIQ GQCEGLIRTM ASQTGALEGE AAASLAPKAA AFAGTLGAAP
QDMTNIGEIC LGTGYKTDNA DMAVASAMLM VAAGHQPHAE ILGHHLRKGF GPGQNQARAA
EWYDMGLGAL EQGHPPAFLP AQSAQRVSVM REALAAMGGG AAQPVNASMQ LPSFPAQGN