Gene Rsph17029_2222 details

Gene Information

Locus tag: Rsph17029_2222
Symbol:
ID: 4897121
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009049
Strand:
Start bp: 2352621
End bp: 2353676
Gene Length: 1056 bp
Protein Length: 351 aa
Translation table: 11
GC content: 73%
IMG OID: 640112816
Product: hypothetical protein
Protein accession: YP_001044097
Protein GI: 126462983
COG category: [S] Function unknown
COG ID: [COG3768] Predicted membrane protein
TIGRFAM ID: [TIGR01620] conserved hypothetical protein, TIGR01620
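
The coordinate and length fields in this record agree with each other, and a short sketch can confirm that. It assumes the start and end coordinates are 1-based and inclusive, and that the gene length includes a single untranslated stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 2352621
end_bp = 2353676

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values match the listed gene length (1056 bp) and protein length (351 aa).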


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0622508
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.457104
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
GTGAGCGACA AGCCCCCGAA GAAGCCCTTG CTCCTCGACT GGGATACGCG CGAGGAAACG 
GCGGCCGATG CGCCCCGCGC CCGCCGCCAG TCGGCGGCGC CCGAGACCGT CTCGGGGCCC
ACGCCCGCCG ACGTGCCGCC GGTGCCCGAC CTCGATCTGC CGCAGGGGCA GGCGATGCTG
GCCGCAAGCC GCATCGCCAC CGTCCGCAGC TCGCGTCTGG GCCGCTTCGC CGGCTGGATC
TTCGGCACGC TCCTGAGCTT CGTCCTCTCG GTGGCGGCCT GGGACTTCGT GACCTCGCTC
CTGTCGCGCA ACAGCGTGCT CGGTGCCGCG GCCTTCGTGC TGATCGGGAC GGCGGTCCTC
ACGGCGCTGG CGCTCGCGCT GCGCGAATGG TGGGCCTATG TGCGGCTCGA GCGGCTCGAC
AGCCTGCGCG AGGCCGCCAT CGCCGCGCGC GCCACGAACG ATCTCAAGGC CGCACGCCGC
GTGGTGACCT CCATCGAGAA GATGTATGGC CACCGCGCCG ACCTGCGCTG GGGGAAGGCG
CGGCTGGCCG AGCGGCAGGC CGAGGTCTTC GATGTCGACG GCCTTCTGGG GCTGGCCGAG
AACGAGCTTC TGGTGACGCT CGACCAGAGC GCACGACGCG AGATCGAGGC GGCGGCGCGT
CAGGTGGCGG CGGTCACAGC GCTGGTGCCG CTGGCGCTCG CCGATGTGGC GACGGCGCTC
TATGCCAACC TCCGCATGGT GCGCCGCATC GCCGAGATCT ACGGCGGGCG CTCGGGCAGC
TTCGGCAGCG TGCGGCTGCT GCGCCGGGTG TTCTCGTCGC TGATCGCGGC GGGGGCGGTG
GCCATGACCG ACGATCTGCT CCATTCGGTC GCGGGCGGGG GCGTGCTCTC GAAGGTCTCG
CGCCGGTTCG GCGAGGGGAT GGTGAACGGC GCCCTCACCG CGCGGGTGGG GGTGGCCGCG
ATGGAACTCT GCCGCCCGCT GCCCTTCCAC ACCGCGCCGC GCCCGAAGGT CACGAACCTC
ATCAGCCGCA GCCTCACCGG CCTCTTCGAC CGGTGA
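
The GC content reported for this gene can in principle be recomputed from the sequence. The following is a minimal sketch, assuming GC content means the percentage of G and C bases over all bases; `gc_content` is a hypothetical helper, not a function of the source database. The demo input is only the first line of the gene sequence, so its result is not the whole-gene figure.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First line of the gene sequence, used only as a short demo input.
first_line = "GTGAGCGACA AGCCCCCGAA GAAGCCCTTG CTCCTCGACT GGGATACGCG CGAGGAAACG"
print(round(gc_content(first_line), 1))
```

Passing all 18 sequence lines joined together gives the whole-gene value to compare with the GC content field.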
 
Protein sequence
MSDKPPKKPL LLDWDTREET AADAPRARRQ SAAPETVSGP TPADVPPVPD LDLPQGQAML 
AASRIATVRS SRLGRFAGWI FGTLLSFVLS VAAWDFVTSL LSRNSVLGAA AFVLIGTAVL
TALALALREW WAYVRLERLD SLREAAIAAR ATNDLKAARR VVTSIEKMYG HRADLRWGKA
RLAERQAEVF DVDGLLGLAE NELLVTLDQS ARREIEAAAR QVAAVTALVP LALADVATAL
YANLRMVRRI AEIYGGRSGS FGSVRLLRRV FSSLIAAGAV AMTDDLLHSV AGGGVLSKVS
RRFGEGMVNG ALTARVGVAA MELCRPLPFH TAPRPKVTNL ISRSLTGLFD R
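
The protein sequence corresponds to the gene sequence translated with translation table 11. The sketch below shows that relationship on the first 60 bases. It assumes the standard codon-to-amino-acid assignments, which table 11 shares with the standard code, and it assumes the first codon is a valid start codon read as methionine (here GTG). `translate` and `CODON_TABLE` are illustrative names, not part of any library.

```python
from itertools import product

# Codon table in TCAG order (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a coding sequence up to, but not including, the first stop."""
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        # Assumption: the first codon is a start codon and is read as Met.
        protein.append("M" if i == 0 else aa)
    return "".join(protein)


first_line = "GTGAGCGACA AGCCCCCGAA GAAGCCCTTG CTCCTCGACT GGGATACGCG CGAGGAAACG"
print(translate(first_line))  # MSDKPPKKPLLLDWDTREET
```

The output matches the first 20 residues of the protein sequence listed above.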