Gene Rsph17025_4217 details


Gene Information

Locus tag: Rsph17025_4217
Symbol:
ID: 5086388
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17025
Kingdom: Bacteria
Replicon accession: NC_009430
Strand:
Start bp: 259707
End bp: 260909
Gene Length: 1203 bp
Protein Length: 400 aa
Translation table: 11
GC content: 65%
IMG OID: 640485778
Product: hypothetical protein
Protein accession: YP_001170372
Protein GI: 146280215
COG category: [S] Function unknown
COG ID: [COG3177] Uncharacterized conserved protein
TIGRFAM ID:
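
The coordinates and lengths listed above agree with one another, which a short calculation can confirm. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon not counted in the protein length; the record itself does not state either convention.

```python
# Values copied from the record above.
start_bp = 259707
end_bp = 260909

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1203 400
```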


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value: 0.921636
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value: 0.168634
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGAGAGA CACCCGGCAG GATTGAACCA TGCTTCTTCG AAGAGCACAT ACCTGCCAGC 
CTCGCGGACC TGTCGGTCGA GATTCAGCGC GAAGCGGCGA ACCTCGGCCA GGGTCTCCAC
CCGGACAGCG CAGCCGAACT GGCGGACCTC GTCAGGGTGA TGAACTGCTA CTACTCGAAC
CTGATCGAGG GGCACAACAC GCGCCCGCGC GACATCGAAC GCGCCCTCGC GGGGGCCGAG
CTTGAGGCGG AGACACGCCC GCTTGCGCTG GAGGCTCGAG CCCATGTCAT CGTCCAGCGA
ACAATCGACA GGATGCATCG GGAAGGCACC TTGCTCCGGC CCACATCCGT CGCGTTCCTC
ACCTGGGTAC ACAAGGCCTT CTACGACGAG ATGCCCGACG AGTTCCGGCA TGTCGAACAT
CCGGATGGAA CGACCGAGCC GATCATTCCG GGCCGCATGC GGCAGGAGGG CGACCGCGAA
GTCGCCGTCG GCCGCCATCT TCCCCCCTCC TCGAGTCGGG TCGCGCCCTT CATGGATCAC
TTCGACAAGC GATTTCAGAT CGCGGCCCGC TCGGCGAGCG GACGGATCAT CGCCATCGCC
TCGGCACACC ACCGGCTAAA CTACATACAC CCGTTTCCCG ACGGGAACGG GCGGGTCAGC
CGGCTGATGT CGCATGCGAT GGCGCTCGAA GCAGGCATTG GAGGCCAAGG CTTATGGTCC
GTTTCGCGCG GGCTGGCGCG CGGGCTGGCG GATCGGGGCG AATACAAGCG CATGATGGAC
ATGGCCGACT CCCCCCGTCG CGGCGATCGC GACGGGCGGG GCAATCTGTC CGAGGCTGCC
CTGAAGACCT ATTGCGAATG GTTCCTGACG GTCACGCTGG ATCAGATCAC CTTCTCGGCC
AAGCTCTTCG ACCTTGGCGG CCTGGAAAAG CGCTACCGGC GTCTGGTCGA AGACACGGTC
GACGACAAGC GTGCGCCCGA CCTCATCTCG GCGGTCCTTC GCTATGGCAC GCTGGAACGC
GGCGAGGCGC AGATCGTCCT CAAGACGTCC GAGCGCACGG CGCGCAACAC GCTGAGCAAG
CTGACATCAG CCGGCTACCT GTCATCAGCC TCACCGAAGA CGCCCGTGCG GCTCGCTTTT
CCTCTCGACT ACCGGGAGCG CCTTTTCCCG AACCTGTTCG CTGATGCGTG CCTGCCCGGG
TAA
 
Protein sequence
MRETPGRIEP CFFEEHIPAS LADLSVEIQR EAANLGQGLH PDSAAELADL VRVMNCYYSN 
LIEGHNTRPR DIERALAGAE LEAETRPLAL EARAHVIVQR TIDRMHREGT LLRPTSVAFL
TWVHKAFYDE MPDEFRHVEH PDGTTEPIIP GRMRQEGDRE VAVGRHLPPS SSRVAPFMDH
FDKRFQIAAR SASGRIIAIA SAHHRLNYIH PFPDGNGRVS RLMSHAMALE AGIGGQGLWS
VSRGLARGLA DRGEYKRMMD MADSPRRGDR DGRGNLSEAA LKTYCEWFLT VTLDQITFSA
KLFDLGGLEK RYRRLVEDTV DDKRAPDLIS AVLRYGTLER GEAQIVLKTS ERTARNTLSK
LTSAGYLSSA SPKTPVRLAF PLDYRERLFP NLFADACLPG
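
The protein sequence can be related back to the gene sequence through translation table 11. The sketch below is illustrative only: the function names `clean`, `translate`, and `gc_percent` are hypothetical, the sequence is assumed to be given 5' to 3' and read in frame from its first base, and the amino-acid string follows the NCBI ordering of codons (TTT, TTC, TTA, TTG, TCT, ...). Table 11 differs from the standard code in its permitted start codons, which does not matter here because this gene starts with ATG.

```python
# Bases in NCBI order; AMINO_ACIDS[16*i + 4*j + k] is the residue for codon
# BASES[i] + BASES[j] + BASES[k]. "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks that group the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First ten codons of the gene sequence above.
prefix = "ATGCGAGAGA CACCCGGCAG GATTGAACCA"
print(translate(prefix))  # MRETPGRIEP
```

Passing the full gene sequence to `translate` and `gc_percent` in the same way allows a comparison with the protein sequence and the GC content reported in the record.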