Gene Rsph17025_4347 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17025_4347 
Symbol 
ID5086523 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17025 
KingdomBacteria 
Replicon accessionNC_009431 
Strand
Start bp105930 
End bp107111 
Gene Length1182 bp 
Protein Length393 aa 
Translation table11 
GC content71% 
IMG OID640485903 
Producthypothetical protein 
Protein accessionYP_001170497 
Protein GI146280341 
COG category[R] General function prediction only 
COG ID[COG3500] Phage protein D 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.21518 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.0759441 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGCGG AAACCTTCCT TTCGGGCCGG GCCTTCCTGA CCCCGGCCTT CCGCATCGAA 
CTCGATGGGC GCAGCACCGG CCCCGAGGTG ATCTCGGACG TTCTGGAGGT GTCCTTCACC
GACGATCTGG CCAACATCCC CAGCTTCGAG TTCGTGCTGC ACGACTGGGA TCCGGTGGCG
CTGCGCCCGC GCTATTCAAG CCCGTGGGAC GCCAACGGCC AGCCTTTCAC CCTGCACGAC
GGCGGGCCCG AGGTGCCGAA CTTCGAGCCG GGCGCGCGGG TCGCGCTCTA TCTCGGCTAT
CTCGAGGACG GGGCGCTGCC CCTGATCATG GAGGGCGAGG TGGTCTCGCT CGCCCCGGTC
TTTCCCGCCT CGGGAACGCC GACCTGCCGG GTGCGGGCGC TGAACGCCTT CCTGCGGGGG
CTGCAGAAGA TCCGGGTCGA GCAGAACGAG AACGGCACGC CCAAGGATGT GGTGGATGCG
ATCTGCGCCG GGCATGGCGT CAGCGTCCGC TGGGCCACGC TCGAGGCCGA GGGCGAGGCG
GAGACGAACG TCGAGGTCGA GGGCACGCTC TATGACGAGA TCGCCAAGCG CGCGAAGGGC
TACGGTCTGG TCATGACGGT CGTGCCGCCG GACGCGCCGG GCGAGGAGCC GGTGCTGTAT
CTTGCGCGGC CCAGCCAGTC CAACGACGGG CCGGTGGCCG AGTTCGTCTG GGGGCGCACG
CTGATCTCCT TTACCCCGGC CCTGTCGGCG GCCAACCAGG TCTCGGCCGT GGTCTGCCGG
GGCGGCGATC CGCACGCCGC CGGATCGGCG CAGAACATCG AGGTGGTGAG GACATGGGCC
GACATCGGCC TCTCGCCCGA GGCGCTGGGG CCCGCGCGTG TGGCCGATCT CGAGACGGCC
GTGCGCGGCA TCCGCGAGGT CATCAAGCCC GCCGGCGTGC AGACCGTGGC CGACGCCGAG
CGCGCGGCGC TGGCGCGGCT GCAGGAGCTG GCGGCCGAAA TGATCACCGG CGCGGGCAGC
GCCATCGGCC TGCCCGCGCT GCGGGCCGGA AAGACCGTCG CCATGTCGGG CATGGGCGCG
CGCTTTGACG GGATCTACCG CCTGACGCAG ACGACGCACG CGATCGGCGG CGCGGGCTAC
ACCACCAGCT TCCAGTGCCG CAAGGAGGTG CTCGATGGCT GA
 
Protein sequence
MTAETFLSGR AFLTPAFRIE LDGRSTGPEV ISDVLEVSFT DDLANIPSFE FVLHDWDPVA 
LRPRYSSPWD ANGQPFTLHD GGPEVPNFEP GARVALYLGY LEDGALPLIM EGEVVSLAPV
FPASGTPTCR VRALNAFLRG LQKIRVEQNE NGTPKDVVDA ICAGHGVSVR WATLEAEGEA
ETNVEVEGTL YDEIAKRAKG YGLVMTVVPP DAPGEEPVLY LARPSQSNDG PVAEFVWGRT
LISFTPALSA ANQVSAVVCR GGDPHAAGSA QNIEVVRTWA DIGLSPEALG PARVADLETA
VRGIREVIKP AGVQTVADAE RAALARLQEL AAEMITGAGS AIGLPALRAG KTVAMSGMGA
RFDGIYRLTQ TTHAIGGAGY TTSFQCRKEV LDG