Gene Rsph17029_4014 details


Gene Information

Locus tag: Rsph17029_4014
Symbol:
ID: 4899048
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009050
Strand:
Start bp: 1157960
End bp: 1159120
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table: 11
GC content: 74%
IMG OID: 640114617
Product: hypothetical protein
Protein accession: YP_001045864
Protein GI: 126464751
COG category:
COG ID:
TIGRFAM ID:
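
The length fields above are consistent with the coordinates. A minimal Python sketch of that check follows, assuming the start and end positions are inclusive, 1-based coordinates and that the unspliced CDS ends in a single stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given inclusive, 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length_bp = gene_length(1157960, 1159120)
print(length_bp, protein_length(length_bp))  # prints: 1161 386
```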


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.244675
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTGATCC GTGCGGCTCT GGCGGCTCTC GTGCTTCTCG CGGCCACGAC GGCCGGGGCC 
GAGACCCCTG AGCGGCGGCT CGAGATCCTG GCCTTCCCCC GCGCCGACAG CGTGGTGGCG
GGCGAGATGG TGCCGGTGAC GGTGCGCGGC ATCTACGACC GCAAGGTGAC GCTCGAGGAG
ATGACGATCC GGCCCGACGA CAGCTTCGAC TGGGTGCAGC TGGCCAAGGA CGACTGGCAC
GAGGAGCGCA TCGACGGCCG CCTGCGGCTG GTGGTCGAAC GCAGGCTCGC GCTCTTTCCC
AAACATTCCG GCTCGTCGCG CTTCGGGCCC GCCGAGCACC GGCTGACCTT CGTCGGCGCG
GGCGGGAAGG CGGAGACCAT CACCTCGCAT CCGCTCGACC TGTCGGTGGC GCCCATGCCC
GACGATCCGC CCTTCCACAG CCCGCACGGC TGGCGCTTCG CCGTCTCCGA GCTGAGGGTG
ACGGATGAGC TCAGCACCGA TCCGGCCCGG CTCAAGGACG GCGAAACCGT GACGCGGCGC
GTGACAGTGA CCGCGGTGGG CGCGCTGCCC GCGATGCTGC CGCCGCGGCC CGTGGTCTCG
GAGAACTGGC TCATCGCCTT CGCGGCTCCG GTCGAGCGGT CGCTGGAGCT GACGCCGGAC
GGCCCCGTGG CGCGGGTGAT CTGGAGCTGG CAGTTCCGCC CCGAGACCGG CGAGCCCGGC
GTGCTGCCCG CCGTGCCGAT CCCCTATTTC AACACGGTGA CGCGGAAGGT GGAGGCGGCC
GAGATCCCCG CGCTGCCCAT CGGCTATGCG AGCTTCGCCG CCTCGCAGTC GGCCGGCATC
GCCATCACGC CGGCGAGCCT CTGGGGAGGG CTGGCGGCCG GTCTGGCGGG GCTCAGCGCG
GGGACGGCGC TGCTCGTGGC CGGCCACCGG CCGACGGCCG CGGCGCTCGG ACGGCTCGCG
CGGCGGCGCT CGCCCTTCCG CCGCTGGCAG ATCTGGCGCG CGGCACGGGC GGGTGACCTG
CTCGCGCTCC GCCGGGCCAC CGAGGAGGAG GCGCTCGACA GGCCTGCAGC GCGCGCCGCG
CTCGAACGGG CGATCTACGG CCCGCCGCCG CAGCCCTTCG ACGCGCGCGC CTTCCTGCGG
ACCCTGCGCC GGAAGGCTTG A
 
Protein sequence
MVIRAALAAL VLLAATTAGA ETPERRLEIL AFPRADSVVA GEMVPVTVRG IYDRKVTLEE 
MTIRPDDSFD WVQLAKDDWH EERIDGRLRL VVERRLALFP KHSGSSRFGP AEHRLTFVGA
GGKAETITSH PLDLSVAPMP DDPPFHSPHG WRFAVSELRV TDELSTDPAR LKDGETVTRR
VTVTAVGALP AMLPPRPVVS ENWLIAFAAP VERSLELTPD GPVARVIWSW QFRPETGEPG
VLPAVPIPYF NTVTRKVEAA EIPALPIGYA SFAASQSAGI AITPASLWGG LAAGLAGLSA
GTALLVAGHR PTAAALGRLA RRRSPFRRWQ IWRAARAGDL LALRRATEEE ALDRPAARAA
LERAIYGPPP QPFDARAFLR TLRRKA
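
The protein sequence can be rederived from the gene sequence, and the reported GC content can be rechecked the same way. The Python sketch below is one way to do this under stated assumptions: the helper names (`clean`, `gc_content`, `translate`) are hypothetical, the sequence is read in frame from its first base, and alternative start codons are not handled (this gene begins with ATG, so that case does not arise here). Translation table 11 assigns the same amino acids to codons as the standard code and differs only in permitted start codons, so a single codon table is used.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (first base varies slowest).
# The same assignments apply to translation table 11 and the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First row of the gene sequence as listed in this record.
first_row = "ATGGTGATCC GTGCGGCTCT GGCGGCTCTC GTGCTTCTCG CGGCCACGAC GGCCGGGGCC"
print(translate(first_row))  # prints: MVIRAALAALVLLAATTAGA
```

Passing the full gene sequence to `translate` and `gc_content` allows a comparison with the listed protein sequence, the 386 aa protein length, and the 74% GC content.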