Gene RSP_4059 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRSP_4059 
SymbolhipA 
ID3711930 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides 2.4.1 
KingdomBacteria 
Replicon accessionNC_007489 
Strand
Start bp12163 
End bp13518 
Gene Length1356 bp 
Protein Length451 aa 
Translation table11 
GC content64% 
IMG OID640069409 
Producthypothetical protein 
Protein accessionYP_345276 
Protein GI77404703 
COG category[R] General function prediction only 
COG ID[COG3550] Uncharacterized protein related to capsule biosynthesis enzymes 
TIGRFAM ID[TIGR03071] HipA N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGGATGG ACGTCTGGAT GGAAGGGAGA GATACGCCCG TGGGCGTGCT GACCCGCTCG 
GAAGACAAGA GTCTGTCGTT CGTCTACGCC GGCGACATCG CGCCCGAGCA CCGGATCTCG
ATGTCGTTGC CGATCACCTC GGAGCCATAC AGCGATGCCG ATTGCAGGGG CTATTTCGCG
AACCTCCTGT TCGAGGGGCC GCAACTCGAA CGGGTTCTCG ATGGCTTCGG TCTCGACCGT
GGCGACATCG GTGCGCTCCT GTGGCACCTC GGGGCAGACT GCCCGGGCGC CATCTCGATC
ACCCCCGAGG GCACGGGGCC CGGCAAGATG CCAGGAAGGT TCCCCGAGGA CTACGAACGG
CTTTCGGAGG CCCGGCTCCA TCAGATCGTC CTGTCGCTGC ATCGGCACCG CCGCATGCCC
GAGGGAGAGC GCAACCCGTC ACCCGTCGCG GGCGTTCAGG GCAAGATCGC CTGCCTCATG
CTCGAGGGGG CGGTCTGGCT GCCCAAGGGC GGCTCCCGCG CACCAACAAC ACATATTCTG
AAAGTGTCTC CGCACTTCGA TCCTGACGTC ACACGCCAGG AGACGATCTT GCTCCAGATT
GCCTCCGAGA TCGGGATCGA TGCCGCCGAG ACTCGGGATC TCGTCTTCGA TGTTGGGGGC
ACGCACATCA ACGCGCTCCT GTCGACGCGC TTCGACCGCG ATATCGAGAT CGCAGAGGGG
GCAGGAACAA TTCGTCGCCG ACACGCCGAG GACTTCTGCC AGGCGCTCGG TCTGCCACCG
AGCCTGAAAT ACGAGCGCGA TGCAGTGGAT CCGCATCGCC GTTTCTCGGC CGCCGCCGTG
AATGTCATCG CGGCACAGAC GACGGTGCCG GCCCTGGTGA CGAGGGATTT TCTGGCACAA
ACGATCTTCA ACCTGCTCGT GGGCAACACG GACAACCACG GCAAGAACAC GTCGCTTCTC
TACCGCGGAC GGACGGTCCT CCTGGCGCCG CTTTACGATG TGGCGCCGGT GTTCATGGAC
AGGCGCGTGA CGCACGAATT CGCCTTCAGG CATGGCAGCG CGCGCTTCGC CGAGGACTTC
GATGTCGATG CCCTGCGAGG GCTGCTGTCC GATCTCGGCT TCGGAAAGCC GCCCGTCGAG
CGGGCGATGA AGCAGATCCA GCAGCTTGCG AAAAGGATAT CCGAGCTCTC GGCCCGCCAC
GCCCCCAAGG GCCTTGTCGA TGGTCTCCAT GCCCAGGCGC GCGTGCTCGA GGATGCGCTC
GATGTCGACT TCGGTCTTGA AGAACGAGAT TACTACGACC GCGTCGTCAG AGATGAGGCG
ATCGAGGCAG CGGGTGGATG GGGTACGTTA AGCTGA
 
Protein sequence
MRMDVWMEGR DTPVGVLTRS EDKSLSFVYA GDIAPEHRIS MSLPITSEPY SDADCRGYFA 
NLLFEGPQLE RVLDGFGLDR GDIGALLWHL GADCPGAISI TPEGTGPGKM PGRFPEDYER
LSEARLHQIV LSLHRHRRMP EGERNPSPVA GVQGKIACLM LEGAVWLPKG GSRAPTTHIL
KVSPHFDPDV TRQETILLQI ASEIGIDAAE TRDLVFDVGG THINALLSTR FDRDIEIAEG
AGTIRRRHAE DFCQALGLPP SLKYERDAVD PHRRFSAAAV NVIAAQTTVP ALVTRDFLAQ
TIFNLLVGNT DNHGKNTSLL YRGRTVLLAP LYDVAPVFMD RRVTHEFAFR HGSARFAEDF
DVDALRGLLS DLGFGKPPVE RAMKQIQQLA KRISELSARH APKGLVDGLH AQARVLEDAL
DVDFGLEERD YYDRVVRDEA IEAAGGWGTL S