Gene RPB_3495 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_3495 
Symbol 
ID3911297 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp3998763 
End bp4000160 
Gene Length1398 bp 
Protein Length465 aa 
Translation table11 
GC content69% 
IMG OID637885397 
Producthypothetical protein 
Protein accessionYP_487101 
Protein GI86750605 
COG category[R] General function prediction only 
COG ID[COG2358] TRAP-type uncharacterized transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCTCGT CTGACGGCGG CGGCGAGTCG CTGCGCAAGC GGCGGCAGCG CAATCGTGCG 
CGGCTGCGGC TGGCGACCGC GCTGACCGTG ATTGCGCTGG TCGCGGGCGC GCTCGGGGTG
CTGTTCTACG AATTGCGGCC GGTGACGTTG CGGATCGCGG TGGGGCCGCC GGGCAGCGAC
GATCTCAAGC TGATTCAGGC GCTGGCGCAG ACCTTCGTGC GCGATCGCAG TGCGGTACGG
CTGACGCCGG TGACGACCGA CGGCGCGACG CCGAGTCTGT CCGCATGGCG CGAGGGCCGC
GCCGATCTCG CCGTCACCCG CGGCGACCTC GATCTGCCGA CCGACGCGCA AGCGGTCGCG
ATCCTGCGCA AGAACGTGGT GACGCTGTGG GCGTTGCCGG GCAGGGGCAA GAAGCCGGTC
GCGGTGAAGA GCATCGCCGA GCTCGCCGGC CGTAAGGTCG GGGTGATCGG GCGCACGCCG
GCCAACGTCA AGCTGTTGCA CATCATCCTC GCCGAATCCG GCGTGTCGCC CGACAAGGTG
CAGACCGAGC AGTTCGCCGT CACGGCGATC GCCGAGATGG CGCGCGACGA GCAGCTGTCC
GCCTTTCTGT CGGTCGGCCC GCTCGACAGC AAGATCACGG CGGAGGCGAT CGCCGCCACC
GCGAAGGCGC GCGGCGAGCC GCGGTTTCTG CCGATCGACG TCGCCGATTC CATCGCCAAG
AAACATCCGA TCTACGATTC CGAGGAGATC CCCGGCAGCA CCTTCTCGTC CGCGCCGGCG
CGGCCCGAAG ACAAGGTCGA CACCGTCAGC ATCAACCACC TGATCATCGC CAAAAGTGCG
CTGTCGAGCC TCGATATCGC GCAGCTCACC CGGCAGATCT TCGCCGCGCG GCAGCAGCTC
GCGCGCGAAC TGCCGATCGC CGCCAAGATC GAGGCGCCCG ACACCGACAA GGCCGCCGCG
CTGCCGGCCC ATAGCGGCGC CGCCGCCTTC ATCGACGGCA CCGAGCGCAC CTTCATGGAG
CGCAACAGCG ACTACATCTG GGGGCTGGTC GTGCTGCTCT CCGGGCTCGG CTCGGCCGGC
GCGTGGTTTC GCTCCTACGT CACCCGCGAC GAGCGCGCCG CCGGCGCCAG GATGCGCGAC
CGCGCGCTCG CGCTTGTCGC AAAAGCCCGC AAGGCGCATT CGCTCGACGA GCTCGACGCC
ATGCAGCTCG AGATCGATCA CATCCTGCGC GACATGCTGG ACTGCTATGA CGACGGCGCG
ATCGACGATC TGGCGCCGTT CAACCTGGTG CTCGAGCAGT TTCACCACGC CGTCGCCGAC
CGGCGACAGA GTCTGACGAT CGCGAGCGGC GGAATAGCGC CGGCGCATCC GGATGCCCTC
GGCGTGCCGG GTGCATAG
 
Protein sequence
MVSSDGGGES LRKRRQRNRA RLRLATALTV IALVAGALGV LFYELRPVTL RIAVGPPGSD 
DLKLIQALAQ TFVRDRSAVR LTPVTTDGAT PSLSAWREGR ADLAVTRGDL DLPTDAQAVA
ILRKNVVTLW ALPGRGKKPV AVKSIAELAG RKVGVIGRTP ANVKLLHIIL AESGVSPDKV
QTEQFAVTAI AEMARDEQLS AFLSVGPLDS KITAEAIAAT AKARGEPRFL PIDVADSIAK
KHPIYDSEEI PGSTFSSAPA RPEDKVDTVS INHLIIAKSA LSSLDIAQLT RQIFAARQQL
ARELPIAAKI EAPDTDKAAA LPAHSGAAAF IDGTERTFME RNSDYIWGLV VLLSGLGSAG
AWFRSYVTRD ERAAGARMRD RALALVAKAR KAHSLDELDA MQLEIDHILR DMLDCYDDGA
IDDLAPFNLV LEQFHHAVAD RRQSLTIASG GIAPAHPDAL GVPGA