Gene RPB_3833 details


Gene Information

Locus tag: RPB_3833
Symbol:
ID: 3911636
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 4375641
End bp: 4376702
Gene Length: 1062 bp
Protein Length: 353 aa
Translation table: 11
GC content: 65%
IMG OID: 637885733
Product: hypothetical protein
Protein accession: YP_487437
Protein GI: 86750941
COG category: [S] Function unknown
COG ID: [COG0392] Predicted integral membrane protein
TIGRFAM ID:
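
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes 1-based inclusive coordinates and a coding sequence that ends in a single stop codon, and the function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database tool.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming one trailing stop codon (3 bp)."""
    return (gene_len_bp - 3) // 3


# Values copied from the record above.
length_bp = gene_length(4375641, 4376702)
print(length_bp, protein_length(length_bp))  # prints: 1062 353
```

Both results agree with the Gene Length and Protein Length fields listed above.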


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00492954
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTATCGAC TGCTGAGCGC GCTGGGGCGC GGCTTCAAGA CGTATGTCGG GTGGAAACGG 
CTCGGCATCG TCGCGAGCAT TCTGATCATC GGCTTTGCGA TCACGTCGCT GGTCAACACC
CTGAAGGGGG TCGACAGCGC CGTCATCCTG ACCGCGCTGA CGGAGAAGTC CCCAACTCAG
ATCGGGCTCG CCGCGCTGTT CGTGGTCGGC GCGTTCTGCA CGCTGACCTT CTACGATTTC
TTCGCCCTGC GAACGATCGG CAAGCTGCAC GTGCCGTACC GGATCGCGGC GCTGTCGGCC
TTCACCTCTT ATGTCATCGG GCACAATCTC GGCGCCACGG TGTTCACCGG CGGCGCGATC
CGGTTCCGGA TCTATTCCGA CTACGGCCTC ACCGCGATCG ACGTCGCCAA GATCTGCTTC
ATCTCCGGCC TGACGTTCTG GCTCGGTAAC CTGTTCGTGC TCGGCATCGG CATGATCTGG
CACCCCGCCG CCGCCAGCGC GATGGATCTG TTGCCGGACA GCGTCAACCA GCTGATCGGC
GTCGCCTGCC TCACCGGCAT CGCGGCGTAT TTCGTGTGGC TGGCGACCGG CAAGAAGCGC
CGCCAGCTCG GCCAGAACGG CTGGAAGGTG GTACTGCCGT CGGCCAAGCT GACGCTGGTG
CAGGTGCTGA TCGGCGTGGT CGATCTCGGC TTCTGCGCCA TGGCGATGTA CATGCTGATG
CCGTCCGAGC CCTATATCGA CTTCGTCTCG CTGGCGGTGG TGTTCATCCT CGCCACGCTA
CTCGGCTTCG CCAGCCATGC CCCCGGCAGC CTCGGCGTGT TCGACGCCGC GATGCTGGTG
GCGCTGCCGA TGTTCGCCCG CGAGGACATC ATCGCCACGC TGCTGATCTA TCGCGTGTTG
TATTTCCTGC TGCCGTTCGG CGTCGCGATC TCGATCCTCG GCATGCGCGA GCTGTGGCTG
AGCGTGATCA AGCCGTGGCA GGAGAAACGC GCCGGCAATG GCCACCCGGT CGCCGCGGCT
CCGGTCCGGC AGATCGCGCA GCGCCCGCGC AAGCAAGGCT GA
 
Protein sequence
MYRLLSALGR GFKTYVGWKR LGIVASILII GFAITSLVNT LKGVDSAVIL TALTEKSPTQ 
IGLAALFVVG AFCTLTFYDF FALRTIGKLH VPYRIAALSA FTSYVIGHNL GATVFTGGAI
RFRIYSDYGL TAIDVAKICF ISGLTFWLGN LFVLGIGMIW HPAAASAMDL LPDSVNQLIG
VACLTGIAAY FVWLATGKKR RQLGQNGWKV VLPSAKLTLV QVLIGVVDLG FCAMAMYMLM
PSEPYIDFVS LAVVFILATL LGFASHAPGS LGVFDAAMLV ALPMFAREDI IATLLIYRVL
YFLLPFGVAI SILGMRELWL SVIKPWQEKR AGNGHPVAAA PVRQIAQRPR KQG
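
The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such a block layout could be joined back into one string and checked for GC content follows; `clean_sequence` and `gc_fraction` are hypothetical helper names, and the sample is only the first line of the gene sequence, so its GC fraction will not necessarily match the 65% reported for the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Drop spaces and line breaks, and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in seq (0.0 for an empty string)."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence above, used here as a short sample.
sample = "ATGTATCGAC TGCTGAGCGC GCTGGGGCGC GGCTTCAAGA CGTATGTCGG GTGGAAACGG"
seq = clean_sequence(sample)
print(len(seq), round(gc_fraction(seq), 2))
```

Pasting the full gene sequence into `sample` would allow the length and GC content to be compared with the Gene Length and GC content fields of the record.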