Gene RPB_3244 details

Gene Information

Locus tag: RPB_3244
Symbol:
ID: 3911045
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 3706532
End bp: 3707728
Gene Length: 1197 bp
Protein Length: 398 aa
Translation table: 11
GC content: 68%
IMG OID: 637885146
Product: hypothetical protein
Protein accession: YP_486851
Protein GI: 86750355
COG category:
COG ID:
TIGRFAM ID:
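
The coordinates and lengths listed above are mutually consistent, which can be checked with a short Python sketch. It assumes (the record does not state this) that Start bp and End bp are 1-based inclusive positions and that the protein length excludes the terminal stop codon. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 3706532
end_bp = 3707728

# Assumption: coordinates are 1-based and inclusive, so both ends are counted.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1197 398
```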


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal


Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal


Sequence

Gene sequence
ATGAGTGTCG GTACCGGGGC GCGTTCACTG CCCAACCATC GTGGGGCCCG CTATATGGGT 
TTTCGTAAGT CACTTGGCGC GATCGCCATC GCGCTGCCGG TGATGGGCTG CGTCTGGACG
GTGTATGCCA ACACCGTGGG CGCAAGCATC TATCCCAGCG TCGGCACTTG GGGCGAGGAG
CCCGCCAGCC GCACGGATTC GCGTCCCGAG CGACCGAATT TCGCATCGCG TTTCGAGCCG
CTGGCCGCGG CTGCGGAGAG TGCGCCACGC CTGATGGCGC ATCTCGCCCT GAAAAACGCC
ATGGCCAAGC AGGGCGCTGC GCCGGTGCAG GTCGCTTCCG CCGATCCCGG CGACGTTCCC
GTCACCGCCA GCGTGCAGGC CGCCAACGCT GCAGCGCCGC AGGCCAAGTC CGGTCACCGC
TATTACGCGC TGCTCGACGC CAATTATTCG TTCGGTTTCG AGCCCGACAG GTTCAGGCCC
GCGCAAAATA AATCGGCCGA GCCGGCCGCG GTCGATGCCG CCGGCAAGGC GGGAGCGGAC
GCGAAACTGG CGCTGGCCGT GCCGATGCCG CCGGAGCGCA AGTCGGTGGC CTCCCGCGTC
GCGCCCGGCC CGACCCGCAA CCCGTTCAGC CAGGAAGCGC TGGTGGCGCG CGCCAAGGCG
GTGCTGATGG CGCAGAAGGC GGAGAGCAAG TCGAGCTTCT TCGAGAAACT GTTCGGCAAG
CCCGACGCCC CGGCGCTGGC CTATGCGTCC GCCGAAGCCG GCGTGACCAG CACCGGCGAG
CCGCGCGATC CTGCCTTCAG TCCGTCCGAG GACGACCGCT ACACCGCAGT CTACGACATC
ACGGCCAAGA CCGTGTACAT GCCGGACGGC AGCAAGCTCG AAGCCCATTC GGGCCTTGGT
CCCAAGATGG ACGATCCGCG CCACGTCAAC GTCCGGATGC ACGGCGCGAC GCCGCCGCAC
ATCTACGACA TGAAGATGCG CGAGGCGCTG TTCCACGGCG TCGCGGCGAT CCGGCTGACG
CCGGTCGGCG GCCAGGACTT GATCCACGGC CGCACCGGCC TGTTGGCGCA CAGCTACATG
CTTGGCCCGC GCGGCGACTC CAACGGTTGC GTCTCGTTCA AGGACTACAA TGCCTTCCTG
CAGGCGTTCA AACGCGGCGA GGTCAAACGC CTCGTCGTCG TCGGCAGCAT GACCTGA
 
Protein sequence
MSVGTGARSL PNHRGARYMG FRKSLGAIAI ALPVMGCVWT VYANTVGASI YPSVGTWGEE 
PASRTDSRPE RPNFASRFEP LAAAAESAPR LMAHLALKNA MAKQGAAPVQ VASADPGDVP
VTASVQAANA AAPQAKSGHR YYALLDANYS FGFEPDRFRP AQNKSAEPAA VDAAGKAGAD
AKLALAVPMP PERKSVASRV APGPTRNPFS QEALVARAKA VLMAQKAESK SSFFEKLFGK
PDAPALAYAS AEAGVTSTGE PRDPAFSPSE DDRYTAVYDI TAKTVYMPDG SKLEAHSGLG
PKMDDPRHVN VRMHGATPPH IYDMKMREAL FHGVAAIRLT PVGGQDLIHG RTGLLAHSYM
LGPRGDSNGC VSFKDYNAFL QAFKRGEVKR LVVVGSMT
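
As a rough cross-check of the two sequences above, the following Python sketch strips the 10-character block spacing, computes a GC fraction, and translates codons. It assumes the codon table is enumerated in the NCBI base order TCAG; translation table 11 assigns the same amino acids as the standard table and differs only in the permitted start codons. The names `clean`, `gc_fraction`, and `translate` are hypothetical helpers, not part of any database tool.

```python
from itertools import product

# Amino acids for translation table 11, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}

def clean(seq):
    """Remove display spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()

def gc_fraction(seq):
    """Return the fraction of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)

def translate(seq):
    """Translate complete codons, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First row of the gene sequence, exactly as displayed in the record.
first_row = "ATGAGTGTCG GTACCGGGGC GCGTTCACTG CCCAACCATC GTGGGGCCCG CTATATGGGT"
print(translate(first_row))  # MSVGTGARSLPNHRGARYMG
```

The translated first row matches the first 20 residues of the protein sequence. Under the same assumptions, passing the full gene sequence to `gc_fraction` and `translate` should reproduce the listed GC content and the 398-residue protein.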