Gene RPB_1964 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_1964 
Symbol 
ID3908044 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp2231096 
End bp2232631 
Gene Length1536 bp 
Protein Length511 aa 
Translation table11 
GC content67% 
IMG OID637883858 
Producthypothetical protein 
Protein accessionYP_485583 
Protein GI86749087 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.611513 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTTTCA AGACCATCCT GATCGCATTG GCGATCATAG CACTCTCGTT CGGTGTCACG 
CTGAAGGCGT TCGACTGGCT GTCGCCGCGC GCCGTCAGCC CGCTGACGCT GCAGGCCCTG
CCGCCGCTGC CGGCGATCCG ACAGTCCTCG GTCGTCGTGC CCGTGAGCGT GCCGCTGACC
GCGATCCGCG ATCTGGTCGA CCGGAGCGCG CCGCGCAATT TTTCCGGCAG GGCCGACAAT
CCGATGCCGC AGCTCGTGCA GAACGCCGAT ATCAGCTGGA CGGTGGCGCG CGGCGCCGTC
GCGACCAAGG GCGCACCCGA GCAGGTCACC GTCACCGCCC CGCTGATCGG CACCCTCTCC
GCCCGGGGCT CGCTCTCGAG CAGCGCCCAG AACCAGGTCG GTGATACGCT CGGCAAATTG
TTCGGCGACA AGGTCGCCAA GCAGGTCGGC GTCAACATCA AGTCGTTCAA CGCCAGCGGC
GAGATCAAGG GCATGATCGC GATCACCGCC CGTCCGAAAG TGCTGCCGGA CTGGCACGTC
GACCCGAACC TCACCGCACA GGTGATCCTG TCCGACTCCA ATGTCGCGAT CGGCGGCGCG
CGCATCAACG TGCCGGCGCA GGTCAAGCCG GTGATCGACA AGGCCGTCAA CGACCAGATC
GCCCAGTTGC AACGGACCAT CCGCGACGAC GGCGCGCTGG AGCGCAGCGC GCGGCGCGAA
TGGGCGCGGA TCTGCCGCTC GATTCCGCTG CAAGGCGCCG GCGTGCCGAA CGGCTTCTGG
CTCGAACTGC GTCCGACCAA GGCGCTGGCC GCGCAGCCGC AGATCGACGG CGCGACGGTG
GCGCTGACGC TCGGGATCGT CGCCGACAGC CGTATCACCA CGGCGCCGAC CAAGCCGGAA
TGTCCGTTTC CCGCGCAGCT CGAAATGGTC GCGCCCGACA GCACCGGCGT CAAAGTCGCG
GTGCCGATCG ATATTCCGTT CAAGGAGCTC GATCGCGTCA TCGAGCCGCA ATTCGTCGGC
CGCACCTTTC CTGAAAACGG TGGGGCCGCC GCGATCACGG TGAAGCGCGT CAATGTCGCC
GCCAGCGGCG ACCGGCTGCT GATCTCGATG CTGGTGGATG CCAAAGGCCA GAAGAGTCTG
TTCAGCTTCG GCGGCGAAGC CACGCTGCAC ATCTGGGGGC GGCCGGTCCT GAACCAGGAG
GATCAGACGC TGCGGCTGTC CGACATGCAG CTCGCGGTGG AATCCGAGGC GGCGTTCGGC
CTGCTCGGCG AGGCGGCGCG CGCCGCCGTG CCCTATCTGC AGAAGGCGAT CGCCGAACGG
GCGGTGATCG ATCTCAAGCC GGAATCGCTC AACGTGCAGC GCCGGATCGG CGCGGTGATC
GCGGCGTATC AGCGCAACGA GGACGGCCTG CGCATTTCCT CGGAGATCTC CAGCCTGCGG
CTGACCGATG TGGCGTTCGA CTCCACCATG CTGCGGGTGA CCGCAGAGGC CAACGGCATT
CTCGAAGTCA CGATCACCAA GCTGAAGGCG CCCTGA
 
Protein sequence
MRFKTILIAL AIIALSFGVT LKAFDWLSPR AVSPLTLQAL PPLPAIRQSS VVVPVSVPLT 
AIRDLVDRSA PRNFSGRADN PMPQLVQNAD ISWTVARGAV ATKGAPEQVT VTAPLIGTLS
ARGSLSSSAQ NQVGDTLGKL FGDKVAKQVG VNIKSFNASG EIKGMIAITA RPKVLPDWHV
DPNLTAQVIL SDSNVAIGGA RINVPAQVKP VIDKAVNDQI AQLQRTIRDD GALERSARRE
WARICRSIPL QGAGVPNGFW LELRPTKALA AQPQIDGATV ALTLGIVADS RITTAPTKPE
CPFPAQLEMV APDSTGVKVA VPIDIPFKEL DRVIEPQFVG RTFPENGGAA AITVKRVNVA
ASGDRLLISM LVDAKGQKSL FSFGGEATLH IWGRPVLNQE DQTLRLSDMQ LAVESEAAFG
LLGEAARAAV PYLQKAIAER AVIDLKPESL NVQRRIGAVI AAYQRNEDGL RISSEISSLR
LTDVAFDSTM LRVTAEANGI LEVTITKLKA P