Gene RPB_2081 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_2081 
Symbol 
ID3908494 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp2364511 
End bp2365695 
Gene Length1185 bp 
Protein Length394 aa 
Translation table11 
GC content66% 
IMG OID637883973 
Producttwin-arginine translocation pathway signal 
Protein accessionYP_485698 
Protein GI86749202 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.353288 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTTTCG CTTCCCATAT TTCGCGTCGC CTGTTGCTCG CGGCCGGCGC CGCGTCGCTG 
GCACTGATCG CTGTCGGGCC GGCATCGGCC CAGGAGACCC TGAAGGTCGG CCTGGTGGCG
GCGATGTCCG GCCAGTCGGC GAAGTCCGGC GAGGCCATCG TCCGCGGTCT GTCGCTGGCG
CTGGACGAGA TCAACGCCAA GGGCGGCGTG CTCGGCAAGA AGCTGGAACT GGTGGTGCGC
GACGACGAGA GCAATCCCGC CAAGGGCGTG ATCGCCGCGC GCGAGCTGGT GCAGCGCGAG
AAGGTCGCCG CTTACTTCGG CGGCATCGAT ACGCCGGTGT CGATGGCGAT CGTGCCGTTC
GCCAATCAGT CCAAGGTGCC GTTCATCGGC GTCTGGGCCG CCGGTACCAA GATCACCCGC
AACGGCGCGC CGGAGAACTA CGTGTTCCGC GTCTCCGCGG TCGACGAACT GGTCGACATC
GCGCTGGTCG ACTACGCGGT CAAGAAATAC GGCGCCAAGA AGCCGGGCAT GATCCTCATC
AACAATCCCT GGGGCGAATC CAACGAGGCC GGGCTGAAGA GCGCGCTCGA CGCCAAGAAG
ATGACCGCCG CCGGCATCGA GAAATTCGAG ACCGGCGACG TCGACGTCGT GCCGCAGCTC
ACCCGGCTGA AGGACGCCGG CGCCGACACG CTGTTCATGG TCGCCAATGT CGCGCCCTCC
GCGCAGGTGG TGAAGTCGCT CGACCGGATG GGCTGGAGCG TGCCGGTGGT GTCGCATTGG
GGCCCGGCCG GCGGGCGTTT CACGGAGTTG GCCGGCCCCA GCGCCGAGAA GGTCCACTTC
ATCCAGACCT TCAGCTTCTC CGGCAACACC AGCCCGAAAG CCGTGGCGCT GTTCGACGCG
CTGAAGAAGA AATATCCCGA GGTCAAGACG GCCGCCGACG TCACCCCCGC GGTCGGCATC
GCCAATGCCT ACGACGCCAT GCATCTCACC GCGCTGGCGA TCGCCAAGGC CGGCTCGACC
GAAGGCCCGA AGGTCCGCGA AGGCTTCTAC CAGATCGGCA GCTATGACGG GCTGATCAAG
ACCTACAACA AGCCCTTCAC CGCCGACAAT CACGACGCGC TGTCGCCCTC GGACTATCTG
TTCACCTACT TCAAGGGCGC CGAGATCCTG CCGCTGACGA ACTGA
 
Protein sequence
MSFASHISRR LLLAAGAASL ALIAVGPASA QETLKVGLVA AMSGQSAKSG EAIVRGLSLA 
LDEINAKGGV LGKKLELVVR DDESNPAKGV IAARELVQRE KVAAYFGGID TPVSMAIVPF
ANQSKVPFIG VWAAGTKITR NGAPENYVFR VSAVDELVDI ALVDYAVKKY GAKKPGMILI
NNPWGESNEA GLKSALDAKK MTAAGIEKFE TGDVDVVPQL TRLKDAGADT LFMVANVAPS
AQVVKSLDRM GWSVPVVSHW GPAGGRFTEL AGPSAEKVHF IQTFSFSGNT SPKAVALFDA
LKKKYPEVKT AADVTPAVGI ANAYDAMHLT ALAIAKAGST EGPKVREGFY QIGSYDGLIK
TYNKPFTADN HDALSPSDYL FTYFKGAEIL PLTN