Gene RPB_1097 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_1097 
Symbol 
ID3910183 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp1258733 
End bp1260013 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content68% 
IMG OID637882990 
Producthypothetical protein 
Protein accessionYP_484718 
Protein GI86748222 
COG category[S] Function unknown 
COG ID[COG3174] Predicted membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.122525 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.979079 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTGCGA CGCCGTCTGT TCATAGTCTC GGCCTGCTGC TGCTGTTGAG TTTTTTTCTC 
GGCTTCGCGT TCGAGGATTT CTTCGCGAAG ACCAGCTCCG CACGGCCGGG CGGCATCCGC
ACCTTCCCGC TGCTCTCGCT CGGCGGCGGC ATCCTTTATC TGTTCGATCC GACACACCTG
ATCGCCTTCA CTGGCGGACT GCTCGTGCTC GGCGCCTGGC TGGCGATGTT CTACGGCGTC
CATCTGCGCG AGCGCGACGA GAAAGGCGAG CGCAATGCCG GGCTGGTGGT GCTGCTGCTG
AACGTGCACG CCTATCTGCT CGGCGCGGTC GCGCTGGCGC TGCCGCATTG GATCGCGGTC
GGCGTCACCG TGGTCGCGGT GCTGCTGCTG ACCGGGCGCG ACCGGCTGCA CACGCTGGCG
CGCCGCATCG ACATGAAGGA AATCACCACG GCCGGTCAGT TCCTGATTCT GACCGGTGTG
GTGCTGCCGC TGCTGCCGGC CGAGCCGGTG ACGACCCTCA CCAGCATCAC GCCGCGGCAG
GCCTGGCTGG CGCTGACGCT GGTCTGCACC CTGTCCTATG CGAGCTATCT GGCGCAGCGC
TACTGGCCGC GGGCGGCGCG CGGGCTGTGG ATGCCGGCGC TGGGCGGGCT GTATTCGTCG
ACGGCCACCA CCGTGGTGCT GGCGCGGCAG GCGAATGCCG ACCCGGCCTC GCGACGGCAG
GCGCTGGCCG GGATCACGCT CGCCACCGGC ATCATGTATC TGCGCATTCT GGCGATCATC
GCGGTGTTCA ATCTGGCGCT GGCGCGCCAG CTCGTGGTGC CGATGGCCGG CCTCGCCGCC
TTGGCGCTGT CGATCGCCGC GCTGCAATAC TGGCTGATCA AGGCGCCGGC CGCCGAAGCG
CATGACGCAG CGGGGCGCGG CAACCCGCTC GAACTCGGCA CCGCCGCGGC GTTCGCGGCG
ATGTTCGTGC TGATCTCGCT GGCCTCGACC TGGGTGAAGA CGGAATTCGG CACCGAAGGC
ATCTATTGGC TGGCGGCGAT CGTCGGCTTT GCCGACATCG ATCCCTTCGT CCTCAATCTG
GCGCAGGGCG GCACCGCCGG GATCGGCGAC CACGCGGTCG CGATCGCGGT GCTGATCGCG
GCGTCGTCCA ACAACATCCT GAAGGCGACC TACGCGCTGT CGTTCGGCGG CCGCGCGACG
CTGCAGAGCG CGCTGATGCT GGTGATACTG GCCGGGATCG GCGTCGTGCT CGCTGTGCTG
CTCGCGCGCG GGACGCTCTG A
 
Protein sequence
MIATPSVHSL GLLLLLSFFL GFAFEDFFAK TSSARPGGIR TFPLLSLGGG ILYLFDPTHL 
IAFTGGLLVL GAWLAMFYGV HLRERDEKGE RNAGLVVLLL NVHAYLLGAV ALALPHWIAV
GVTVVAVLLL TGRDRLHTLA RRIDMKEITT AGQFLILTGV VLPLLPAEPV TTLTSITPRQ
AWLALTLVCT LSYASYLAQR YWPRAARGLW MPALGGLYSS TATTVVLARQ ANADPASRRQ
ALAGITLATG IMYLRILAII AVFNLALARQ LVVPMAGLAA LALSIAALQY WLIKAPAAEA
HDAAGRGNPL ELGTAAAFAA MFVLISLAST WVKTEFGTEG IYWLAAIVGF ADIDPFVLNL
AQGGTAGIGD HAVAIAVLIA ASSNNILKAT YALSFGGRAT LQSALMLVIL AGIGVVLAVL
LARGTL