Gene RPB_1020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_1020 
Symbol 
ID3909144 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp1169376 
End bp1170683 
Gene Length1308 bp 
Protein Length435 aa 
Translation table11 
GC content70% 
IMG OID637882913 
Producthypothetical protein 
Protein accessionYP_484641 
Protein GI86748145 
COG category[S] Function unknown 
COG ID[COG4325] Predicted membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAACGCAC GTCTCCGCAA ACTCTACGAC GATCTGTCCG ACACGTTCTG GCTGGTTCCG 
GCGCTGCTGG TGCTGCTCGG CACGGTCGCG GCGTTCGGCA TGATCGAGAT CGATCGCAGC
GGCACGGTGC CGGCCTGGCT GCTGGAGAAC TGGCTCTACA ACGGCGGCGC CACCGGCGCC
CGCACGCTGC TCGGCGCGGT GGCGTCGTCG ACCATCGGCG TCGCCGGCAC GGTGTTCTCG
ATCACCATCG CGGCGCTGTC GCTGGCCGCG GGGCAGATGG GGCCGCGGCT GCTGCGCAAC
TTCACCCGCG ACCGCGGCAA CCAGATCACG CTCGGCATCT TCCTCGGCAC GTTCTGCTAC
GCGCTGATCG TGCTGCGCAG CGTCCGCACC GCCGACGAGG GCGCCTTCGT GCCGCATCTC
GCGCTCGGCG TCGGCATCGC GCTGGGCTTC ATCTGCGTCG CCACGCTGGT GTATTTCGTC
GATCACATGG CGAGCCGGAT CAATGTCGAC ACCGTGATCG GGCTGGTCAG CGACGACGTC
CGCCGCGTGA TGCGCGGACT GACCGACGAC GCCCCGCAGC CGGAGCCGCC GCCGCCGGCG
CATTGGCGCG ACGCCGAGCC GATCCGCGAC GCGCGCGCCG GCTATCTGCA TCATCTCGAC
GAGAACGGGT TGGCGAGTTG GGCGTGCGAG CACGAGACGG AAATCCGGCT GCTGGTCCGC
CCCGGCGACT ACGTCTTTCC CGGCGCGCCG ATCGCGCTGA TCAAGCCGCC GCGCGACGGC
GCGGAAGAGG CCGTCCGCGA CGCGACGGCG CTCAGCCCGA CGCTGACCTC GTCGGACGAT
CTGCGCTTCG CGATCCGGCA ATTGGTCGAG GTCGCGGTGC GGGCGCTGTC GCCGGGCATC
AACGACCCGC ACACCGCGCT CAGCGTGCTC AACCGGCTCG GCGCCGCGCT GTGCGACATG
CAGCCGGTGC GCATGAAGAG CGGCGTCGTG GTCAAACAGG GCCGCCTCGC GCTGGTGGTG
CCGCATCTGC AATACGACGA GCTGGTCGAC GCGATGTTGC ACATGATCCG GCAGAACGCC
GCCGGCAAGC CCGCGGTGCT GATCGGAATG CTCGAGGTGC TGACGCAGGT GGCGAGCGTC
GAGCGCGATC CGCCGCGGCT CTCCAGCGTG CGTCGGCACG CCGACCTGGT GATGGGTGAC
GCCGAGCGCG ACGTCCGCGG GCCGGACGAT CTGGCCGACG TGCGCGCGCG GTATTGGGCG
TTCGTCGACA TGGCCGAACA CGGCCCGCTC GGCCCGTTCA AGCTGTAG
 
Protein sequence
MNARLRKLYD DLSDTFWLVP ALLVLLGTVA AFGMIEIDRS GTVPAWLLEN WLYNGGATGA 
RTLLGAVASS TIGVAGTVFS ITIAALSLAA GQMGPRLLRN FTRDRGNQIT LGIFLGTFCY
ALIVLRSVRT ADEGAFVPHL ALGVGIALGF ICVATLVYFV DHMASRINVD TVIGLVSDDV
RRVMRGLTDD APQPEPPPPA HWRDAEPIRD ARAGYLHHLD ENGLASWACE HETEIRLLVR
PGDYVFPGAP IALIKPPRDG AEEAVRDATA LSPTLTSSDD LRFAIRQLVE VAVRALSPGI
NDPHTALSVL NRLGAALCDM QPVRMKSGVV VKQGRLALVV PHLQYDELVD AMLHMIRQNA
AGKPAVLIGM LEVLTQVASV ERDPPRLSSV RRHADLVMGD AERDVRGPDD LADVRARYWA
FVDMAEHGPL GPFKL