Gene RPB_1013 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_1013 
Symbol 
ID3909137 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp1161703 
End bp1162944 
Gene Length1242 bp 
Protein Length413 aa 
Translation table11 
GC content67% 
IMG OID637882906 
Producthypothetical protein 
Protein accessionYP_484634 
Protein GI86748138 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGACT TTGCCATCGC GGCCGCCCCC TCCCCGGCCA CGGCGCCGCG CCTGCGGGCG 
CTGCAGCTGG CGCTGCTGTG GTTCGTCGGC GCCAGCGGCG CCATCGTGTT CATCGAGCCG
AGCCCGTATG AATTCGCGAT CCTGCTGTCG ATCGTGGTGT TCTTCGCCTC CGGCCTGCGG
ATCACGCCGG CGCTGATCGT GCCGATCGCG CTCTTGATCG GCGTCGAACT CGGCTACACG
ATCGGCGCCA GCTCCCTGCT CGACGATCCG ATCATCCTGA ACTGGCTGCT GACCTCCTGG
TACATGGCGA TCACCGCGAT GTTCTTCGCG CTGGTGACGC TGGAGCACAC CGGCGACCGG
ATCGAGGCGC TGGCCAAGGG CTATCTGATC GGCGGGCTGA TCGCGTCGCT GGCCGGCATC
GCCGGCTATT TCAACCTGAT CCCCGGCACC ACGGACCTGC TGACCTATGC GGGGCGCGCC
CGCGGCACCT TCAAGGACCC GAACGTGCTC GGCGCGTTCC TGATCTTTCC GGCGGTCTAC
GCGCTGCAGC GGGTGATCGA GGGCTCGTTC TGGAGCGCGA TGCGCCATGC GATCGCCTTC
GGCATCATCG CGCTGGCGAT CTTTCTGGCG TTCTCGCGCG CCGCCTGGGG CACGCTCGCC
GGCGCCTCGA TGCTGATGAT CGCGCTGATG TTCGTCACCG CGCCGACGCA GCAGCGGCGA
TTGCGGATCG TGATGCTGGC GGCGATCGCC GGGCTGGTGC TGGTCGCCGC CATCGCGGTG
CTGCTGTCGT TCGACCGGAT CGACGCGCTG TTCAAGGAGC GCGCCAGCTT CTCGCAACCC
TACGACAGCG GTCGGTTCGG GAGGTTCGGC CGGCATCTGC TCGGCGCCGG CATGGCGCTG
GACTATCCGA CCGGAATCGG CCCGCTGCAG TTCCGGCGGT TCTTTCCCGA GGACACCCAC
AATTCGTTCC TCAACGCCTT CATGTCCGGC GGCTGGATCA GCGGCATCCT GTATCCGGCC
CTGGTGTTCA TCACCGCCGC CTACGGGCTG CGCAACGTTT TCGTCCGCAC GCCGTGGCAG
CGCACCTACA TCGCCATCGT GGCGACGCTG ATCGTGACGC TGCTGGAAAG CTTCATTATC
GATACCGATC ACTGGCGGCA TTATTTCATG CTGATCGGCT TGACCTGGGG CGTGGCAATT
GCGAGCAGTC GCCTCCGGTT GCAGAGCCAC GCCGGGCCCT GA
 
Protein sequence
MTDFAIAAAP SPATAPRLRA LQLALLWFVG ASGAIVFIEP SPYEFAILLS IVVFFASGLR 
ITPALIVPIA LLIGVELGYT IGASSLLDDP IILNWLLTSW YMAITAMFFA LVTLEHTGDR
IEALAKGYLI GGLIASLAGI AGYFNLIPGT TDLLTYAGRA RGTFKDPNVL GAFLIFPAVY
ALQRVIEGSF WSAMRHAIAF GIIALAIFLA FSRAAWGTLA GASMLMIALM FVTAPTQQRR
LRIVMLAAIA GLVLVAAIAV LLSFDRIDAL FKERASFSQP YDSGRFGRFG RHLLGAGMAL
DYPTGIGPLQ FRRFFPEDTH NSFLNAFMSG GWISGILYPA LVFITAAYGL RNVFVRTPWQ
RTYIAIVATL IVTLLESFII DTDHWRHYFM LIGLTWGVAI ASSRLRLQSH AGP