Gene RPB_4369 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_4369 
Symbol 
ID3912184 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp4953346 
End bp4955082 
Gene Length1737 bp 
Protein Length578 aa 
Translation table11 
GC content67% 
IMG OID637886275 
Producttwin-arginine translocation pathway signal 
Protein accessionYP_487967 
Protein GI86751471 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0737] 5'-nucleotidase/2',3'-cyclic phosphodiesterase and related esterases 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGGGCC GCGTCTGCGC GGACAAGGGA ACGGAAACGA TGATCTCGCG GCGGGAATTC 
CTGCAGGCCA CAGCGGCCGC ATCGGCGCTC ACGATCGGCA GCGGCCTCGG TCCGATTGGG
CGGGTGGCGG CGCAGCAGCG GCTGACGCAG GCCGACATTC TGAAGTTCGA TCCGCTCGGC
ACGCTGACGC TGCTGCACGT CACCGACATC CACGCCCAGC TGATGCCGCT GCATTTCCGC
GAGCCGTCGG TCAATCTCGG CGTCGGCGAG GTCAAGGGCA AGCCGCCGCA TCTGACCGAC
GCGGAATTCC GCAACTACTT CCACGTCGCC ACCGGATCGC CGGACGCCTT CGCGCTCACC
GCCGATGATT TCGTCGCGCT CGCCCGCAAT TACGGCCGGA TGGGCGGCAT GGACCGCATC
GCCACCTTGG TGAACGCGGT GCGCGCCGAG CGCGGCGCCG ACAAGGTGCT GCTGCTCGAC
GGCGGCGACA CCTGGCAGGG CAGCTGGACC TCGCTGCAGA GCAAGGGCCA GGACATGATC
GACGTCATGA CCGCGCTGAA GCTCGACGCG ATGACCGGCC ATTGGGAATT CACCTACGGC
GCCGAGCGGG TCAAGCAGGT CGCCGACTCG GCGCCGTTCG CCTTCCTGGC GCAGAACGTC
CGCGACAACG AATGGCAGGA GCCGGTGTTC GAGGCGCGCA AGATGTTCGA GCGCGGCGGC
GTCAAGATCG CGGTGATCGG GCAGGCGCTG CCGCGCACCG CGGTCGCCAA TCCGCGCTGG
ATGTTTCCGA ACTGGGAGTT CGGCATCCGC GAGGAGGACA TCCAGAAGCA GGCCGACGAC
GCCCGCGCCG AAGGCGCCGA GGTCGTGGTG CTGCTGTCGC ACAACGGCTT CGACGTCGAC
CGCAAGCTCG CCGGCCGGGT CAAGGGCCTC GACATCATCC TCACCGCCCA CACCCACGAC
GCGATGCCGG GCCTGATCAA GGTCGGCGAC ACCGTGCTGG TGGCGTCGGG CTCGCACGGC
AAATTCGTGT CGCGGCTCGA CATCGCGGTG AAGGGCAAGA AAGTGTCCGA CATCCGCTTC
AAACTGATGC CGGTGTTCGC CGACGCCATC GCGCCGGACC CGGCGATGAA GCAACTGGTC
GAGAAGCTGC GTGCGCCCTA CGCCAAGGAT CTCGCGCGCG TCGTCGGCAA GACCGATTCG
CTGCTGTATC GCCGCGGCAA TTTCAACGGC ACCTTCGACG ACCTGATCTG CGACGCGATG
CTGAAGCAGC GCGACACCGA GATCGCGCTG TCGCCGGGCT TCCGTTGGGG CGGCACGCTG
CTGCCGAACG AGGACATCAC CTGGGAGGCG ATCACCAACG CCACCGCGAT CACCTATCCG
AACTGCTACC GCAGCGAGAT GACCGGCGAG CAGCTCAAGA ACGTGCTCGA GGACATCGCC
GACAACATCT TCCACCCAGA CCCGTATTTC CAGGGCGGCG GCGACATGGT CCGCACCGGC
GGCATGGGCT ATTCGATCGA TATCGGCAAG GAGATCGGCT CGCGGATCTC CGGCATGGTG
CATCTCAAGA CCGGCAAGCC CATCGAGGCG TCGAAGACCT ACACCGTCTC CGGCTGGGCC
AGCATCAACC AGAACACCGA GGGCCCGCCG ATCTGGGACG TGCTGGCCAA GCACGTCGCG
CAGGCGGGGC CGGTGAAGAT CGATCCCAAC AGCGCCGTCA AGGTGTCGGG CGCCTGA
 
Protein sequence
MQGRVCADKG TETMISRREF LQATAAASAL TIGSGLGPIG RVAAQQRLTQ ADILKFDPLG 
TLTLLHVTDI HAQLMPLHFR EPSVNLGVGE VKGKPPHLTD AEFRNYFHVA TGSPDAFALT
ADDFVALARN YGRMGGMDRI ATLVNAVRAE RGADKVLLLD GGDTWQGSWT SLQSKGQDMI
DVMTALKLDA MTGHWEFTYG AERVKQVADS APFAFLAQNV RDNEWQEPVF EARKMFERGG
VKIAVIGQAL PRTAVANPRW MFPNWEFGIR EEDIQKQADD ARAEGAEVVV LLSHNGFDVD
RKLAGRVKGL DIILTAHTHD AMPGLIKVGD TVLVASGSHG KFVSRLDIAV KGKKVSDIRF
KLMPVFADAI APDPAMKQLV EKLRAPYAKD LARVVGKTDS LLYRRGNFNG TFDDLICDAM
LKQRDTEIAL SPGFRWGGTL LPNEDITWEA ITNATAITYP NCYRSEMTGE QLKNVLEDIA
DNIFHPDPYF QGGGDMVRTG GMGYSIDIGK EIGSRISGMV HLKTGKPIEA SKTYTVSGWA
SINQNTEGPP IWDVLAKHVA QAGPVKIDPN SAVKVSGA