Gene Rpal_1039 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_1039 
Symbol 
ID6408695 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp1103274 
End bp1104764 
Gene Length1491 bp 
Protein Length496 aa 
Translation table11 
GC content65% 
IMG OID642710954 
ProductIntegrase catalytic region 
Protein accessionYP_001990071 
Protein GI192289466 
COG category[L] Replication, recombination and repair 
COG ID[COG4584] Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.0651609 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGAGACGG TCGCTCGGAT TCGGCGTGAG CATTTTCTCA AGGGCAAGAC GATCAAGGAG 
ATCGTCCGGG ACCTGAAGGT GTCGCGGAAC ACGGTCCGCA AAGTGCTGCG TTCCGGTGAG
ACGTCATTCG AGTATGAGCG CGAAGTTCAG CCGCGACCGA AGCTTGGGCG GTGGACGGCC
GAGCTGGATG AACTGCTCTC GACGAACGCC ACCAAGGCAG CTCGCGAGCA GTTGACGTTG
ATCCGGATCT TCGAGGAACT GCGCGGGCGC GGTTATGACG GCGGCTACGA TGCCGTGCGC
CGCTTCGCCC GGCGCTGGGC CAAGGAGCGC GGCCAGGCGA CGGCCGCAGC TTACGTACCG
CTGAGCTTCG CGCCGGGAGA AGCCTACCAG TTCGACTGGA GCCACGAGAT CGTCCTGTTT
GGCGGGGTGA CGACGATCGT GAAGGTCGCC CACGTCCGGC TCTGCCACAG CCGGATGTTG
TTCGTGCGGG CCTATCCGCG CGAGACCCAG GAGATGGTGT TCGACGCTCA TGACCGGGCG
TTCGCCTTGT TCAAGGGAAC CTGCGGACGC GGCATCTACG ACAACATGAA GACGGCGGTG
GAGACGATCT TCGTCGGCAA GGACCGTCTC TATAATCGCC GCTTCATGCA GATGTGCAGC
CACTACCTGA TCGAGCCGGT CGCATGCACG CCGGCGTCTG GCTGGGAGAA GGGTCAGGTC
GAGAACCAGG TCGGCCTGGT GCGTGAGCGA TTCTTCACGC CGCGGCTGCG TTTCAGGAGC
TACGACGAGT TGAACGCCTG GCTCACGGAC AAATGCATCG CCTACGCCAA AGCCCATCGC
CACCCAGAGC TGACCGAGCA GACGATCTGG GAGGTGTTCG AAGCCGAGCG ACCAAAGCTC
GTTCCCTATG CCGGCCGGTT CGATGGATTC CACGCGGTGC CGACCTCGGT CTCGAAGACC
TGCCTGGTGC GCTTCGACAA CAACAAATAC TCGGTCGCCG CCAGCGCGGT CGGTCGACCG
GTCGAGGTGC ATGCTTATGC CGACCGCATC GTCATCCGCC AGGACGGCCG CGTCGTTGCC
GAACATCCTC GCTCGTTCGG TCGCGGCGAG ACCACCTACG ATCCCTGGCA TTACGTTCCC
GTGCTGGCGC GCAAGCCGGG CGCCTTGCGC AACGGCGCGC CGTTCAAGGA TTGGGTGCTA
CCGGCAGCGA TGGAACGCGT CAGGCGCAAG CTTGCCGGTG TTGCCGACGG CAACCGGCAG
ATGGTCGATA TCCTCAATGC GGTGCTGACC GATGGCCTGG CGGCGGTCGA AGCCGCCTGT
GTCGAGGCGA TCGCGCACGG CGTCCATTCC GCCGACGTCA TCCTCAACAT CCTCGCTCGC
CGGCGCGATC CAGCGCCGCC GGCCAACATC CTCACCCCCG CGGCGCTGGC GCTGCGTTAC
GCGCCCATCG CCGATTGTGC CCGCTACGAC AACCTCCGGA GGATGGTCTG A
 
Protein sequence
METVARIRRE HFLKGKTIKE IVRDLKVSRN TVRKVLRSGE TSFEYEREVQ PRPKLGRWTA 
ELDELLSTNA TKAAREQLTL IRIFEELRGR GYDGGYDAVR RFARRWAKER GQATAAAYVP
LSFAPGEAYQ FDWSHEIVLF GGVTTIVKVA HVRLCHSRML FVRAYPRETQ EMVFDAHDRA
FALFKGTCGR GIYDNMKTAV ETIFVGKDRL YNRRFMQMCS HYLIEPVACT PASGWEKGQV
ENQVGLVRER FFTPRLRFRS YDELNAWLTD KCIAYAKAHR HPELTEQTIW EVFEAERPKL
VPYAGRFDGF HAVPTSVSKT CLVRFDNNKY SVAASAVGRP VEVHAYADRI VIRQDGRVVA
EHPRSFGRGE TTYDPWHYVP VLARKPGALR NGAPFKDWVL PAAMERVRRK LAGVADGNRQ
MVDILNAVLT DGLAAVEAAC VEAIAHGVHS ADVILNILAR RRDPAPPANI LTPAALALRY
APIADCARYD NLRRMV