Gene RPC_2070 details


Gene Information

Locus tag: RPC_2070
Symbol:
ID: 3974028
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB18
Kingdom: Bacteria
Replicon accession: NC_007925
Strand:
Start bp: 2266391
End bp: 2267500
Gene Length: 1110 bp
Protein Length: 369 aa
Translation table: 11
GC content: 63%
IMG OID: 637925178
Product: transposase, IS4
Protein accession: YP_531943
Protein GI: 90423573
COG category: [L] Replication, recombination and repair
COG ID: [COG5433] Transposase
TIGRFAM ID:
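
The coordinate and length fields in this record agree with one another, and a short check can confirm it. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon. The function names are illustrative only and are not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids for a CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(2266391, 2267500)
print(length)                  # 1110
print(protein_length(length))  # 369
```

Both values match the Gene Length (1110 bp) and Protein Length (369 aa) fields listed for RPC_2070.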


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.361734
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAGCAGC CGATGGATCG ATTTGCGGAG TGCTTCGAAG ACCTGCCCGA CCCGCGGGCG 
GGCAATGCGT TGCACGATCT GACCGAGATC TTGTTCATTG CCCTGATGGC GACGCTGTGC
GGGGCGACCA GTTGCACCGA CATGGCGCTG TTTGCGCGGA TGAAGGCCTA TCTTTGGCGG
GATGTGCTGG TCCTGAAGAA CGGCCTTCCG AGCCACGACA CGTTCAGTCG GGTGTTCCGC
ATGCTGGACC CGGAGGCGTT CGAGAAGGCG TTCCAACGCT TCATGAAAGC CTTTGCCAAA
GGCGCCAAGA TCAAGCCGCC GAAAGGGGTG ATCGCCCTCG ACGGCAAGGC GCTGCGGCGC
GGCTACGAAA GCGGCAGAAG CCACATGCCG CCCGTGATGG TGACGGCCTG GGCGGCGCAG
ACCCGCATGG CGCTGGCCAA TGTGCAGGCC CCGAACAACA ACGAAGCCGC CGGTGCCTTG
CAACTGATCG AACTTCTGCA GCTCAAAGGC TGCGTCGTGA CGGCCGATGC GCTGCATTGC
CATCGTGGCA TGGCCGAAGC GATCAAGGCC CGGGGCGGCG ATTATGTGCT GGCCGTGAAG
GACAACCAGC CAGCGCTGAT GCGGGATGCG AAGGCGGCAA TCCGCGCCGC CACGCGCCAG
GGCAAGCCAT CGACGATCAC CGTCGATGCC GGTCATGGAC GCAAGGAAAA GCGCCGTGCT
GTCGTCGCCG CTGTCCCGCA GATGGCGCAA GACCACGACT TTGCCGGGCT CAAAGCGGTG
GCCAGGATCA CCAGCAAGCG CGGCACCGAC AAGACCGTCG AGCGTTACTT TCTGATGAGC
CAGGCCTATC CCCCCAAAGA CGTCCTGCGC ATCGTCCGGA CCCACTGGAC CATCGAAAAC
AGCCTGCATT GGCCGCTCGA CGTCGTGCTC GACGAGGACT TGGCGCGCAA TCGCAAGGAC
AACGCCCCCG CCAACCTCGC CGTGCTCAGA CGCCTGGCCC TCAACGTCGC AAGGGCACAT
CCAGACAACA CCACATCGCT GCGTGGAAAG CTGAAACGTG CAGGATGGAA CGATACGTTC
CTCTTCGAAC TCATCCAACA CATGCGATAG
 
Protein sequence
MEQPMDRFAE CFEDLPDPRA GNALHDLTEI LFIALMATLC GATSCTDMAL FARMKAYLWR 
DVLVLKNGLP SHDTFSRVFR MLDPEAFEKA FQRFMKAFAK GAKIKPPKGV IALDGKALRR
GYESGRSHMP PVMVTAWAAQ TRMALANVQA PNNNEAAGAL QLIELLQLKG CVVTADALHC
HRGMAEAIKA RGGDYVLAVK DNQPALMRDA KAAIRAATRQ GKPSTITVDA GHGRKEKRRA
VVAAVPQMAQ DHDFAGLKAV ARITSKRGTD KTVERYFLMS QAYPPKDVLR IVRTHWTIEN
SLHWPLDVVL DEDLARNRKD NAPANLAVLR RLALNVARAH PDNTTSLRGK LKRAGWNDTF
LFELIQHMR
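
The gene and protein sequences can be checked against each other by translating the nucleotide sequence. The sketch below is a minimal, dependency-free translator, not the tool used to produce this record. It relies on the fact that translation table 11 assigns the same amino acid to each codon as the standard genetic code, differing only in which start codons are permitted. The names `CODON_TABLE`, `clean`, and `translate` are hypothetical helpers introduced for illustration.

```python
# Codon table built from the conventional compact layout (base order T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence into blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks and the final line of the gene sequence listed above.
print(translate("ATGGAGCAGC CGATGGATCG ATTTGCGGAG"))  # MEQPMDRFAE
print(translate("CTCTTCGAAC TCATCCAACA CATGCGATAG"))  # LFELIQHMR
```

The two outputs match the first ten and last nine residues of the protein sequence, and the final codon TAG is the stop codon. Passing the full gene sequence to `translate` would allow the same comparison over all 369 residues.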