Gene Rpal_5211 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_5211 
Symbol 
ID6412911 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp5622436 
End bp5623395 
Gene Length960 bp 
Protein Length319 aa 
Translation table11 
GC content67% 
IMG OID642715101 
Producttransposase IS116/IS110/IS902 family protein 
Protein accessionYP_001994174 
Protein GI192293569 
COG category[L] Replication, recombination and repair 
COG ID[COG3547] Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.313194 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGATTG CAGTGCCGCT TGCGGTTGCG GCCGGCGTCG ATGCCGGCCA ACGTTTCCTC 
GATGTCGCGC TGGCTCCGTC GGGCCAGACC TTCCGGATGC CTAATCTGGC CGAGGGCATC
GCCGAGATCA TCGCCCGCCT CGAACGCTCC GGCGTCAGGC GGGTGGTTCT GGAGGCGATC
GGCCCCTATG CCGAACCGCT GGTCAAAGCG CTTTCGGTCG CCGGCTTCGA AGTCGGTATC
GTCAATCCGC GCCGGATCAA GGCGTTCCGC GAGGCCGAAG GCAGTCGCGC CAAGACCGAC
AGGCTGGACG CGCGGCTGAT TGCGCGCTTC GCTCTGACTA TGCCCGAGAC CCTGCGCCCC
CTGCCGACCG ACACCCAACT CGAGCTGAAG GCGTTGTCGC TGCGACGCCG CCAGTTCACC
GAGATGATCG CGATGGAGAA GACCCGGATG AAGCAGGTCC GCGGCACGCT CCTGCTCGAT
AGTCATCGGG CCGCGATCGC CGCGTTGTCG GCGCAGTGCC AAGCCATCGA GGCCGAACTC
GCCAAGCGCA TCGGCGACGA CGCCGAACTC CGCCGCGTCC TTCACATCCT CAAATCGATC
CCCGGCATCG GCGAACGGGT GGCTACGCTC CTCATTACCG ACCTCCCCGA ACTCGGCCAG
CGCGATCGCA AGGCCATCGC CAGCCTCGCA GGGCTCGCGC CCCACGTCAG CCAATCCGGC
GCAGCGCCAC CTCGCGCCGC CATCGCCGGC GGGCGTCCTT GCGTTCGCGC CGCGCTCTAC
ATGGCCGCCC TGGTCGCCGC CCGCCACCAC CCCAAGCTCC GCGACGACTA CAACGCCCTT
CGCCTGCAAG GAAAACCCGG AAAGGTCGCG CTCATCGCAA TCGCCCGAAA ACTGCTCGTC
ACAGCCAACG CACTCGTCAA AGCCGACACG CCATATCAAG GTAAGACCCT TGACACATGA
 
Protein sequence
MTIAVPLAVA AGVDAGQRFL DVALAPSGQT FRMPNLAEGI AEIIARLERS GVRRVVLEAI 
GPYAEPLVKA LSVAGFEVGI VNPRRIKAFR EAEGSRAKTD RLDARLIARF ALTMPETLRP
LPTDTQLELK ALSLRRRQFT EMIAMEKTRM KQVRGTLLLD SHRAAIAALS AQCQAIEAEL
AKRIGDDAEL RRVLHILKSI PGIGERVATL LITDLPELGQ RDRKAIASLA GLAPHVSQSG
AAPPRAAIAG GRPCVRAALY MAALVAARHH PKLRDDYNAL RLQGKPGKVA LIAIARKLLV
TANALVKADT PYQGKTLDT