Gene Rpal_3708 details


Gene Information

Locus tag: Rpal_3708
Symbol:
ID: 6411385
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 3967699
End bp: 3968658
Gene Length: 960 bp
Protein Length: 319 aa
Translation table: 11
GC content: 67%
IMG OID: 642713589
Product: transposase IS116/IS110/IS902 family protein
Protein accession: YP_001992683
Protein GI: 192292078
COG category: [L] Replication, recombination and repair
COG ID: [COG3547] Transposase and inactivated derivatives
TIGRFAM ID:
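
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values copied from the record above.
length = gene_length_bp(3967699, 3968658)
print(length, protein_length_aa(length))  # 960 319
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields in the record.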


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.251255
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGATTG CAGTGCCGCT TGCGGTTGCG GCCGGCGTCG ATGCCGGCCA ACGTTTCCTC 
GATGTCGCGC TGGCTCCGTC GGGCCAGACC TTCCGGATGC CTAATCTGGC CGAGGGCATC
GCCGAGATCA TCGCCCGCCT CGAACGCTCC GGCGTCAGGC GGGTGGTTCT GGAGGCGATC
GGCCCCTGTG CCGAACCGCT GGTCAAAGCG CTTTCGGTCG CCGGCTTCGA AGTCGGTATC
GTCAATCCGC GCCGGATCAA GGCGTTCCGC GAGGCCGAAG GCAGTCGCGC CAAGACCGAC
AGGCTAGACG CGCGGCTGAT TGCGCGCTTC GCTCTGACTA TGCCCGAGAC CCTGCGCCCC
CTGCCGACCG ACACCCAACT CGAGCTGAAG GCGTTGTCGC TGCGACGCCG CCAGCTCACC
GAGATGATCG CGATGGAGAA GACCCGGATG AAGCAGGTCC GCGGCACGCT CCTGCTCGAT
AGTCATCGGG CCGCGATCGC CGCGTTGTCG GCGCAGTGCC AAGCCATCGA GGCCGAACTC
GCCAAGCGCA TCGGCGACGA CGCCGAACTC CGCCGCGTCC TTCACATCCT CAAATCGATC
CCCGGCATCG GCGAACGGGT GGCTACGCTC CTCATTACCG ACCTCCCCGA ACTCGGCCAG
CGCGATCGCA AGGCCATCGC CAGCCTCGCA GGGCTCGCGC CCCACGTCAG CCAATCCGGC
GCAGCGCCAC CTCGCGCCGC CATCGCCGGC GGGCGTCCTT GCGTTCGGGC CGCGCTCTAC
ATGGCCGCCC TGGTCGCCGC CCGCCACCAC CCCAAGCTCC GCGACGACTA CAACGCCCTT
CGCCTGCAAG GAAAACCCGG AAAGGTCGCG CTCATCGCAA TCGCCCGAAA ACTGCTCGTC
ACAGCCAACG CACTCGTCAA AGCCGACACG CCATATCAAG GTAAGACCCT TGACACATGA
 
Protein sequence
MTIAVPLAVA AGVDAGQRFL DVALAPSGQT FRMPNLAEGI AEIIARLERS GVRRVVLEAI 
GPCAEPLVKA LSVAGFEVGI VNPRRIKAFR EAEGSRAKTD RLDARLIARF ALTMPETLRP
LPTDTQLELK ALSLRRRQLT EMIAMEKTRM KQVRGTLLLD SHRAAIAALS AQCQAIEAEL
AKRIGDDAEL RRVLHILKSI PGIGERVATL LITDLPELGQ RDRKAIASLA GLAPHVSQSG
AAPPRAAIAG GRPCVRAALY MAALVAARHH PKLRDDYNAL RLQGKPGKVA LIAIARKLLV
TANALVKADT PYQGKTLDT
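
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the sequence is read in frame from its first base, strips the spaces used to group bases in blocks of ten, and stops at the first stop codon. Translation table 11 assigns the same amino acids to codons as the standard code and differs in which codons may initiate translation; since this gene starts with ATG, the sketch does not handle alternative start codons. The names `CODON_TABLE` and `translate` are illustrative only.

```python
from itertools import product

# Codon table built in TCAG order; the last base varies fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # remove block spacing and newlines
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGACGATTG CAGTGCCGCT TGCGGTTGCG"))  # MTIAVPLAVA
# Last 9 bases of the gene sequence above.
print(translate("GACACATGA"))  # DT
```

Both fragments reproduce the corresponding ends of the protein sequence listed above; passing the full 960 bp sequence to `translate` works the same way.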