Gene Rpal_1095 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_1095 
Symbol 
ID6408751 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp1160849 
End bp1161877 
Gene Length1029 bp 
Protein Length342 aa 
Translation table11 
GC content57% 
IMG OID642711001 
Producttransposase IS116/IS110/IS902 family protein 
Protein accessionYP_001990118 
Protein GI192289513 
COG category[L] Replication, recombination and repair 
COG ID[COG3547] Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGCATT ACGTCGGTTT GGATGTCTCG CAAAGAGAAA CTGCGGTGTG TGTGGTCAGC 
GAGATCGGGC AATTAGTCTT CGAGGGAAAG GCCAAGTCAG ATCCCGGCGC TCTGACCAAC
CTGCTTCACA AACATGCTCC GCTTGCGGAG CGCATTGGCT TTGAGACTGG CGCGATGGCA
AGCTGGCTTT GGCACGAGCT TCGTAGAGTC GAACTCCCTG TCGTTTGCAT CGATGCGCGA
CATGCAAACG CGGCCCTGTC GGTCCGTATG AACAAGAGCG ATCAAAATGA CGCTCGAGGC
CTAGCCGAAC TAGTGCGGGT CGGTTGGTAT CGAGAAGTCA AAGTTAAGAG CGAGAAAAGT
CAGAAGATCC GCGCGATGCT TGTAGCACGA TCCCGACTCG TATCGATGCG CCGGGACATT
GAGAACCAGG TCCGTAGTCT GATCAAAGAA TGTGGATTAC TATTCCCTCG CGCCATCGGC
CAACAGTTCC GCAATCGGGT CAGCGAGCTA TTGGGCGAGG ACCATCAGCT TGTCAGCGTG
GTCGCGCCGC TGCTGTCGAT TCATGAGCAC ATCTGTCTGC AGCAAGGCAA GTTCGACGAC
GAGGTTCGCC GATTGGCGAA GTCGGACGAA ACGACGCGAC GCCTGATGAC GGTTCCTGGC
GTCGGAGTAG TGACCGCCCT GACTTTCCGC CATACGATCG ATGACCCATC CCGCTTCCGG
TCGGCCTCGA CAGTCGGCGC CTATCTCGGT CTTACACCTC GGCGCAACCA ATCTGGGGAA
ACCGACACCA GTGGCAAGAT ATCTCGATGG GGCGATCGGC TGCTCCGAAC GTACCTGTTC
GAGGCGGCGA CCGTGCTGCT CTATCGGACT AAGAAATGGT CCTCCCTCAA GGCCTGGGGA
GTGAAGCTCG CGAAACGGAT AGGTATGAAG AAGGCGAAAG TCGCCATCGC CCGCAAGATC
GCCGTGATTC TTCACTGCAT CTGGGTCGAT GGCACATCGT TCGAGTGGGG TCAGGCAACG
CCGGCCTGA
 
Protein sequence
MKHYVGLDVS QRETAVCVVS EIGQLVFEGK AKSDPGALTN LLHKHAPLAE RIGFETGAMA 
SWLWHELRRV ELPVVCIDAR HANAALSVRM NKSDQNDARG LAELVRVGWY REVKVKSEKS
QKIRAMLVAR SRLVSMRRDI ENQVRSLIKE CGLLFPRAIG QQFRNRVSEL LGEDHQLVSV
VAPLLSIHEH ICLQQGKFDD EVRRLAKSDE TTRRLMTVPG VGVVTALTFR HTIDDPSRFR
SASTVGAYLG LTPRRNQSGE TDTSGKISRW GDRLLRTYLF EAATVLLYRT KKWSSLKAWG
VKLAKRIGMK KAKVAIARKI AVILHCIWVD GTSFEWGQAT PA