Gene RPD_3936 details


Gene Information

Locus tag: RPD_3936
Symbol:
ID: 4024452
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 4375060
End bp: 4376091
Gene Length: 1032 bp
Protein Length: 343 aa
Translation table: 11
GC content: 61%
IMG OID: 637964140
Product: transposase IS116/IS110/IS902
Protein accession: YP_571058
Protein GI: 91978399
COG category: [L] Replication, recombination and repair
COG ID: [COG3547] Transposase and inactivated derivatives
TIGRFAM ID:
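
The coordinate and length fields above are related by simple arithmetic. The following is a minimal Python sketch of that relationship, assuming the start and end coordinates are inclusive (the record does not state the convention) and that the unspliced CDS ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming inclusive start/end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(4375060, 4376091)
print(length)                     # 1032
print(protein_length_aa(length))  # 343
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.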


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAGAAGA TTACCACAAT CGGACTAGAC TTGGCCAAGT CCGTCTTTCA GGTTCACGCA 
GTTGCCGAAG ACGGTCGCGT CATGGTTCGT CGCGCGCTGC GGCGGTCGCA GTTATTGGAC
TTTTTTCGAT CGCTGGAACC GTGCCTCGTT GGCCTGGAGG CTTGCGCAAG TTCGCACTTT
TGGGCCAACG CTATTGGGCA ACTCGGTCAT ACAGTTAGGA TGATGCCGCC GGCCTACGTG
AAGGCCTATG TCAAACGCAA CAAAACAGAC GCCGCCGATG CCGAGGCGAT CTGTGAGGCG
GTGACGCGAC CGACTATGCG CTTCGTGCCG ATCAAATCGC CGGAAGAGCA GGCGGCGGGA
ATGGTCCTGA AGACACGGGA GCTGTTTGTG CGTCAGCGGA GCCAGACGGC GAACGCGATG
CGCGCTCACA TGGCCGAGTT GGGCATCGTA GCCGCAACCG GAATGACCAG CATCGCCAAA
CTCGTCGCCA TTCTCCGTGG CGGTGACGAT GACCGCCTTC CATCTGCAGC TCGAGCAGCC
CTCCTGGAGA TGGCCGAGCA GATCGAGAGA CTGACGGCCC GTATCGAAGC GCTCGACACG
AAAATCATGG CGGCGGTGAA GAATGACGAA GCCGCTCGAC GGCTCACCAC CATCCCCGGC
GTCGGTCCGA TCATCGCCGC GACGGTCCGG GCAACGATCC AGGATCCAGC AGCCTTCCGA
ACGGGACGCG ATCTGGCGGC TTGGATCGGG ATTACACCGA GGGCCAACTC CAGCGGCGGC
AAAGAGCGGC TCGGCCGAAT ATCGAAGCAA GGCAACAAGC AGTTGCGAAC GCTGCTCATC
GTCGGCGCGA CGTCGATTCT GAAGCAGGCA AGTCGTGGCG TGAATCTGCC CGCCTGGGTG
TTATCCTTGA TGGTGCGTCG GCCCTACAAG GTTGCAGCCG TGGCGCTGGC CAACAAGATG
GCGCGCACGA TCTGGGCGCT TCTCGTCAAG GGCGGAACTT ACCAGGCGCC AGCAATCATG
GCGCGAGCAT AG
 
Protein sequence
MEKITTIGLD LAKSVFQVHA VAEDGRVMVR RALRRSQLLD FFRSLEPCLV GLEACASSHF 
WANAIGQLGH TVRMMPPAYV KAYVKRNKTD AADAEAICEA VTRPTMRFVP IKSPEEQAAG
MVLKTRELFV RQRSQTANAM RAHMAELGIV AATGMTSIAK LVAILRGGDD DRLPSAARAA
LLEMAEQIER LTARIEALDT KIMAAVKNDE AARRLTTIPG VGPIIAATVR ATIQDPAAFR
TGRDLAAWIG ITPRANSSGG KERLGRISKQ GNKQLRTLLI VGATSILKQA SRGVNLPAWV
LSLMVRRPYK VAAVALANKM ARTIWALLVK GGTYQAPAIM ARA
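
The sequences above are printed in whitespace-separated blocks of ten, so they need light cleaning before any check can be run on them. The following is a minimal Python sketch, assuming the text is pasted in as a plain string. `clean_sequence`, `gc_fraction`, and `ends_with_stop` are hypothetical helper names, and the stop codons listed are those of translation table 11 (TAA, TAG, TGA).

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# Final block of the gene sequence, copied from the record.
last_block = clean_sequence("GCGCGAGCAT AG")
print(ends_with_stop(last_block))  # True
```

Passing the full gene sequence through `clean_sequence` and then `gc_fraction` gives a value that can be compared against the GC content field.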