Gene Bpro_5089 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBpro_5089 
Symbol 
ID4016128 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePolaromonas sp. JS666 
KingdomBacteria 
Replicon accessionNC_007949 
Strand
Start bp202161 
End bp203609 
Gene Length1449 bp 
Protein Length482 aa 
Translation table11 
GC content57% 
IMG OID637944726 
ProductIS4 family transposase 
Protein accessionYP_551858 
Protein GI91790907 
COG category[L] Replication, recombination and repair 
COG ID[COG3666] Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00232664 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGAGAT TCATTGTGGG TGCGGATCGG CAACAAGTCA CGCTGCTACC GGAGTGCTTG 
GACGACTTCA TCACTGAAGA CAACACCGTT CGGGTGGTTG ATGCGTTCAT AGGTGAGTTG
GATATGGTGG CCCTGGGTTT CGAGGGAGCA ACCCCGGCTG CAACGGGTCG ACCTTCGTAC
CACCCATCAG TGCTGTTGAA ACTCTACCTC TACGGTTACC TCAACCGCAT CCAGTCCAGT
CGCCGCCTGG AGCGCGAATG CCAACGAAAT GTGGAGCTGA TGTGGCTGAC CGGGCGCCTG
GCCCCTGACT TCAAGACCAT TGCGGACTTC CGTCGTGACA ACGGCAAAGG CATCCGCAAT
GTGTGTCGGC GCTTTGTGCT GTTATGCCGT GAGCTGAAGT TATTTAGCGA AGCGGTGGTG
GCCATCGATG GCAGCAAGTT CAAGGCCGTC AACAACCGGG AGCGCAACTA CACGCCCGGA
AAGATCGAGC GGCGTGAACG CGAACTCGAA GAAAGCATCC AGCGCTACCT GGACGCCCTG
GAAACAGCCG ACCGCACCCA ACCCACAGAG ATGCAGGCCA AGACAGAGCG CTTACAAGGC
AAGATCCAGA AGATGCGCCA ACGCATGCAA GACCTGCAGG CTGTCAAGGC ACAGCTAGAA
ACCCTACCGG ATCGACAACT CTCCCAGACC GACCCGGACG CACGGGCCAT GACCACCTAC
AGCGCCAAGG GCACAGCCAT GGTGGGCTAC AACATACAAA CAGCGGTAGA CACCAAGAAC
CACTTGATCG TGGCGCATGA AGTTACGAAC AACGGCAGCG ACAGGTCACA GCTGAGTCAG
ATCGCACTGG CTGCACGCGA AGCCATGGGC AAGCGAAAAT TGAAAGCTAT CGCTGACCGC
GGGTATTACA GTGGCACGGA GCTCAAGGCT TGTGAGGATG CAGGGATCGC AGCAATTGTT
CCCAAGCCCA TGACGTCAGG CGCCAGAGCC GAGGGGCGTT TCGATAAATC AGACTTCATC
TACATCGCCC GCGATGACGA GTACCAATGC CCTGCTGGGC AGCGTGCCAT CCACCGCTTT
ACAAGCGACG AGAGGGGAAT GCAGATTCGT ACCTACTGGA GCAGCTCGTG TATTGGATGC
GCAATCAAGG CGCAATGCAC TCCCAGTGAC TACCGGCGCA TAAGGCGCTG GGAGCATGAG
GCGGTGATGG ACAAGGTACA GCAGCGTCTG GATCGTATGC CAAAGGCCAT GACGGTGCGT
AAGAGCACTA TTGAACATGT CTTTGGAACG CTCAAGCACT GGATGGGCTG GACGCACTTC
CTCACGCGGG GCATGCACAA CGTGGCAACG GAAATGAGCT TGAGCGTGCT GGCTTACAAC
CTCAAACGTG TCATCCGCAT TCTTGGCTTT GCGAAGACGA TAAAGGCAAT GCAACTGGTG
GGTGCATAG
 
Protein sequence
MARFIVGADR QQVTLLPECL DDFITEDNTV RVVDAFIGEL DMVALGFEGA TPAATGRPSY 
HPSVLLKLYL YGYLNRIQSS RRLERECQRN VELMWLTGRL APDFKTIADF RRDNGKGIRN
VCRRFVLLCR ELKLFSEAVV AIDGSKFKAV NNRERNYTPG KIERRERELE ESIQRYLDAL
ETADRTQPTE MQAKTERLQG KIQKMRQRMQ DLQAVKAQLE TLPDRQLSQT DPDARAMTTY
SAKGTAMVGY NIQTAVDTKN HLIVAHEVTN NGSDRSQLSQ IALAAREAMG KRKLKAIADR
GYYSGTELKA CEDAGIAAIV PKPMTSGARA EGRFDKSDFI YIARDDEYQC PAGQRAIHRF
TSDERGMQIR TYWSSSCIGC AIKAQCTPSD YRRIRRWEHE AVMDKVQQRL DRMPKAMTVR
KSTIEHVFGT LKHWMGWTHF LTRGMHNVAT EMSLSVLAYN LKRVIRILGF AKTIKAMQLV
GA