Gene Bpro_1021 details


Gene Information

Locus tag: Bpro_1021
Symbol:
ID: 4012142
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Polaromonas sp. JS666
Kingdom: Bacteria
Replicon accession: NC_007948
Strand:
Start bp: 1046913
End bp: 1048247
Gene Length: 1335 bp
Protein Length: 444 aa
Translation table: 11
GC content: 61%
IMG OID: 637940699
Product: Type I secretion membrane fusion protein, HlyD
Protein accession: YP_547872
Protein GI: 91786920
COG category: [V] Defense mechanisms
COG ID: [COG1566] Multidrug resistance efflux pump
TIGRFAM ID: [TIGR01843] type I secretion membrane fusion protein, HlyD family
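
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative only and are not part of the record.

```python
# Values copied from the record above.
start_bp = 1046913
end_bp = 1048247
gene_length_bp = 1335
protein_length_aa = 444

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: one codon per residue, plus a terminal stop codon that is not translated.
codons, remainder = divmod(span, 3)
expected_protein_length = codons - 1

print(span, remainder, expected_protein_length)  # 1335 0 444
```

Under these assumptions the span matches the listed gene length, divides evenly into codons, and gives the listed protein length.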


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.458838
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGAACA AGAGCGGCAC CCACTCAGCT GAACATGCCA TGTTCAGCAC GGCAGCCATT 
GCAATGCGGC AACAACCACC TGCCAGAGTG GCACGCATGG TCACCCTGGC CATCTGCGTC
ATGGCCGCCG CCGCACTGGC CTACGCCAGC CTGGCATCGA TGGATATTGT GGTCACGGCA
CAGGGGCGCG TCAGTGCCTC GGGCAAGAGC AAGGTCATCC AGCCGCTGGA AGCAGGAGTC
GTCAAGGCCA TTGCGGTCAG GGACGGCCAG TCCGTCAAAG CTGGCGACTT GCTCCTGGAG
CTCGACGCCA CCGCCACCCT GGCCGATCGT GACCGGCTCC AGCGTGAATT TTGGGAAACC
CAGGCGGACG TGTTACGGCT TAACGCCTTG CTATCGGGCA AGGTGGCTTG GGCGGAAGCG
CGCGATTTGC CGGTCGCGAT GGTTGCCAAT CAGCAGGCGG TGCTGGCCAG CCGGCGCAGT
GAGCAGGATG CCCGTGTAGC GGCGCTGGAC GCCGATATTG CGCGACGTAC AGCCGATCAT
GAGGCCATCT CGGCCAATAT CGCCCAGTTG CACAACAGCC TGCCGCTGGT GCGCAAGAAG
CACGAAATGC GCGAGGAACT GGCCACCACG GGCCATATCG CGCAGACGGG GCTGATCGAA
ACCCGGCTTG AACTGCTGGG CATGGAAAAA GACCTCTCCG TCATGGGCAA GCGGCTCAAT
GAATCTGCGG CAAGCCTTCA TGCTTCTGTC CAGCAGCGAA AACAGGCGCA GGCCGAGTTT
CGTGCCCGGG CCAGTGCCGA ACTGGTGGAC GCCATCCGCA AGCATGATGC AGCGCGCCAG
GAGCTGACCA AGGCCACCCA GCGTCGCGAC TTGCAAACCC TGCGCAGCCC GATTGACGGC
GTGGTACAGC AACTGGCAGT CACCACGGTC GGCGGTGTGG TTACTCAAGC GCAGGCACTG
ATGACCATCG TGCCACATCA CGCGGCCCTG GAAGTTGATG CCCAGATCAA TAATCGCGAT
ATCGGTCATG TCAAGGTCGG CCAGCGCGTG ATCAACAAGG TGGAGACCTT CGATTTCACC
CGTTTCGGAT ACATCGAAGG CATGGTGCAG TGGGTGGGCA CCGATGCCGT GATTGATCCC
AAGCTCGGAC CTGTCTACCC GGTGCGTATC AAGTTGAATT CGGTTGAGAC ACCGAACGTT
GTGAACGGTT TGCACGGAGC CGTCACGGCC GGCATGAGTG TGAGCTCGGA TATCCGCACG
GGTGAGCGCC GCATGATCGA ATACTTCATC GCACCCATGC TGCGCTACCA GCAGGAGGCC
TTGCGTGAAA GATAA
 
Protein sequence
MKNKSGTHSA EHAMFSTAAI AMRQQPPARV ARMVTLAICV MAAAALAYAS LASMDIVVTA 
QGRVSASGKS KVIQPLEAGV VKAIAVRDGQ SVKAGDLLLE LDATATLADR DRLQREFWET
QADVLRLNAL LSGKVAWAEA RDLPVAMVAN QQAVLASRRS EQDARVAALD ADIARRTADH
EAISANIAQL HNSLPLVRKK HEMREELATT GHIAQTGLIE TRLELLGMEK DLSVMGKRLN
ESAASLHASV QQRKQAQAEF RARASAELVD AIRKHDAARQ ELTKATQRRD LQTLRSPIDG
VVQQLAVTTV GGVVTQAQAL MTIVPHHAAL EVDAQINNRD IGHVKVGQRV INKVETFDFT
RFGYIEGMVQ WVGTDAVIDP KLGPVYPVRI KLNSVETPNV VNGLHGAVTA GMSVSSDIRT
GERRMIEYFI APMLRYQQEA LRER
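
The protein sequence can be spot-checked against the gene sequence by translating codons. The sketch below is a hedged illustration, not the method used to generate this record: it builds a codon table from the standard NCBI ordering (translation table 11 uses the same amino acid assignments as the standard code and differs only in permitted start codons, which are not modeled here), and the helper name `translate` is hypothetical. It translates only the first and last lines of the gene sequence shown above.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G; first base varies slowest).
# Assumption: alternative start codons of table 11 are not handled; this gene starts with ATG.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate in-frame DNA, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence above; both begin on a codon boundary.
first_block = "ATGAAGAACA AGAGCGGCAC CCACTCAGCT GAACATGCCA TGTTCAGCAC GGCAGCCATT"
last_block = "TTGCGTGAAA GATAA"

print(translate(first_block))  # MKNKSGTHSAEHAMFSTAAI
print(translate(last_block))   # LRER
```

The two results agree with the first 20 and last 4 residues of the protein sequence listed above.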