Gene Bpro_5533 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBpro_5533 
Symbol 
ID4016492 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePolaromonas sp. JS666 
KingdomBacteria 
Replicon accessionNC_007950 
Strand
Start bp288258 
End bp291248 
Gene Length2991 bp 
Protein Length996 aa 
Translation table11 
GC content63% 
IMG OID637945149 
Producthypothetical protein 
Protein accessionYP_552281 
Protein GI91791331 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGACTTAA TCAGCGTTGC CTTAAGCCCT ACGGCGACCT CAAGCCTGAG GTACGAGTGG 
CTCAGGTCAT TGGGCAAGAA GCAGGCACAG GATTTGCTCA ACGGAGACCA GACATTCCTG
GGCGAGCTTG GCGGCGGCTT GGGGTATGGG TACCTGAGTG CTGAGTTTGT AGGTGCGCTG
CAGGTGAGCC AACTGCTGCA AATCCGCCAA CCCGCGCAAT TGAGCAAGGA GGCACTGGCA
GGCCTGAGTG CAGAGCAGGC TAAGGGGCTG GTCGGGAACA TATCCTTTCT GTCCAACCTC
TCCAGCAACG CAGCAGGCCT GACGGCAAAG TTCCTGCAGG CGCTGAGTCT GTCGCAAGTG
GGCCAGATCA CCGCGAGCAT GAGCAGTGCG ATGACGTCCG ATCAGATCAA AGCGCTAGTG
TTCCAACCCA AGGACCACAA CTCCAACCCC AACGGCTACT TTCTGCTCAA CGCATTGAGC
GCAGCGGCGG CAGCGGCGTT GACGACAAAG GCGCTTGGCA ACCTGGACGG CTCGGATGCG
ATCTTGCTCA GGGTGCGGAC GGGGATGAGG ACGACAACGT TGCTGTCTAA CCTGCAGGGC
AACGCACAGG GGCTCACCGC CGCCTTCATA GGGTCACTGG ACACGGCGCA GCTGAACTTG
CTGACACCAA CTGCAGTGGC CGCGCTGACG CCATCGGCGC TGGGCGCGCT CGATGCAAAC
GATGCAGCGA ACTTGCAGCA GGCCGCAGGG CCGACGTTTG TGGCAAGGCT GCAAGCCAAT
GCCAGCGGCC TGAGTACGGC CTTCCTGGGG GCGCTCACCA CAGTGCAGAT CGCGGCGATC
AGCGCTACAG CCATGGCCGG CCTCACGGCT GCGCAGCTCG GCTCGTGGAC CACCACCCAG
ATCAAGGCGC TCACCACAGT GCAGATCGCC GCGATTACCA AGGCGGCCTT TAGCGGCCTG
ACGAAGGCGC AGCTCGACTC CTGGAGCACC GCCCAGATCG GCGCCCTGTC TGCTGCTGTG
GTGAGCGGTA TCACAACCGA TCAGTTCAAA GCCTGGTCAG GCGCGCAGAT CAGCGCACTG
TCTACTGATG CGGTGAGCGG CTTCACGGCC ACGCAGCTCG GCTCGTGGAG CGTGACCCAG
ATCGATGCAC TCAAGCCAGC GCAGATCGCG GCGATCACCC CCGCCGCCAT GGCCGGTCTG
ATGGCCGCGC AGCTCGGCTC GTGGAGCGTT AGCCAGATCG ATGCGCTCAA GCCAGCTCAG
ATCAACGCCC TCACGGTCGC CGCCATGGCC GGCCTCACGG CCACGCAGCT CGGCTCGTGG
ACCACCACCC AGATCAAGGC GCTCACCACA GTGCAGATCG CCGCGATTAC CAAGGCGGCC
TTTAGCGGCC TGACGAAGGC GCAGCTCGAC TCCTGGAGCA CCGCCCAGAT CGGCGCCCTG
TCTGCTGCGG CGGTCAGCGG TATCACAACC GATCAGTTCA AAGCCTGGTC AGGCGCGCAG
ATCAGCGCAC TGTCTGCTGA TGCGGTGAGC GGCTTCACGG CCACGCAGCT CGGCTCGTGG
AGCGTGACCC AGATCGATGC GCTCAAGCCA GCGCAGATCA ACGCCCTCAC GGTCGCCGCC
ATGGCCGGCC TCACGGCCGC GCAGCTCGGC TCGTGGACCC CCGTCCAGAT CAAGGCATTC
ACCACAGCGC AGATCCCGAA GATCACGGCC GCCGCCATGG CCGGCCTCAC GGCGGCGCAG
CTCGGCTCCT GGACCACCGA TCAGATTGGC GCCCTGACAG CGACCCAGGT GCCGTTCCTG
GGGGACGCGG TGCTCGCGGC CTCCGAAAGC CGGCTGAGCC CGGCGCAGCT GGGCTCGCTC
AGCCTGGCCC GCCTGGCCGC CGTAGCGCAA TTGCTGACGC CAGAGCGCCT GGCGCACTGG
AGCGAGATCG GCCTGCTCCC CTACCTGGTC CCTCTGATGG GTCAGCCCGC CTTTGACGCG
GTGGCCCTGA AGGCCCCGCA AGCGATGTTG GCCTATGCCC TGGCCGAGCA GATCCGCGGT
GCCTCCGGCG CCGCCCTGCA GGCGCTGGCA CCCTTGATCA ACACCGCCGA CAGCCTGGAG
CCGCTCAAAG CCCTGCTGCA GGCGCCCAAC ATCACCAAGG CGTCCTTTCT GGCCATTGGC
TATGACCTCG TTGCAGACGC CCTGCAGACC TATGATGACC TGGCTCTGTC GGTCACCGAG
CAGGGCTGGT TCAATGCCTC TGGCCATGGC CGCATCCTCA AGGACATTGC CAGCCTCACC
CAGGCCGATC TGGCCACCAA GAAGTACGAG TGGGTGGCCA AACACATCTC CGAGCTGAGC
CATACGCAGC GCTCCTGGCT CACCATCGGG CAACTGGTCG ATACTTCGCC GGCAACGATT
GATCTGTATG GTCAGGCGGA CAACTTTATT ACGGCACTGG CCCTGGCGAA GCCCAAGCTG
CAACAGCAGC TCACGAAATC GCCTTTGTAC AAACTCACTG ATGCGGAATT CCAGGCCCTG
GACGGCAAGC TCCTGTTGCA GGACATCGCT GTCCACATCA TCTTTACCAA TCCAGCGGCA
GGTACCTGGG ATTGGCTGCG CGGAAATGAC ACTAGCCAAA CCAACTCGCT TTTTGTGCTG
AGCGGCATGC TCGATGACCG CCACATCACG CTGATCACGG CCGAACAGTG GGTGGGCAAA
GATGCCGACG GCCAAACATT ACTCGACCGC ATGATTGGCA GCAACTTCAG CAGCGAGAAG
CGTCCTCATT ACCTGCCTCT AATTACCGCC GACACCTGGG CAGCTCCCGT GGCCGACCCC
GCCAACCCGG GCTCGACCAT CCCGCTGATT GATCTGCTGC CGGCAACGCC CGAGGTTATT
CACCTGCTGT CCGCACAGTT GCTGGGCACT TCGGTGATAG ATCCGTTTGA TTCGCTCCCC
ATCCTGGGCA GCGAGGACGG TGATGACAAC GATGCGGACG GTGATGCGTA A
 
Protein sequence
MDLISVALSP TATSSLRYEW LRSLGKKQAQ DLLNGDQTFL GELGGGLGYG YLSAEFVGAL 
QVSQLLQIRQ PAQLSKEALA GLSAEQAKGL VGNISFLSNL SSNAAGLTAK FLQALSLSQV
GQITASMSSA MTSDQIKALV FQPKDHNSNP NGYFLLNALS AAAAAALTTK ALGNLDGSDA
ILLRVRTGMR TTTLLSNLQG NAQGLTAAFI GSLDTAQLNL LTPTAVAALT PSALGALDAN
DAANLQQAAG PTFVARLQAN ASGLSTAFLG ALTTVQIAAI SATAMAGLTA AQLGSWTTTQ
IKALTTVQIA AITKAAFSGL TKAQLDSWST AQIGALSAAV VSGITTDQFK AWSGAQISAL
STDAVSGFTA TQLGSWSVTQ IDALKPAQIA AITPAAMAGL MAAQLGSWSV SQIDALKPAQ
INALTVAAMA GLTATQLGSW TTTQIKALTT VQIAAITKAA FSGLTKAQLD SWSTAQIGAL
SAAAVSGITT DQFKAWSGAQ ISALSADAVS GFTATQLGSW SVTQIDALKP AQINALTVAA
MAGLTAAQLG SWTPVQIKAF TTAQIPKITA AAMAGLTAAQ LGSWTTDQIG ALTATQVPFL
GDAVLAASES RLSPAQLGSL SLARLAAVAQ LLTPERLAHW SEIGLLPYLV PLMGQPAFDA
VALKAPQAML AYALAEQIRG ASGAALQALA PLINTADSLE PLKALLQAPN ITKASFLAIG
YDLVADALQT YDDLALSVTE QGWFNASGHG RILKDIASLT QADLATKKYE WVAKHISELS
HTQRSWLTIG QLVDTSPATI DLYGQADNFI TALALAKPKL QQQLTKSPLY KLTDAEFQAL
DGKLLLQDIA VHIIFTNPAA GTWDWLRGND TSQTNSLFVL SGMLDDRHIT LITAEQWVGK
DADGQTLLDR MIGSNFSSEK RPHYLPLITA DTWAAPVADP ANPGSTIPLI DLLPATPEVI
HLLSAQLLGT SVIDPFDSLP ILGSEDGDDN DADGDA