Gene Bpro_1744 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBpro_1744 
Symbol 
ID4015607 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePolaromonas sp. JS666 
KingdomBacteria 
Replicon accessionNC_007948 
Strand
Start bp1803782 
End bp1804912 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content60% 
IMG OID637941416 
ProductRieske (2Fe-2S) region 
Protein accessionYP_548578 
Protein GI91787626 
COG category[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.517961 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGATC TAAGCCTTCG CCTGCTGCAG GCCACCAGCC AGCTTCCCAT CTCCAGCTAC 
TTTGATACCG GCCTGTACCA GCGGGAGCAG CAAAAACTGT TTGCCCGCGG GCCGCGCTAC
CTGGGCCATG AGCTCGCGGT GCCCAACCTC GGCGACTTTT ATGCGCTGCC ACACGAAGGT
GAAGGCCGCG CGCTGGTGCG CAACAGGTCG GGCATTGAAC TGATTTCCAA TGTCTGCAGA
CATCGCCAGT CGACCATGCT GCAGGGCCGC GGCTCTTTGG GCAACGGGGC TGACAGCAAT
ATCGTGTGCC CGCTGCACCG CTGGACTTAC AACACCTCGG GCGAGCTGAT CGGTGCGCCG
CACTTTGAGA TTGACCCCTG CCTGAACCTC AACCGCTACA AAACGACCAC CTGGAACGGC
CTGGTATTCG AAGACAACGG CCGCGATATC GCCGGCGAGA TGTCGCAACT GGCCACCCGG
GCCGACCTGG ATTTTGTCGG CTACCAGCTG GACAAGGTCC ACCTGCACGA ATGCAACTAC
AACTGGAAGA CCTTTATTGA GGTCTACCTT GAGGACTACC ACGTGGGGCC TTTCCATCCC
GGGCTGGGCG GATTTGTCAC CACGGAAGAT CTGCGCTGGG AACTGAAACC CAATTACTCG
GTGCAAACCG TGGGCGTGTC CGACAAACTG GGCAAGCCCG GCACAGACAT CTACAAAAAA
TGGCATGACG TGGTGCTGCA ATACCGCCAG GGCGTAGCCC CCAAATACGG CGCGATCTGG
CTCACCTACT ATCCGCATGT GATGGTCGAG TGGTACCCGC ATGCCCTGGT GGTCAGCACG
CTGCACCCGC AGGGGCCGGA CAAGACACTC AACGTGGTTG AATTTTTCTA CCCCGAGGAA
ATCTGCGCCT TCGAGCGCGA GTTCATCGAA GCGCAGCAGG CCGCCTACAT GGAAACCTGC
GTGGAGGACG ACGAAATCGC ACTGCGCATG GACGCCGGCC GCAAGGCGCT GATGCAGCGC
GGCGACAACG AGTTCGGCCC CTACCAGAGC CCCATGGAAG ACGGCATGCA GCACTTTCAC
GAGTGGTACC GCCGCGAAAT GGGCGCCAGC AAAACCACGC AGATGATCTG A
 
Protein sequence
MSDLSLRLLQ ATSQLPISSY FDTGLYQREQ QKLFARGPRY LGHELAVPNL GDFYALPHEG 
EGRALVRNRS GIELISNVCR HRQSTMLQGR GSLGNGADSN IVCPLHRWTY NTSGELIGAP
HFEIDPCLNL NRYKTTTWNG LVFEDNGRDI AGEMSQLATR ADLDFVGYQL DKVHLHECNY
NWKTFIEVYL EDYHVGPFHP GLGGFVTTED LRWELKPNYS VQTVGVSDKL GKPGTDIYKK
WHDVVLQYRQ GVAPKYGAIW LTYYPHVMVE WYPHALVVST LHPQGPDKTL NVVEFFYPEE
ICAFEREFIE AQQAAYMETC VEDDEIALRM DAGRKALMQR GDNEFGPYQS PMEDGMQHFH
EWYRREMGAS KTTQMI