Gene Bpro_1847 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBpro_1847 
Symbol 
ID4015518 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePolaromonas sp. JS666 
KingdomBacteria 
Replicon accessionNC_007948 
Strand
Start bp1908699 
End bp1909676 
Gene Length978 bp 
Protein Length325 aa 
Translation table11 
GC content65% 
IMG OID637941516 
Productshikimate dehydrogenase 
Protein accessionYP_548678 
Protein GI91787726 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0169] Shikimate 5-dehydrogenase 
TIGRFAM ID[TIGR00507] shikimate 5-dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.127242 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.110669 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCATTCCA CCATCCAGGG CAGCACCGAT GTTTATTTGA TCTCCGGCGA TCCGGTGGAG 
CAGGTGCGTG CGCCCGAGGT GTTCAACCTG ATCTTCAGGA CGCTGGGTAT CAACGCGGTG
CTGGTACCTG TGCATGTGCC CCTCCAGGAC ATCGAGGCTT TCGTGCGCAC CGCTTTCCTG
GCCAAAAACA TCAAGGGCAT GCTGCTGACC ATTCCGCACA AATCGCGGGT GATGGGCCTG
CTTGCGCGTT GCAACGACGC TGGTCGCGTG GCGGGCGCGG TGAACGGCAT ACGCCGCAAC
GCAGCGGGCG AGCTGGAAGG CGCGCTGTTC GACGGCAAGG GTTTTGTCTT GTCGCTGGAC
TACTTCGGGG TGGCCTACGC CGGCAAGCGC GTGCTGATCC TGGGCGCCGG TGGCGCAGCT
GCCGCCATTG CCGTCTCGCT GGCGATGGCG GGTAGCCGCG CACCTGCCGG GATTGCACTC
TATGACCCGG CACCTGGCAA GGCGCAGGCC CTGGCAAGCC AGCTTGCTGC GGCGGCGCAG
ACGCCCCTGA AAAAGGTTGT GGCCGCGACC AGCAGCGACC CGTCGGCTTA TGACCTGGTG
GTGAACGCGT CACCGCTGGG GCTCAGGGCT GGCGATCCGC TGCCGTGCGA TGTGTCGCGG
CTGGCTCCGC ATGCGGCCGT GGTGGACATC CTGATGAAGA ACCAGCCGAC ACCGCTGGTC
CAGGCTGCCC GGGCACGGCA TCTGGTCGCA CAGCCCGGCT TTGAAATGCT GATCCAGCAA
GCACCGGATT ACCTGGCTTT TTTTGGCTAC GCCGAGGCGG CGCAAGCCGT TCGCAACAAC
GCCACCTTTA TTCGCGAACT TCTGTATCCG GCCGCCATGC AGGGTGAGAT CAACCGCCCA
GGCCAGCGCA TCCCCAGCTC CGAATTTGCC AGCGTTCCAG TCCCGGGAGT CGGGGAATCA
CGACTCCCGG CCCATTGA
 
Protein sequence
MHSTIQGSTD VYLISGDPVE QVRAPEVFNL IFRTLGINAV LVPVHVPLQD IEAFVRTAFL 
AKNIKGMLLT IPHKSRVMGL LARCNDAGRV AGAVNGIRRN AAGELEGALF DGKGFVLSLD
YFGVAYAGKR VLILGAGGAA AAIAVSLAMA GSRAPAGIAL YDPAPGKAQA LASQLAAAAQ
TPLKKVVAAT SSDPSAYDLV VNASPLGLRA GDPLPCDVSR LAPHAAVVDI LMKNQPTPLV
QAARARHLVA QPGFEMLIQQ APDYLAFFGY AEAAQAVRNN ATFIRELLYP AAMQGEINRP
GQRIPSSEFA SVPVPGVGES RLPAH