Gene Bpro_4010 details

Gene Information

Locus tag: Bpro_4010
Symbol:
ID: 4013338
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Polaromonas sp. JS666
Kingdom: Bacteria
Replicon accession: NC_007948
Strand:
Start bp: 4208999
End bp: 4210075
Gene Length: 1077 bp
Protein Length: 358 aa
Translation table: 11
GC content: 58%
IMG OID: 637943659
Product: NAD-dependent epimerase/dehydratase
Protein accession: YP_550802
Protein GI: 91789850
COG category: [G] Carbohydrate transport and metabolism; [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0451] Nucleoside-diphosphate-sugar epimerases
TIGRFAM ID: [TIGR02622] CDP-glucose 4,6-dehydratase
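
The coordinate and length fields in this record are related by simple arithmetic. The Python sketch below checks that relationship. It assumes the coordinates are 1-based and inclusive, and that the last codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the "Start bp" and "End bp" fields of this record.
start_bp = 4208999
end_bp = 4210075

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, gene_length % 3, protein_length)  # 1077 0 358
```

Under these assumptions the result agrees with the reported gene length (1077 bp) and protein length (358 aa).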


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.275251
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.168772
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACCCAG CCTTCTGGCA CGGCAGACGC GTCTTTCTCA CTGGGCACAC CGGCTTCAAG 
GGCAGCTGGA TGTCCCTGTG GCTTCAATCT CTAGGTGCGA ACCTCACAGG TTATGCTCTT
CAATCACCAA CCCAGCCCAG CCTGTTTGAT GAGGCTAAGG TGGGACTAGG CATGCGCTCC
ATCATTGGCG ACATTCGTGA CTTGGCATTC CTGCAAAAGG CTATGCAGGA ATGCCAGCCC
GAAATCGTCA TCCACATGGC GGCACAACCG TTGGTACGAT ATTCCTACGC AAATCCGGTG
GAAACCTATT CCACCAACGT AATGGGCACC GTGCATCTCC TCGAGACCGT ACGTCATGCT
CCCAGCGTCA AGGCCGTGGT CAATATCACC ACTGACAAAT GCTACGAAAA CCGCGAATGG
GCTTGGGGCT ATCGTGAAAA CGAGCCAATG GGCGGCTACG ACCCCTACAG CAACAGCAAG
GGTTGTGCTG AACTGGTCAG CTCTGCTTAC CGGTCTTCCT TCTTCAACGC GAACAGCCAT
GCACAACACG GCGTTGGCCT GGCTACCGTC CGAGCCGGCA ACGTCATCGG CGGGGGCGAT
TGGGCGCAGG ATAGATTAAT TCCCGACATC CTCGCTGCTT TCGAACAAGG CCAGCGCGTC
AACATCCGCA ACCCCCACTC CATCCGCCCT TGGCAACATG TGTTGGAGCC CTTGCGCGGC
TACCTCACGC TGGCCGAACG CCTTTTTGAG CACGGCCCCA GCTATGCCGA GGGCTGGAAC
TTTGGACCCA ACGATGAAGA TGCCAAACCG GTTGGCTGGA TCGTCGAGCA AATGGCTGCG
ATGTGGGGCG AGGGTGCACA GTCGCAAATC GACAACGGCG AGCATCCGCA CGAGGCGAAC
TACCTCAAGC TCGATATATC CAAAGCCCGC AGCCGCCTGG ACTGGCACCC CACGCTGCGC
CTGAACGATG CCTTGGCACT CATCATCGAA TGGTCCAAGC AGCGCCAGGC GGGTGCTGAC
ATCCGCGAGC TGACCTTGGC CCAGATACAC TCTTATCAAA CATTGACCGA AAACTGA
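
A GC content figure such as the one in the Gene Information section can be recomputed from the sequence text. The sketch below is one minimal way to do that in Python. The helper name `gc_fraction` is hypothetical, and the example runs on the first 60 bases only, so its output is not the whole-gene value.

```python
def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C among the A/C/G/T characters in seq."""
    # Keep only nucleotide letters; this drops the spaces and line breaks
    # used to group the sequence into blocks of ten.
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    return sum(b in "GC" for b in bases) / len(bases)


# First line of the gene sequence above (60 bases), not the full gene.
first_line = "ATGAACCCAG CCTTCTGGCA CGGCAGACGC GTCTTTCTCA CTGGGCACAC CGGCTTCAAG"
print(f"{gc_fraction(first_line):.0%}")  # 60%
```

Passing the full gene sequence to the same function would give the whole-gene fraction for comparison with the reported value.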
 
Protein sequence
MNPAFWHGRR VFLTGHTGFK GSWMSLWLQS LGANLTGYAL QSPTQPSLFD EAKVGLGMRS 
IIGDIRDLAF LQKAMQECQP EIVIHMAAQP LVRYSYANPV ETYSTNVMGT VHLLETVRHA
PSVKAVVNIT TDKCYENREW AWGYRENEPM GGYDPYSNSK GCAELVSSAY RSSFFNANSH
AQHGVGLATV RAGNVIGGGD WAQDRLIPDI LAAFEQGQRV NIRNPHSIRP WQHVLEPLRG
YLTLAERLFE HGPSYAEGWN FGPNDEDAKP VGWIVEQMAA MWGEGAQSQI DNGEHPHEAN
YLKLDISKAR SRLDWHPTLR LNDALALIIE WSKQRQAGAD IRELTLAQIH SYQTLTEN
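
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The Python sketch below illustrates this on the first line of the gene sequence. It builds the codon table from the standard NCBI ordering (bases in the order T, C, A, G); translation table 11 assigns the same amino acids as the standard table and differs only in which codons may act as start codons. The names `CODON_TABLE` and `translate` are illustrative, and this is not the database's own pipeline.

```python
from itertools import product

# Standard codon table in NCBI order: first base varies slowest, third fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    # Remove the spaces and line breaks used for display grouping.
    dna = "".join(b for b in seq.upper() if b in "ACGT")
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above (60 bases, 20 codons).
first_line = "ATGAACCCAG CCTTCTGGCA CGGCAGACGC GTCTTTCTCA CTGGGCACAC CGGCTTCAAG"
print(translate(first_line))  # MNPAFWHGRRVFLTGHTGFK
```

The output matches the first 20 residues of the protein sequence above.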