Gene Bpro_4072 details

Gene Information

Locus tag: Bpro_4072
Symbol:
ID: 4013244
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Polaromonas sp. JS666
Kingdom: Bacteria
Replicon accession: NC_007948
Strand:
Start bp: 4280358
End bp: 4281563
Gene Length: 1206 bp
Protein Length: 401 aa
Translation table: 11
GC content: 61%
IMG OID: 637943720
Product: enamidase
Protein accession: YP_550863
Protein GI: 91789911
COG category: [F] Nucleotide transport and metabolism
              [R] General function prediction only
COG ID: [COG0402] Cytosine deaminase and related metal-dependent hydrolases
TIGRFAM ID:
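
The coordinate and length fields above are mutually consistent, which can be checked with a short calculation. The sketch below is illustrative only: the function name `lengths_from_coords` is hypothetical, and it assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon (not counted in the protein length).

```python
def lengths_from_coords(start_bp, end_bp):
    """Return (gene_length_bp, protein_length_aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_length = end_bp - start_bp + 1   # inclusive span in base pairs
    codons = gene_length // 3             # complete codons, including the stop
    protein_length = codons - 1           # the stop codon encodes no residue
    return gene_length, protein_length


# Values taken from the record: Start bp 4280358, End bp 4281563
gene_length, protein_length = lengths_from_coords(4280358, 4281563)
print(gene_length, protein_length)
```

With the record's coordinates this gives 1206 bp and 401 aa, matching the Gene Length and Protein Length fields.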


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.815996
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0735984
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAGAAG CAGCATCGAC CGGCAAATCA GGAAAAGTCG TCATCAGGAA TATCGGCTTG 
CTGCTGTCCG GAGACATCGA CAAACCCATC CTGGATGCGG ACACCATTGT CGTGAACGAT
GGCCTGATCG TGGCTGTCGG CAAGGCCAAG GACTGCGACA TCGCGCATGC GCAGACCGTT
ATTGACGCGC ACCAGACCTG CGTCTGTCCG GGTTTGATAG ACAGCCATGT CCATCCGGTA
TTTGGTGACT GGACGCCACG GCAAAACCAG ATTGGCTGGA TCGACTCCAC CATGCACGGC
GGCGTCACCA CGATGATCTC GGCCGGAGAA GTTCATCTTC CCGGGCGGCC CAAGGACATT
GTTGGCCTCA AAGCGCTGGC CATTACGGCG CAGCGCGCGT TTGACAATTT TCGCCCCGGT
GGCGTCAAGG TTCTTGCCGG CGCACCCATC ATCGAAAAAG GCATGACCGA GCAGGATTTC
AAGGATTTGG CCGAGGCCGG CGTCACGCTG CTTGGCGAGG TCGGCCTGGG TTCCGTCAAG
GCGGGCTACG AGGCCAAGGA AATGGTGGGC TGGGCGCGCA AGTACGGCAT CCAGAGCACC
ATCCATACAG GCGGCCCCTC CATTCCCGGC TCCGGCCTGA TTGACAAGGA TGTGGTGCTT
GAAGCCGATG CCGACGTCAT TGGCCACATC AACGGCGGGC ACACGGCATT ATCGGAGGCG
CATGTCTGCG AGCTGTGCGA AAGGTCCTCT CGCGCCATCG AGATTGTCCA CAACGGCAAT
GAAAAAGTGG CGATCGCGGC CGCTCAAGCC GCGCTGCAGC TCAAATGTCC GCACCGTGTC
ATTCTGGGCA CCGATGGCCC GGCCGGATCA GGCGTGCAAC CCCTGGGCAT GTTGCGGCTC
ATCGCCCTGC TCTCAAGCCT GGGAAACATT CCGGCCGAAT TGGCGCTCTG TTTTGCCACC
GGCAATACCG CGCGCATTCG CAATCTCAAT TGCGGGCTGA TCGAAGTCGG TCGCGCCGCT
GACTTCGTGT TCATGGACAA GGCCCAGCAT TCTGCCGGGC TTGACCTCCT GGACAGCATT
CAATGCGGTG ACATTCCGGG GGTGGGCATG GTGATGATTG ACGGCATGGT GCGCTGCGGC
CGCAGCCGGA ACACCCCGCC GGCCACACAA ATCCCCGGTG TTCAACACCA CACCGTTCCC
GCCTGA
 
Protein sequence
MAEAASTGKS GKVVIRNIGL LLSGDIDKPI LDADTIVVND GLIVAVGKAK DCDIAHAQTV 
IDAHQTCVCP GLIDSHVHPV FGDWTPRQNQ IGWIDSTMHG GVTTMISAGE VHLPGRPKDI
VGLKALAITA QRAFDNFRPG GVKVLAGAPI IEKGMTEQDF KDLAEAGVTL LGEVGLGSVK
AGYEAKEMVG WARKYGIQST IHTGGPSIPG SGLIDKDVVL EADADVIGHI NGGHTALSEA
HVCELCERSS RAIEIVHNGN EKVAIAAAQA ALQLKCPHRV ILGTDGPAGS GVQPLGMLRL
IALLSSLGNI PAELALCFAT GNTARIRNLN CGLIEVGRAA DFVFMDKAQH SAGLDLLDSI
QCGDIPGVGM VMIDGMVRCG RSRNTPPATQ IPGVQHHTVP A
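
The gene and protein sequences above are printed in space-separated blocks of ten, so they need to be cleaned before use. The following is a minimal sketch, not the database's own method, of how the blocked text could be normalized, its GC fraction computed, and the gene translated. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table is built from the standard amino-acid assignments, which translation table 11 shares with table 1 (the two differ only in permitted start codons).

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G);
# translation table 11 uses the same amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First two blocks of the gene sequence from the record
print(translate("ATGGCAGAAG CAGCATCGAC"))  # MAEAAS
```

Pasting the full gene sequence into `translate` and comparing the result with `clean` applied to the protein sequence is one way to confirm that the two listings agree. `gc_fraction` can be compared against the GC content field in the same way.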