Gene Bpro_1034 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBpro_1034 
Symbol 
ID4012155 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePolaromonas sp. JS666 
KingdomBacteria 
Replicon accessionNC_007948 
Strand
Start bp1060402 
End bp1061853 
Gene Length1452 bp 
Protein Length483 aa 
Translation table11 
GC content71% 
IMG OID637940712 
ProductN-formimino-L-glutamate deiminase 
Protein accessionYP_547885 
Protein GI91786933 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID[TIGR02022] formiminoglutamate deiminase 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0932501 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGACA TCGCCCCCTT CGCTGCCCCG ACATCCCATG CCCCAGCCAT GGGCAGCCTG 
TTTGCCACCG ATGCCCTGCT GCCCGACGGC TGGGCGCGCA ACGTGCTGCT GGAATGGAAC
GCCGCCGGCC AGCTGCTGGC CGTCACGCCG GACAGCAGCG CCCCGCGCAC CACGCCCCGG
GCCGCCGGCC CCCTCATCCC GGGCATGCCC AACCTGCATT GCCACGCCTT CCAGCGGGCC
TTTGGCGGAC TGACGGAGTT TCGCGCCGAG GCGCAGGACA GCTTCTGGAG CTGGCGCACG
CTGATGTACC GCTTCGCCGC GCAGGTCACG CCCGAGCTGC TGGAAGACAT CGCCACCTGG
CTTTACATCG AGATGCTGGA GGCCGGCTAC ACCTCGGTGT GTGAATTCCA CTACGTGCAC
CACGACCTGG ACGGCCGGCC CTATGCCGAC GACGCCACGC TGGCCCAATG CCTGTTGCGT
GCCGCACAGC GCGCCGGCAT CGGCATCACG CTGCTGCCCG TGCTTTACCA GACCAGCGGT
TTTGGCGGCA CGCCGCCAAA TGCGGGGCAA CGCCGCTTCA TCCGGTCTAC CGACTCGATG
CTGCGCCTGC TGGAGCGCCT GCAGCCTTGT TGCGAGGTGC AGGGCGCGCG CCTGGGGCTG
GCGCCGCACT CCCTGCGCGC GGTGCCGCCC GACAGCCTGC GCGAGGTGCT GGCCGGACTG
GACGCCATCG ACCCCACCGC GCCCATACAC ATCCATATTG CCGAGCAAAC GGCCGAGGTG
GATGCCTGCC TGGCCTGGAG CGGCCAGCGC CCGGTGGAGT GGCTCTTGGA CCACGCCGCC
GTCGATGCGC GCTGGTGCCT GGTGCACGCC ACGCACATGA CCGACACCGA ATATCAGCGC
GCCGCCCGCA CTGGCGCCGT GGCCGGCCTG TGCCCGACCA CCGAGGCCAA TCTGGGAGAC
GGCATTTTTG ACCTGCCGCG CTGGCGCGCC GCCGGGGGCG CCTGGGGCGT CGGCTCGGAC
AGCAACGCCT GCGTCAACGC GGCGGAAGAG CTGATGCTGC TTGAATATGG TCAGCGCCTG
CAGGGTCGCC AGCGCAATGT GCTAGCCACC GCGCAGCAAC CGCAGGTCGC CACCGCGATG
ACGCTGCAGG CCGTGCAGGG CGGCGCCCGT GCCTCGGGTC GTGCCTTGCC GCGCGGCACT
GCCGGACTGG TTACCGGCCA GCGCGCCGAT TTTGCGGTGC TGGACGCCCG GCACCCGGCC
TTGTGCGAGC TGAGCGCGCC CGACATGCTG TCGGCCCATG TGTTCGCCAG CCACCGCACG
TCCGCGCTCG ACGCGGTCTG GGTCGGCGGC GTTCAGCAAA CCCGCCAGGG CAGCCGCCAT
CCGCTGCGCG AGACGGCCGC CGCGGCCTTC ATCGCCGCCC GCTCACGCCT GCTGGCGCAA
ACCCAGGCCT GA
 
Protein sequence
MSDIAPFAAP TSHAPAMGSL FATDALLPDG WARNVLLEWN AAGQLLAVTP DSSAPRTTPR 
AAGPLIPGMP NLHCHAFQRA FGGLTEFRAE AQDSFWSWRT LMYRFAAQVT PELLEDIATW
LYIEMLEAGY TSVCEFHYVH HDLDGRPYAD DATLAQCLLR AAQRAGIGIT LLPVLYQTSG
FGGTPPNAGQ RRFIRSTDSM LRLLERLQPC CEVQGARLGL APHSLRAVPP DSLREVLAGL
DAIDPTAPIH IHIAEQTAEV DACLAWSGQR PVEWLLDHAA VDARWCLVHA THMTDTEYQR
AARTGAVAGL CPTTEANLGD GIFDLPRWRA AGGAWGVGSD SNACVNAAEE LMLLEYGQRL
QGRQRNVLAT AQQPQVATAM TLQAVQGGAR ASGRALPRGT AGLVTGQRAD FAVLDARHPA
LCELSAPDML SAHVFASHRT SALDAVWVGG VQQTRQGSRH PLRETAAAAF IAARSRLLAQ
TQA