Gene Bpro_4233 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBpro_4233 
Symbol 
ID4013074 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePolaromonas sp. JS666 
KingdomBacteria 
Replicon accessionNC_007948 
Strand
Start bp4461603 
End bp4463099 
Gene Length1497 bp 
Protein Length498 aa 
Translation table11 
GC content67% 
IMG OID637943885 
ProductGntR family transcriptional regulator 
Protein accessionYP_551023 
Protein GI91790071 
COG category[E] Amino acid transport and metabolism
[K] Transcription 
COG ID[COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTGATGA AAACCGCTTC GCAATCCCTG ACGGAACAGC TCAGCGCACG CTTTGCCGAG 
CGCATTCGCA GCCGCCTGCT GGCACCCGGC GCGCGCCTGC CGTCGGTGCG GCTGTGTGCT
GAACAGCAGG GTGTGAGCGC GTCCACGGTG GTGGCCGCCT ATGACCAGCT GCTGGCGCAG
GGGCTGGTGG TGGCGCGCAA GAATCGCGGC TTTTTTGTCC GTGATGCATC GCTCAATGCG
GCGCTGGCCA CCGTTCATAA AGCGCCGTCA GCCGTAAAAA CCATGGCGAC CGCGACGGGC
GCCGCACCCG CAGCCGAAGA GCCGGCCCCG CTGAACAGCT GGAGCACCGC CACCTGGATG
GCCACGCGCC AGGCGCCCGT GGACGCCACC GCGCTGATTC GCGGCATGTT TCACAAGATC
AGCAACAAGC CGCAGCCCGG CATGGGCGTG TTTCCGCCGG ACTGGCTGGA AACCACCTTC
ATGCCGGCGG CGGTGCGCAA GGTCACCAGC GTCAGCGCGC TGCGGGATTT CTCGCTGCAA
TATGGCGAAC CCATGGGCGA CAGCGGCCTG CGCCGGGCGC TGTCGCAGAA GTTGAGCGCG
CTCAATGTGC ATGCCGTGCC CGAGCAGATC ATCACCACCG TCGGCGCCAC CCATGCGCTG
GATATTGTGA GCCGCACCCT GCTGCGCGCC GGCGACTGCG TGATGGTCGA AGAACCCGGC
TGGGCCGTGG AGTTTGCCCG GCTCGATGCC TTGGGCATGC GCATTCTGCC GGTGCCGCGC
CGCGCCGACG GGCCTGACCT GGAGGTGATG GCGCAGTACT GTGAAATCCA CCAGCCCAAA
CTGTTTGTCA GCGTCAGCGT GTTCCACAAC CCCACCGGCT ACTGCCTCAC GCCCGGCAGC
GCTCACCGTG TGCTGCAACT GGCCAACCAG CACAACTTTC ACATTGCCGA AGACGACACC
TACAGCCACC TGGCGCCCGA GCACGCCACC CGGCTGTGCG CCCTCGACGG CCTGCAGCGC
ACGATTTACG TCAGCGGCTT TGCCAAAATC CTCGCCCCCG GCTGGCGCGT CGGCTTCATG
GCCGCACCGC CCGATCTCGT CGAACGCCTG CTCGACACCA AGCTGCTGGC GACGCTGACC
ACGCCCGCCC TGCTTGAGAA AGCGCTGGCC TGGTGCATAG ACCAGGGCCA GCTGCGGCGC
CACGCCGAAC GCATACGCAC CCGGCTCGAC CAGGCGCGCG CGCGCAGCGT CAAGCTCGCG
CTGGCGCACG GCTGCACCTT TGCGGCCGAG CCGGCCGGCC TGTTTGGCTG GGTTGATACC
GGGGTAGACA CCGACGCGCT GGCGCAGCGC ATGCTCGACG AGGGCTACCT GATCGCCCCC
GGCGCGCTGT TCCATGCGGT GCGCAAGCCC AGCACCCTGA TGCGCATCAA CTTCGCCACC
ACGCAGGAGG CGGCATTCTG GAAGGTGTTT GCGCGCTTGC GGGATGGCAT GAAGTAA
 
Protein sequence
MLMKTASQSL TEQLSARFAE RIRSRLLAPG ARLPSVRLCA EQQGVSASTV VAAYDQLLAQ 
GLVVARKNRG FFVRDASLNA ALATVHKAPS AVKTMATATG AAPAAEEPAP LNSWSTATWM
ATRQAPVDAT ALIRGMFHKI SNKPQPGMGV FPPDWLETTF MPAAVRKVTS VSALRDFSLQ
YGEPMGDSGL RRALSQKLSA LNVHAVPEQI ITTVGATHAL DIVSRTLLRA GDCVMVEEPG
WAVEFARLDA LGMRILPVPR RADGPDLEVM AQYCEIHQPK LFVSVSVFHN PTGYCLTPGS
AHRVLQLANQ HNFHIAEDDT YSHLAPEHAT RLCALDGLQR TIYVSGFAKI LAPGWRVGFM
AAPPDLVERL LDTKLLATLT TPALLEKALA WCIDQGQLRR HAERIRTRLD QARARSVKLA
LAHGCTFAAE PAGLFGWVDT GVDTDALAQR MLDEGYLIAP GALFHAVRKP STLMRINFAT
TQEAAFWKVF ARLRDGMK