Gene Bpro_4149 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBpro_4149 
Symbol 
ID4013165 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePolaromonas sp. JS666 
KingdomBacteria 
Replicon accessionNC_007948 
Strand
Start bp4360669 
End bp4363428 
Gene Length2760 bp 
Protein Length919 aa 
Translation table11 
GC content62% 
IMG OID637943796 
Productperiplasmic sensor signal transduction histidine kinase 
Protein accessionYP_550939 
Protein GI91789987 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG0591] Na+/proline symporter
[COG5002] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.315835 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTACAAG GCTGGGTCAT CATCAGCGTG TCCTTTGCCT ACCTGGGCGT TCTGTTTGCC 
ATTGCCTACT ACGGCGACAA GCGCGCCGAT GCGGGACGCT CCATCATCGC CAATCCGGCC
ATTTACGCCT TGTCACTGGC GGTCTACTGC ACCACCTGGA CCTTCTACGG CAGCGTAGGA
CGGGCCGCTT CGTCAGGCAT CGGCTTTTTG CCCATCTACC TGGGCCCCAC ACTGATCGCC
GGCCTCTGGT GGTTTGTGAT GCTCAAGATC ATTCGCATCA GCAAGGCCAA CCGCATCACG
TCCATCGCTG ACTTCGTGGC CTCGCGCTAT GGCAAGAGCC AGCTGCTGGG CGGGATGGTG
ACCGTGATTG CGGTGGTCGG CGTTGTTCCC TATATCTCGC TGCAGCTGAA GGCCGTCTCC
AACAGTTTCA CGATCCTGCT GCACTACCCC GATATCGTCA TGCCCACCAA AGCCACCGCG
CAGGCCCTGT GGCAGGACAG CGCGCTGTAC ATCGCCATGA TCCTGGCCGC CTTCACCGTG
TTGTTCGGCA CCCGGCACCT TGATGCCACC GAGCGGCATG AAGGCATGGT GGCGGCAATT
GCGTTTGAAT CGGTGGTCAA GCTGCTGGCG TTCCTGGCCG TGGGCTTGTT CGTGACCTTT
GGCATTTTCA ACGGCATGGG CGACATCTTT CAGCGCGCCG ACCTGGTGCC CAAACTCAAG
TCACTCATGA CCGTGGCCGA CACCGGTAGC AGCTATGGCA GCTGGTGGTC GCTGACCATT
CTCTCCATGC TCTCGGTCAT GCTGCTGCCC CGGCAGTTCC AGATTGCCGT GGTGGAAAAC
GTCAATGAGC AGCACCTGCG CAAGGCGGTC TGGCTGTTCC CGCTGTACCT GTTGCTGATC
AATATCTTCG TGCTGCCGAT TGCCCTGGGT GGCATGCTGC ACTTCCCGGC GGGCAATGTG
GATGCCGACA CCTTCGTGCT GACCCTGCCG ATGGCCCAGC GGCACGAATC ACTGACCCTG
TTCGTGTTCA TCGGGGGCCT CTCCGCCGCC ACCGGCATGG TGATTGTGGA GACAATTGCG
CTGTCCACCA TGATCTGCAA CGACCTGGTG ATGCCCCTGG CGCTGCGTAT CAAGGCCTTG
CGCCTGAACG AGCGGCAGGA CCTCTCCGGC TTGCTGCTGG GCATACGCCG CTGGGCAATT
GTGGCCATCC TGTTGCTGGG CTACATCTAC TTCCGCGCCG CGGGTGACGC TTACGCGCTG
GTGAGTATCG GCCTGATCTC GTTTGCCGCG GTGGCCCAGT TTGCGCCGGC CATTTTTGGC
GGCATTTACT GGAAAGGCGG CACCCGCAAC GGCGCCATGG CCGGGCTGCT CGGGGGCTTC
GCGGTGTGGG GCTACACCCT GCTGCTGCCT TCGTTCGCCA AATCGGGCTG GTTGCCATCC
ACATTTCTGC GTGAGGGCCT GTTCGGACTG GAACTGCTGC GGCCCCAGCA ACTGTTCGGC
CTGTCCGGGC TGGACGAGAT TTCGCACAGC CTGTTCTGGA GCCTGCTGGT CAACATCGGC
TTCTATCTGG GCGTGTCTTT GCGTGGCCAG CCATCCGTGG TGGAAACACG GCAGGCCACG
GCTTTTGTCG ACGTGTTTCG CCACACCGCC AGCGCAGCGG ACGGTTCGCG CCTGTGGCGC
GGCAGTGCCC AGGTGCAGGA CCTGCTGCCG CTGATCGGCC GCTTCCTCGG GCCGGTGCGT
GCGCAGGAAG CCTTCCTGAC CTATGCACGG GGACGCGGCC TGTCATCCGT CGCGGAATTG
AAGGCCGACG CGGACCTGGT GCATTACGCC GAGACGCTGC TGGCCGGCGC CCTTGGCGGC
GCTTCCGCGC GGGTCATGGT GGCCTCGGTG GTGCAGGAGG AACCGCTGGG CATTGACGAG
GTGATGAACA TCCTTGATGA GGCCTCCCAG GTCCGAGCCT ACTCCCGGGA ACTCGAACTC
AAGTCGCAGG AGCTCGAAGC CGCCACCGCC AAGTTGCGGG CCGCCAATGA CCGCCTGAAG
GAACTCGATC GCATGAAGGA TGACTTCATG TCCACCGTGA CCCATGAGCT GCGCACGCCC
CTCACCTCCA TTCGCGCACT CTCGGAAATC CTGCTGGAAA CCCCCCAGGC AAGCCTGGCC
GAGCGCCAGA AGTTCCTGGG CATCATCGTC AAGGAAGCCG AGCGCCTGAC CCGGCTGATC
AATCAGGTGC TGGACATGGC CAAGATCGAA TCGGGCAATG CCGAATGGCA CACGTCGGAG
CTGGACCTGC GCGAAGTGAT CGAGGAATCA GTGGCGGCCA CCTACGCGAC GTTCAACGAC
CGGCATGTGA CACTGACGCA AGACCTGGCA TCCCACACAC CGCGCATTCG GGCTGACCGC
GACCGCCTGA TTCAGGTGAT GCTCAACCTG CTGTCCAATG CGGTCAAGTT CTGCCACCCG
ACCCGGGGCC AGGTGCATGT CTCACTGCGC CAGGAACCCG GCGGTCTGCG CGTGGACGTT
CGGGACAACG GCAGTGGCAT CAGTGCCGCC AATCAAAAAA TCATTTTTGA GCGCTTTCGC
CAGGTCGGCG ATACCATGAC CAACAAGCCC CAGGGTACCG GGCTGGGATT GCCGATCAGC
CGGCACATCA TCGAACGTTT CGGCGGTCGG CTGTGGGTGC AAAGCGAACC CGGCAACGGG
GCCACATTCT CGTTTACACT GCCCTTCAGT CAGGGCACCG GGGCCGCAGC GCCTCAATAG
 
Protein sequence
MLQGWVIISV SFAYLGVLFA IAYYGDKRAD AGRSIIANPA IYALSLAVYC TTWTFYGSVG 
RAASSGIGFL PIYLGPTLIA GLWWFVMLKI IRISKANRIT SIADFVASRY GKSQLLGGMV
TVIAVVGVVP YISLQLKAVS NSFTILLHYP DIVMPTKATA QALWQDSALY IAMILAAFTV
LFGTRHLDAT ERHEGMVAAI AFESVVKLLA FLAVGLFVTF GIFNGMGDIF QRADLVPKLK
SLMTVADTGS SYGSWWSLTI LSMLSVMLLP RQFQIAVVEN VNEQHLRKAV WLFPLYLLLI
NIFVLPIALG GMLHFPAGNV DADTFVLTLP MAQRHESLTL FVFIGGLSAA TGMVIVETIA
LSTMICNDLV MPLALRIKAL RLNERQDLSG LLLGIRRWAI VAILLLGYIY FRAAGDAYAL
VSIGLISFAA VAQFAPAIFG GIYWKGGTRN GAMAGLLGGF AVWGYTLLLP SFAKSGWLPS
TFLREGLFGL ELLRPQQLFG LSGLDEISHS LFWSLLVNIG FYLGVSLRGQ PSVVETRQAT
AFVDVFRHTA SAADGSRLWR GSAQVQDLLP LIGRFLGPVR AQEAFLTYAR GRGLSSVAEL
KADADLVHYA ETLLAGALGG ASARVMVASV VQEEPLGIDE VMNILDEASQ VRAYSRELEL
KSQELEAATA KLRAANDRLK ELDRMKDDFM STVTHELRTP LTSIRALSEI LLETPQASLA
ERQKFLGIIV KEAERLTRLI NQVLDMAKIE SGNAEWHTSE LDLREVIEES VAATYATFND
RHVTLTQDLA SHTPRIRADR DRLIQVMLNL LSNAVKFCHP TRGQVHVSLR QEPGGLRVDV
RDNGSGISAA NQKIIFERFR QVGDTMTNKP QGTGLGLPIS RHIIERFGGR LWVQSEPGNG
ATFSFTLPFS QGTGAAAPQ