Gene Plav_1850 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlav_1850 
Symbol 
ID5455710 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameParvibaculum lavamentivorans DS-1 
KingdomBacteria 
Replicon accessionNC_009719 
Strand
Start bp2007271 
End bp2010111 
Gene Length2841 bp 
Protein Length946 aa 
Translation table11 
GC content61% 
IMG OID640877429 
Productmulti-sensor hybrid histidine kinase 
Protein accessionYP_001413124 
Protein GI154252300 
COG category[T] Signal transduction mechanisms 
COG ID[COG3300] MHYT domain (predicted integral membrane sensor domain) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0866059 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value0.191772 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTGAGGG TCATTGGCTG CATCACTCAG GAACACGATC TAGGGCTCGT CGTTTTGGCG 
GGCGGCTTGT GCCTGTTCGC CTGCTTCACC GCCATGAACA TGCTGGGCAG GGCCAAACTC
GCGCAGGGCC GAGTGCGGAC GCTCTGGCTG ATGGCGGCAG GGGGCGTAGC AGGAAGCGGC
ATCTGGGCGA CGCATTTCGT CGCGATGCTT GCGTTCAAGT CCATCCTGCC CGTGAACTAC
GAGATTGGCT TGACCTTCTT CTCCGCCTTG ATCGCCATCG TGTTGTGCGG GGTGGGGTTT
GCGATTGTTC TCGGGAGGCC CGGCCCCTTG GCCGGGGGTG CCCTGATCGG TGCGGCCATC
AGCGCCATGC ATTTCACGGG CATGACCGCC GTCCGGATTC CGGCCGATCC CGTCTGGGAT
ATGAACTATG TCGTCGCGGC CGTCGTCATC GGCGTACTGA CGAGCGCGGG CGCCATGCAT
GCCGCCATGC GCGTTCCTGC ACTTGGTGGC ATCTTGGTGG GATCGCTTCT CTTCACCATC
GCGATTTGCG CCACACACTT CACGGCAATG ACGGCGCTCA CCTATGTGCC GGACCCGACC
ATCGTCGTAA CGGATGTTGT CGCGGAACCC GGCATGATCG CCATCGCGAT CGCGACCGTT
ACATTCCTGA TCATTGCGCT CGGCCTGGTC GGCGCCTTGG TGGACAATTA TCTGGCTCAA
CGCGCGTCTG GTGAAGCGGA ACGCATGCGC GTCTATATCG CCGAACTCGA AGCGACAAAG
CAGGAATTGA TCGCCGCCAA GGATCAGGCT GAAGCGGGAA ATCGCGCCAA GTCCGACTTT
CTCGCCAATA TGAGCCATGA GATTCGTACG CCCATGAACG GCGTACTGGG CATGACGGAA
TTGCTGCTCA CCACGGCGTT GAACGCCGAG CAGCGAAAAT TCGCCGAGAC CGTTCGAGAG
TCCGGGGAAG CCTTGCTGAC CATCGTCAAC GATATTCTCG ATGTGTCCAA GCTGGAGGCC
GGTCGTCTCG AGCTCGATCA TATCGACTTC GATCTGGTCA ACACGGTCGA AAGCGCGATC
AGCCTGATGG CGGCGCGGGC GGCGGAAAAA CAGATCGATC TCGGCGCCTA TATCGAGCCG
GCCGCGCGGG GCATTTACCG CGGCGATCCC GCGCGTCTGC GCCAGGTCCT GTTGAATCTC
ATCGGCAACG CCATCAAGTT CACCGAGAAG GGCGGCGTCG CCGTGCGGGT GTCGGTTTAC
CGCGTCGAGG ATCCCCAGAC AAAGGCGTCT CATCTGCGTT TTGAAGTGCA TGACACGGGC
ATTGGCATTC CCGAGGACGT CTGCGGCAGG TTGTTTCAGA AATTCAGCCA GGCTGACAGC
TCCGTTACAC GGCGCTATGG CGGCACCGGC CTGGGTCTTG CGATCTGCAA GCAGATCGTC
GAATTGATGG ATGGCAATAT CGGCGTCAAC AGCCGGGTCG GGGCGGGTTC GACGTTCTGG
TTCCAGTTAT CGCTTGTACG CTCGGCCGCC CACCTGCCCG ATCTCAGCCG CCTGCCTGTC
TATCTCGAGA AGCTCAAAGT GCTGGTCGTC GATGACGTTC CAGTGAACCT CGACGTCCTG
ACCTCTCAAC TGGGCACCTA TGGCATTACC GTCACCAGAG CCGAAGATGG TTTTGCCGCG
CAGGCCGAGC TGGAAAGGGC ATGGCATGCC GGTCATCCAT ATGACATTGC CTTCCTCGAT
CAGATGATGC CCGGCATCTC GGGCGAAGAC CTTGCCTCGC GCATTCGCAA CAATCCCGAT
CTCGCCGATA TGAAGCTGGT ACTCGTCTCA TCGGCAGGCT CTCATGCGAG CAAGCTTCAC
GAGGAAAACG ACACGCTGGA TGCCCGCGTT GAAAAGCCGT TGAGGCAGCA TGAGCTGCTG
GATTGTCTGA TGCGCGTCTA TAGCGGAGCG CCGCTCGTCG ATTCGAGTTC AGCGCATAAT
CGAGACGTGC TTGAGGACGG CGCGGCGACG TCCGCCCGTA CCTTGCGGAT TCTGCTGGCG
GAGGACAACA AGATCAACCA GAAGGTCGCT CTGGCGATGC TCGAACAGCT GGGGCATAGC
GTGACCATTG CGGAGAACGG ATTGCAGGCC GTCGATGCCG TTCGCCGTGG GCATTTCGAC
GTGGTGCTCA TGGACATTCA GATGCCGGAG CTCGATGGCG TTGGCGCCAC GCGGGGCATT
CGAGCCTTGC CGGCGCCGAA ATGCGACATC CCCATCATCG CGATGACGGC CAATGCGATG
GAAGGCGCCG AGCGGAAATA TCTGGATTGC GGCATGGACG ACTACATATC CAAGCCGGTG
CGCCGGGACA TTCTTTTCGC CAAGCTTGCC AAAGTGTCCG CCGCGTTGCC CGCGCAGCCG
CCTGCGTCCG ACGCAGTCGC GAATGAAGAG CTATCCGGCG ATCCTGCCAC CTTGGCAATC
GAGGAAGGAC TCGCCGACCT CGATGTGGCG TCACTGGAGA ACCTTTTGTC TGTGCTCTCG
ATGTCGGATT TGCGAGAGTT GCTCGATCTC TATCTGAGCG ACACGGAAGA ACGGATTGCA
ACGATCCGGG AGGCGAACGG CCGTGCCGAT CTTGAGGGGA TGATGGCCGC TGCTCATGTC
ATTGTCGGAA CGGCGGGCGA AATCGGTGCC AGGCGGGAAA GCGCGGCCGC GCGGTTGCTT
GAAAAAGCAT GCCGGGCTGG CGACCTGGAC GCCGCGCACC GCCTTATGGA CACGTTGTTC
TCGGCTCATG ACGCGGCCTC TCGTGCGGTG CGCGGCTGGC TGGCGGCGAA TGCCGAGGCT
GTTCACGCTG TCCGTCTCTG A
 
Protein sequence
MLRVIGCITQ EHDLGLVVLA GGLCLFACFT AMNMLGRAKL AQGRVRTLWL MAAGGVAGSG 
IWATHFVAML AFKSILPVNY EIGLTFFSAL IAIVLCGVGF AIVLGRPGPL AGGALIGAAI
SAMHFTGMTA VRIPADPVWD MNYVVAAVVI GVLTSAGAMH AAMRVPALGG ILVGSLLFTI
AICATHFTAM TALTYVPDPT IVVTDVVAEP GMIAIAIATV TFLIIALGLV GALVDNYLAQ
RASGEAERMR VYIAELEATK QELIAAKDQA EAGNRAKSDF LANMSHEIRT PMNGVLGMTE
LLLTTALNAE QRKFAETVRE SGEALLTIVN DILDVSKLEA GRLELDHIDF DLVNTVESAI
SLMAARAAEK QIDLGAYIEP AARGIYRGDP ARLRQVLLNL IGNAIKFTEK GGVAVRVSVY
RVEDPQTKAS HLRFEVHDTG IGIPEDVCGR LFQKFSQADS SVTRRYGGTG LGLAICKQIV
ELMDGNIGVN SRVGAGSTFW FQLSLVRSAA HLPDLSRLPV YLEKLKVLVV DDVPVNLDVL
TSQLGTYGIT VTRAEDGFAA QAELERAWHA GHPYDIAFLD QMMPGISGED LASRIRNNPD
LADMKLVLVS SAGSHASKLH EENDTLDARV EKPLRQHELL DCLMRVYSGA PLVDSSSAHN
RDVLEDGAAT SARTLRILLA EDNKINQKVA LAMLEQLGHS VTIAENGLQA VDAVRRGHFD
VVLMDIQMPE LDGVGATRGI RALPAPKCDI PIIAMTANAM EGAERKYLDC GMDDYISKPV
RRDILFAKLA KVSAALPAQP PASDAVANEE LSGDPATLAI EEGLADLDVA SLENLLSVLS
MSDLRELLDL YLSDTEERIA TIREANGRAD LEGMMAAAHV IVGTAGEIGA RRESAAARLL
EKACRAGDLD AAHRLMDTLF SAHDAASRAV RGWLAANAEA VHAVRL