Gene Plav_3491 details


Gene Information

Locus tag: Plav_3491
Symbol:
ID: 5454643
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Parvibaculum lavamentivorans DS-1
Kingdom: Bacteria
Replicon accession: NC_009719
Strand:
Start bp: 3736255
End bp: 3739155
Gene Length: 2901 bp
Protein Length: 966 aa
Translation table: 11
GC content: 63%
IMG OID: 640879076
Product: PAS/PAC sensor hybrid histidine kinase
Protein accession: YP_001414747
Protein GI: 154253923
COG category: [T] Signal transduction mechanisms
COG ID: [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system
TIGRFAM ID: [TIGR00229] PAS domain S-box
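
The coordinate and length fields in the record above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the gene length counts the stop codon (the gene sequence in this record ends in TAG). The function names are illustrative only and are not part of any database API.

```python
# Values copied from the record above.
START_BP = 3736255
END_BP = 3739155
GENE_LENGTH_BP = 2901
PROTEIN_LENGTH_AA = 966


def span_length(start, end):
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for this record: 3739155 - 3736255 + 1 = 2901
    # and 2901 / 3 - 1 = 966.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

A mismatch in either comparison would usually point to a different coordinate convention, a spliced gene, or a frameshift, none of which is indicated for this gene.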


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 49
Fosmid unclonability p-value: 0.691252
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGAGCG GAGAAACGCC GTTCCTTCTG GCGGGCAGCG AAGCAAGCGA CATCATCGCC 
TCGTTCGATT GGGCAGCGAC GCCTCTCGGG CCGATGTCAT CCTGGCCGAG CAGCCTCAAA
GCCACGATTG GCCTTATCCT GCGATCGCCC GTTCCCATCG TCACGCTGTG GGGCGGAGAC
GGCATCATGA TCTACAATGA TGCCTATTCG ATTTTCGCGG GCGGGCGGCA TCCCGGCATT
TTCGGCTCGA AGGTCCGCGA CGGCTGGCCC GAGGTCGCTG ACTTCAACGA CAATGTGATG
AAGGTCGGCC TTGCGGGCGG AACGCTCGCC TATCGCGACC AGGAGTTGAC GCTTTACCGG
AATGGCGAAC CGGAACAAGT GTGGATGAAC CTCGACTATT CGCCTCTTCC CGACGAGACC
GGCAAGCCGG TGGGCGTGAT CGCCATTGTG GTCGAAACGA CGCGGCGCGT GGAAGCGGAG
AGACAGCTCG AGCGGGGCTT CGAAACGCTG CGGCGGATGT TCGAGCAGGC GCCGGGCTTC
GTTGCGATCC TGGCGGGGCC AGAACACACC TTCGTCATGG TCAACAACGC CTATATGCAG
CTTATCGGCC ATCGCGACAC GCTGGGAAAA CCCATTCGCG AGGCGCTGCC GGAAATCGTG
GATCAGGGCT TCGCCTCGCT TCTCGACCGC GTGCGCGACA CCCAGCAACC CTATATCGGA
CGCGGCATTC GCGTGTTGCT GCAGCGCGAA CCGGGAATGG CGCCGGAAGA ACGCTATCTC
GATTTCGTCT TCCATCCGGT AGCAGCCGCC CGCAGCGACG TGCGCGGCAT TTTCGTGCAG
GGGCACGACG TGACCGAGCG GCGGCTTGCC GAGATCGCGC TGCGGGAAAG CGAAGAGCGG
TTCCGCCTCG TTGCCGAAAG CGCGCCGGTG ATGCTGTGGA TGGGCGACGA GACCGGCAAG
TGCATCTATC TGAACGATGC GCTGCGCCGG TTCTGGAATG TGACGGCCGA GGGCGTGGCC
GATTTCAACT GGGCGACGAC GGTTCATCCC GACGACAGCG ATACGCTCCA CGGCCCATTC
AGTGCCGCCA TGGAAGCGCA CACCGCGTTC TCCGTGGAAG TGCGGTTTCG CCGCGCCGAT
GGCGAATACC GGATCATCAG GACGAATGCA CAGCCGCGCT TTGGGCCTGA CGGGAAGTTT
TCCGGCATGA TCGGCGTGAA TGTCGACGTG ACCGAGATGA GGCGAGCGGA AGAGGCGCTG
CACGCCGCCA ATGCGAACCT GGAGCAACGC GTGGCACGGG AAGTGGCCGA ACGCTCGAAG
GCCGAAGACG CCTTGTGGCA GGCGCAGAAG ATGGAAGCCA TCGGCAAGCT GACCGGCGGC
GTCGCGCATG ACTTCAACAA CCTGCTGCAG GTCGTCTCCG GCAATCTGCA ATTGCTGCTG
AAGGACCTGG ACGGAAACGA ACGGGCGCAG CGGCGGATAA CAAACGCGCT TGCCGGTGTC
GACCGCGGCT CCAAGCTTGC GAGCCAGCTT CTCGCTTTCG GGCGGCGGCA ACCGCTGGAA
CCGAAGGTCG TCAACATCGG GCGGCTGGTT TCAGGCATGG GCGACATGCT GCGGCGAACC
ATCGGTGAAA ATGTCGAAGT TGAAACCGTG GTGTCGGGCG GGCTGTGGAA TACATTCGCG
GACCCTGCGC AGCTCGAGAA TGCACTGCTC AATCTGGCCA TCAACGCGCG CGACGCGATG
AACGAGCAGG GACGCATGAC GATCGAGGTG GGGAACGCCT ATCTCGACGA TGTCTATGCG
CTCGATCATC CGGAAATTGC GCCCGGCCAA TATGTGGTTC TGGCAGTGAC AGATACGGGC
TGTGGCATTC CGGCGGATGT GTTGCCGCAG GTATTCGAGC CGTTCTTCTC GACCAAGCCA
CAGGAGAAGG GCACGGGTCT CGGGCTTTCG ATGGTCTATG GCTTCGTCAA ACAATCGGGC
GGGCACATCA AGATTTACAG TGAGGTCGGA CACGGGACGA CCGTGAAGCT CTACCTGCCG
CGCGTGAACG AGATGGAAGA CATAAGCGCG CTGCCGGGGG TTGGCGTTCC AACCGGCGGA
ACGGAGACCA TTCTGGTGGC GGAGGACGAC GAGGCCGTGC GCACGGTCGT GGTCGAGATG
CTGAACGATC TCGGGTATCG GGTGCTGACG GCGCGGGACG CGGCGAGCGC GCTGACCGTG
CTGGAAAGCG GGGTGCCGGT GGACCTGCTC TTTACCGACG TGGTGATGCC GGGACCGCTC
AAGAGCACCG ACCTTGCGAG GAAGGCGCGC GAAAGGCTGC CCAGTATCGG CGTGCTCTTC
ACATCGGGCT ATACCGAAAA TTCGATCGTG CATGGCGGAC GGCTCGATCC CGGCGTCGAC
CTTCTGTCGA AGCCCTATAC GCGGGAGGCG CTGGCGCGGA AGCTGCGCCA GGTCCTGGAC
GGCAAGGAGC CGAAGAAGGC GACAGTGACA ATGGCGACGT CGTCCGCCGC GAATGCGCAG
GAAGCGGGAG CAACGCCCGC TGCGGGTCTG ACGATACTGC TATGCGAGGA CGATGCGCTG
ATCCGCATGA GCACAGCGGA CATGCTGCGC GAAGTGGGGC TGATCGTTAT CGAGACGGAT
ACGGCGCAAG AGGCGCTCGA CATAATCGAG AATGGTGCGG TCGACCTTCT TATCACCGAT
GTGGGACTGC CGGATATGTC GGGCGTGGAA CTTGCGCTGG CGTTGCGGAA ATCCAGGCGG
GACATACCGG TAATCTTCGC GACCGGCCAT GCGGAACTCG ACGGCGCTGA CGGCATATTG
CGCAGTGCGG TTGTATCGAA GCCTTATGCG GTGATCGAAC TGAAGCGGTG CATCGACCAG
CTGATGGCGC TCGGCGCCTA G
 
Protein sequence
MQSGETPFLL AGSEASDIIA SFDWAATPLG PMSSWPSSLK ATIGLILRSP VPIVTLWGGD 
GIMIYNDAYS IFAGGRHPGI FGSKVRDGWP EVADFNDNVM KVGLAGGTLA YRDQELTLYR
NGEPEQVWMN LDYSPLPDET GKPVGVIAIV VETTRRVEAE RQLERGFETL RRMFEQAPGF
VAILAGPEHT FVMVNNAYMQ LIGHRDTLGK PIREALPEIV DQGFASLLDR VRDTQQPYIG
RGIRVLLQRE PGMAPEERYL DFVFHPVAAA RSDVRGIFVQ GHDVTERRLA EIALRESEER
FRLVAESAPV MLWMGDETGK CIYLNDALRR FWNVTAEGVA DFNWATTVHP DDSDTLHGPF
SAAMEAHTAF SVEVRFRRAD GEYRIIRTNA QPRFGPDGKF SGMIGVNVDV TEMRRAEEAL
HAANANLEQR VAREVAERSK AEDALWQAQK MEAIGKLTGG VAHDFNNLLQ VVSGNLQLLL
KDLDGNERAQ RRITNALAGV DRGSKLASQL LAFGRRQPLE PKVVNIGRLV SGMGDMLRRT
IGENVEVETV VSGGLWNTFA DPAQLENALL NLAINARDAM NEQGRMTIEV GNAYLDDVYA
LDHPEIAPGQ YVVLAVTDTG CGIPADVLPQ VFEPFFSTKP QEKGTGLGLS MVYGFVKQSG
GHIKIYSEVG HGTTVKLYLP RVNEMEDISA LPGVGVPTGG TETILVAEDD EAVRTVVVEM
LNDLGYRVLT ARDAASALTV LESGVPVDLL FTDVVMPGPL KSTDLARKAR ERLPSIGVLF
TSGYTENSIV HGGRLDPGVD LLSKPYTREA LARKLRQVLD GKEPKKATVT MATSSAANAQ
EAGATPAAGL TILLCEDDAL IRMSTADMLR EVGLIVIETD TAQEALDIIE NGAVDLLITD
VGLPDMSGVE LALALRKSRR DIPVIFATGH AELDGADGIL RSAVVSKPYA VIELKRCIDQ
LMALGA
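
The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that layout, compute the GC fraction, and translate the gene sequence for comparison with the protein sequence. It is an illustration under stated assumptions, not the method used to build this record: the helper names are hypothetical, and the codon table is the standard one, which shares its codon-to-amino-acid assignments with translation table 11 (table 11 differs in which codons may act as start codons; this gene starts with ATG, so that difference does not matter here).

```python
import itertools

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(itertools.product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Fraction of G and C bases in a cleaned DNA string."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate a cleaned DNA string in frame 1, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First line of the gene sequence from this record.
    first_line = "ATGCAGAGCG GAGAAACGCC GTTCCTTCTG GCGGGCAGCG AAGCAAGCGA CATCATCGCC"
    dna = clean_sequence(first_line)
    # Prints MQSGETPFLLAGSEASDIIA, the first 20 residues of the protein sequence.
    print(translate(dna))
    print(round(gc_fraction(dna), 3))
```

Passing the full gene sequence through `clean_sequence` and `translate` should yield a string that can be compared with the cleaned protein sequence, and `gc_fraction` on the full sequence can be compared with the GC content field.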