Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Plav_3491 |
Symbol | |
ID | 5454643 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Parvibaculum lavamentivorans DS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009719 |
Strand | + |
Start bp | 3736255 |
End bp | 3739155 |
Gene Length | 2901 bp |
Protein Length | 966 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640879076 |
Product | PAS/PAC sensor hybrid histidine kinase |
Protein accession | YP_001414747 |
Protein GI | 154253923 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 49 |
Fosmid unclonability p-value | 0.691252 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGAGCG GAGAAACGCC GTTCCTTCTG GCGGGCAGCG AAGCAAGCGA CATCATCGCC TCGTTCGATT GGGCAGCGAC GCCTCTCGGG CCGATGTCAT CCTGGCCGAG CAGCCTCAAA GCCACGATTG GCCTTATCCT GCGATCGCCC GTTCCCATCG TCACGCTGTG GGGCGGAGAC GGCATCATGA TCTACAATGA TGCCTATTCG ATTTTCGCGG GCGGGCGGCA TCCCGGCATT TTCGGCTCGA AGGTCCGCGA CGGCTGGCCC GAGGTCGCTG ACTTCAACGA CAATGTGATG AAGGTCGGCC TTGCGGGCGG AACGCTCGCC TATCGCGACC AGGAGTTGAC GCTTTACCGG AATGGCGAAC CGGAACAAGT GTGGATGAAC CTCGACTATT CGCCTCTTCC CGACGAGACC GGCAAGCCGG TGGGCGTGAT CGCCATTGTG GTCGAAACGA CGCGGCGCGT GGAAGCGGAG AGACAGCTCG AGCGGGGCTT CGAAACGCTG CGGCGGATGT TCGAGCAGGC GCCGGGCTTC GTTGCGATCC TGGCGGGGCC AGAACACACC TTCGTCATGG TCAACAACGC CTATATGCAG CTTATCGGCC ATCGCGACAC GCTGGGAAAA CCCATTCGCG AGGCGCTGCC GGAAATCGTG GATCAGGGCT TCGCCTCGCT TCTCGACCGC GTGCGCGACA CCCAGCAACC CTATATCGGA CGCGGCATTC GCGTGTTGCT GCAGCGCGAA CCGGGAATGG CGCCGGAAGA ACGCTATCTC GATTTCGTCT TCCATCCGGT AGCAGCCGCC CGCAGCGACG TGCGCGGCAT TTTCGTGCAG GGGCACGACG TGACCGAGCG GCGGCTTGCC GAGATCGCGC TGCGGGAAAG CGAAGAGCGG TTCCGCCTCG TTGCCGAAAG CGCGCCGGTG ATGCTGTGGA TGGGCGACGA GACCGGCAAG TGCATCTATC TGAACGATGC GCTGCGCCGG TTCTGGAATG TGACGGCCGA GGGCGTGGCC GATTTCAACT GGGCGACGAC GGTTCATCCC GACGACAGCG ATACGCTCCA CGGCCCATTC AGTGCCGCCA TGGAAGCGCA CACCGCGTTC TCCGTGGAAG TGCGGTTTCG CCGCGCCGAT GGCGAATACC GGATCATCAG GACGAATGCA CAGCCGCGCT TTGGGCCTGA CGGGAAGTTT TCCGGCATGA TCGGCGTGAA TGTCGACGTG ACCGAGATGA GGCGAGCGGA AGAGGCGCTG CACGCCGCCA ATGCGAACCT GGAGCAACGC GTGGCACGGG AAGTGGCCGA ACGCTCGAAG GCCGAAGACG CCTTGTGGCA GGCGCAGAAG ATGGAAGCCA TCGGCAAGCT GACCGGCGGC GTCGCGCATG ACTTCAACAA CCTGCTGCAG GTCGTCTCCG GCAATCTGCA ATTGCTGCTG AAGGACCTGG ACGGAAACGA ACGGGCGCAG CGGCGGATAA CAAACGCGCT TGCCGGTGTC GACCGCGGCT CCAAGCTTGC GAGCCAGCTT CTCGCTTTCG GGCGGCGGCA ACCGCTGGAA CCGAAGGTCG TCAACATCGG GCGGCTGGTT TCAGGCATGG GCGACATGCT GCGGCGAACC ATCGGTGAAA ATGTCGAAGT TGAAACCGTG GTGTCGGGCG GGCTGTGGAA TACATTCGCG GACCCTGCGC AGCTCGAGAA TGCACTGCTC AATCTGGCCA TCAACGCGCG CGACGCGATG AACGAGCAGG GACGCATGAC GATCGAGGTG GGGAACGCCT ATCTCGACGA TGTCTATGCG CTCGATCATC CGGAAATTGC GCCCGGCCAA TATGTGGTTC TGGCAGTGAC AGATACGGGC TGTGGCATTC CGGCGGATGT GTTGCCGCAG GTATTCGAGC CGTTCTTCTC GACCAAGCCA CAGGAGAAGG GCACGGGTCT CGGGCTTTCG ATGGTCTATG GCTTCGTCAA ACAATCGGGC GGGCACATCA AGATTTACAG TGAGGTCGGA CACGGGACGA CCGTGAAGCT CTACCTGCCG CGCGTGAACG AGATGGAAGA CATAAGCGCG CTGCCGGGGG TTGGCGTTCC AACCGGCGGA ACGGAGACCA TTCTGGTGGC GGAGGACGAC GAGGCCGTGC GCACGGTCGT GGTCGAGATG CTGAACGATC TCGGGTATCG GGTGCTGACG GCGCGGGACG CGGCGAGCGC GCTGACCGTG CTGGAAAGCG GGGTGCCGGT GGACCTGCTC TTTACCGACG TGGTGATGCC GGGACCGCTC AAGAGCACCG ACCTTGCGAG GAAGGCGCGC GAAAGGCTGC CCAGTATCGG CGTGCTCTTC ACATCGGGCT ATACCGAAAA TTCGATCGTG CATGGCGGAC GGCTCGATCC CGGCGTCGAC CTTCTGTCGA AGCCCTATAC GCGGGAGGCG CTGGCGCGGA AGCTGCGCCA GGTCCTGGAC GGCAAGGAGC CGAAGAAGGC GACAGTGACA ATGGCGACGT CGTCCGCCGC GAATGCGCAG GAAGCGGGAG CAACGCCCGC TGCGGGTCTG ACGATACTGC TATGCGAGGA CGATGCGCTG ATCCGCATGA GCACAGCGGA CATGCTGCGC GAAGTGGGGC TGATCGTTAT CGAGACGGAT ACGGCGCAAG AGGCGCTCGA CATAATCGAG AATGGTGCGG TCGACCTTCT TATCACCGAT GTGGGACTGC CGGATATGTC GGGCGTGGAA CTTGCGCTGG CGTTGCGGAA ATCCAGGCGG GACATACCGG TAATCTTCGC GACCGGCCAT GCGGAACTCG ACGGCGCTGA CGGCATATTG CGCAGTGCGG TTGTATCGAA GCCTTATGCG GTGATCGAAC TGAAGCGGTG CATCGACCAG CTGATGGCGC TCGGCGCCTA G
|
Protein sequence | MQSGETPFLL AGSEASDIIA SFDWAATPLG PMSSWPSSLK ATIGLILRSP VPIVTLWGGD GIMIYNDAYS IFAGGRHPGI FGSKVRDGWP EVADFNDNVM KVGLAGGTLA YRDQELTLYR NGEPEQVWMN LDYSPLPDET GKPVGVIAIV VETTRRVEAE RQLERGFETL RRMFEQAPGF VAILAGPEHT FVMVNNAYMQ LIGHRDTLGK PIREALPEIV DQGFASLLDR VRDTQQPYIG RGIRVLLQRE PGMAPEERYL DFVFHPVAAA RSDVRGIFVQ GHDVTERRLA EIALRESEER FRLVAESAPV MLWMGDETGK CIYLNDALRR FWNVTAEGVA DFNWATTVHP DDSDTLHGPF SAAMEAHTAF SVEVRFRRAD GEYRIIRTNA QPRFGPDGKF SGMIGVNVDV TEMRRAEEAL HAANANLEQR VAREVAERSK AEDALWQAQK MEAIGKLTGG VAHDFNNLLQ VVSGNLQLLL KDLDGNERAQ RRITNALAGV DRGSKLASQL LAFGRRQPLE PKVVNIGRLV SGMGDMLRRT IGENVEVETV VSGGLWNTFA DPAQLENALL NLAINARDAM NEQGRMTIEV GNAYLDDVYA LDHPEIAPGQ YVVLAVTDTG CGIPADVLPQ VFEPFFSTKP QEKGTGLGLS MVYGFVKQSG GHIKIYSEVG HGTTVKLYLP RVNEMEDISA LPGVGVPTGG TETILVAEDD EAVRTVVVEM LNDLGYRVLT ARDAASALTV LESGVPVDLL FTDVVMPGPL KSTDLARKAR ERLPSIGVLF TSGYTENSIV HGGRLDPGVD LLSKPYTREA LARKLRQVLD GKEPKKATVT MATSSAANAQ EAGATPAAGL TILLCEDDAL IRMSTADMLR EVGLIVIETD TAQEALDIIE NGAVDLLITD VGLPDMSGVE LALALRKSRR DIPVIFATGH AELDGADGIL RSAVVSKPYA VIELKRCIDQ LMALGA
|
| |