Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Xaut_2026 |
Symbol | |
ID | 5422451 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Xanthobacter autotrophicus Py2 |
Kingdom | Bacteria |
Replicon accession | NC_009720 |
Strand | - |
Start bp | 2284111 |
End bp | 2287005 |
Gene Length | 2895 bp |
Protein Length | 964 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640881278 |
Product | multi-sensor hybrid histidine kinase |
Protein accession | YP_001416927 |
Protein GI | 154245969 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACCTGT CCAGCCGGGG CCTTGTCGGC CTGTTCCTCA AAATCATTTC AGAGCCGCGC TCGCTGGCGT CGCGGCTGTG GTTCGCCGGC GTGGTGCTGG CGGGAGCATT CGGCCTGCGC TACGGCCTCA GCCTGTCGCT TCGGGAGCTG GCGCCGTTCC TGTTCCAGCT GCCGGCCATC ATCGTGGTGA CGCTGATCAG CGGGGCGCGG GTGGGCTTTG GCGCCGTGGT GGCCCTCGCC CTCGCCAATG CCGCGCTGCT GCCCAGCCAT GCGTGGAGTA ATCTGGGCGG CGGCTTGGGC GCCCCTGCCG ACGGCGGCTT CGGCGCGGTG GCGGGGCTGT TCTCCTTCAC GGTGAACGCC CTCGGCCTGT GGCTGATCGG ATCGTTGATA CGGGTCAGCG TCCGCCGCCT GAATGCCACT CACGGCGCGC TCCTGGCGGC GGTGGCCGAG CAGAACTCCG TGGTGGCGAC CCTGGAAGCC CTGCTGCAGC ACGCCCCCGT GGGCTTTGCC TTCTTCGACC GCCAGCTGCG CTTCATTCGG GTCAACGAGA CCCTGGCGCG GATGGTGGGC ATTCCGGCCG GGGAACACGT GGGCCGTTCC CTCGCCGACA TGCTGCCCCA GCTCTCCGGC GCCATCACTC CCGGTCTGGA GCAGGTGCGC GCCACCGGCG CCGTCCTCGC CGACGTGGAG GTGGAAGGCG CCACCCCGGC GGCTCCGGGG GTGTGGCGCC ACTTCCTGGT CAGCTTCTTC CCGGTCCGCA CCCAGGGAGA GGCCATCGGC CTCGTGGGCA TGATCGTCAC CGAGATCACC GGGCGCAAGA CCGCCGAGAA GGCCCTGGCC GAGAGCGAGC AACGCTACCG CCTGCTCGCC GAGGCCCTGC CCAAGATGGT CTGGACCGCG ACCCCGGACG GCAAGGGCGA CTACTACAAT CACCGCTGGA GCGAATATAC CGGCGTCACC CCGCCGACCG GCGAGGTGTC GGAGTGGCAC ACCCACCTCC ATCCGGAAGA CCAGGCCGCC GCCCTCGATG AATGGAAGGG CAGCCTGGAA TCGGGCAAGC CCTATTCGCG CGAATGCCGG TTCCGCGCCG GTGACGGCAG CTATCGCTGG TTCCTGTGCC GCGCGGTCCC GGTGCGCGAC GACGACGGAC GCATCGACCG CTGGTACGGC AGCTGCACCG ACATTTCCGA GATCGTCGCC GCCCGCGAGG CGCTCGCCCG CACCAACGAG GATCTGGAGC GGCTCGCCAG CGCCCGCACC ATGGAACTGG CCCGCGCCAA CGCCCTGCTC AAGCAGGAGA TGGAAGACCG CCTGAAGGCC GAGGCCCAGC TGCGGCAGGC CCAGAAGATG GAGGCGGTGG GCCAGCTCAC CGGCGGCATC GCCCACGATT TCAACAATCT GCTCACCGTC ATCATCGGCA ACCTGGAGGC GGCCGAGCGG CGCGTGCCCA GGGACGACAC CAACAAGGAC GCCACCGACA TCCGGCGCTT CCTCGATTAC GGCCGCCAGG GCGCGCTTCG GGCCGCCACC CTGACCCAGC AGCTGCTCGC CTTTTCGCGC CGTCAGCCGC TGGACCCGCG CCCCACCGAC ATCAACAAGC TGATCACCGG CATGTCCGAC ATGCTGCGCA GCGCGCTGGG CGAGAAGGTG ACGGTGGAAA CCGTGCTGGC CGGCGGCCTC TGGTGCGCCG AGATCGACCA CAACCAGCTG GAGAACGCCA TCCTCAACCT CGGGGTCAAC GGCCGCGATG CCATGCCCGC GGGCGGCACG CTGACCATTG AGACCGCCAA TGCCTATCTG GACGAGGCCT ATTGCGCCGC CCACGAGGAC CTGGAGCCGG GCCAGTATGT GGCGGTCTTC GTCTGCGACA CCGGCTCCGG CATGGCGGAG GAGGTGCGGG CGCGGGCGTT CGAGCCGTTC TTCACCACCA AGGGCCCGCG CGAGGGGACC GGGCTCGGCC TCAGCCAGGT CTACGGCTTC GTCAAGCAGT CGGGCGGCCA CGTGATGATC TACAGCGCCC CCGGCGAGGG CACCACGGTG AAGCTCTACC TGCCGCGTCA CCCCGACGAC GTGGCGGGCG AGCCGGTCGA CCCCGACGCG GACCACGCGC CCCATACCGG CGCCGCCCGC GTGCTGCTAG TGGAGGACGA CGCCGCCCTC CGCGCGCTCT CGACCAAGGC CCTGCGCGAT GCCGGGCACA CGGTGGTGGA GGCCGCCGAC GCCGCCTCCG CCCTCGCGAC GCTGGAGGAC GGCGCCGTGC CCGACCTGCT CCTCACCGAC CTGCGCCTCG GCACCGACGC CCAGCGCATG GACGGACGCC ATCTGGCGGA CGAGGTGCGG CGGCGGCTCA CCACGGTAAG GGTGCTATTT ACGGCCGCAT ATGCGAAAAA TGCTGCCAGT GAGAACGGAC GGCTGGACCA TGGGGTGCGC CTCCTGACCA AGCCGTTCAC CCAGGCAGAG TTGGTCACGA AGGTGAAAGA CGTGCTTGAG GCGCCGGGAC ATCGCGGCAC TGTGCTGCTG GTGGAGGACG AGCCATTCGT CGCCATGGTG GCGCGGCAGA TCCTGGAGGA TCACGGCTTC GAGGTCACGG TGGCGTCCCA TGGCCACGCG GCGCTGGCCC ATGCCGAGGC GTCGGTGCCC GACCCGTCGC GCAACGCCCT GGTGCTGGCG GTGGTGGACG TGGGCCTTCC GGACATGAAC GGGGACGAAG TGGTGCGCCG GCTCGGCGCC ATCGCGCCCG GCCTGCCGGT GATCATCGCC ACCGGCTACG GCACCCAGGA ACTGGAAGCG GAATTCGGCG CCTCGCCCAG GATCGCGCTC ATGGGCAAGC CCTATGACGG CGCCACCCTG CGCAATGGCC TGCGCAAGCT CGGCTTCAAC ATCGAGAGCG AGTGA
|
Protein sequence | MNLSSRGLVG LFLKIISEPR SLASRLWFAG VVLAGAFGLR YGLSLSLREL APFLFQLPAI IVVTLISGAR VGFGAVVALA LANAALLPSH AWSNLGGGLG APADGGFGAV AGLFSFTVNA LGLWLIGSLI RVSVRRLNAT HGALLAAVAE QNSVVATLEA LLQHAPVGFA FFDRQLRFIR VNETLARMVG IPAGEHVGRS LADMLPQLSG AITPGLEQVR ATGAVLADVE VEGATPAAPG VWRHFLVSFF PVRTQGEAIG LVGMIVTEIT GRKTAEKALA ESEQRYRLLA EALPKMVWTA TPDGKGDYYN HRWSEYTGVT PPTGEVSEWH THLHPEDQAA ALDEWKGSLE SGKPYSRECR FRAGDGSYRW FLCRAVPVRD DDGRIDRWYG SCTDISEIVA AREALARTNE DLERLASART MELARANALL KQEMEDRLKA EAQLRQAQKM EAVGQLTGGI AHDFNNLLTV IIGNLEAAER RVPRDDTNKD ATDIRRFLDY GRQGALRAAT LTQQLLAFSR RQPLDPRPTD INKLITGMSD MLRSALGEKV TVETVLAGGL WCAEIDHNQL ENAILNLGVN GRDAMPAGGT LTIETANAYL DEAYCAAHED LEPGQYVAVF VCDTGSGMAE EVRARAFEPF FTTKGPREGT GLGLSQVYGF VKQSGGHVMI YSAPGEGTTV KLYLPRHPDD VAGEPVDPDA DHAPHTGAAR VLLVEDDAAL RALSTKALRD AGHTVVEAAD AASALATLED GAVPDLLLTD LRLGTDAQRM DGRHLADEVR RRLTTVRVLF TAAYAKNAAS ENGRLDHGVR LLTKPFTQAE LVTKVKDVLE APGHRGTVLL VEDEPFVAMV ARQILEDHGF EVTVASHGHA ALAHAEASVP DPSRNALVLA VVDVGLPDMN GDEVVRRLGA IAPGLPVIIA TGYGTQELEA EFGASPRIAL MGKPYDGATL RNGLRKLGFN IESE
|
| |