Gene Xaut_2026 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagXaut_2026 
Symbol 
ID5422451 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameXanthobacter autotrophicus Py2 
KingdomBacteria 
Replicon accessionNC_009720 
Strand
Start bp2284111 
End bp2287005 
Gene Length2895 bp 
Protein Length964 aa 
Translation table11 
GC content70% 
IMG OID640881278 
Productmulti-sensor hybrid histidine kinase 
Protein accessionYP_001416927 
Protein GI154245969 
COG category[T] Signal transduction mechanisms 
COG ID[COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCTGT CCAGCCGGGG CCTTGTCGGC CTGTTCCTCA AAATCATTTC AGAGCCGCGC 
TCGCTGGCGT CGCGGCTGTG GTTCGCCGGC GTGGTGCTGG CGGGAGCATT CGGCCTGCGC
TACGGCCTCA GCCTGTCGCT TCGGGAGCTG GCGCCGTTCC TGTTCCAGCT GCCGGCCATC
ATCGTGGTGA CGCTGATCAG CGGGGCGCGG GTGGGCTTTG GCGCCGTGGT GGCCCTCGCC
CTCGCCAATG CCGCGCTGCT GCCCAGCCAT GCGTGGAGTA ATCTGGGCGG CGGCTTGGGC
GCCCCTGCCG ACGGCGGCTT CGGCGCGGTG GCGGGGCTGT TCTCCTTCAC GGTGAACGCC
CTCGGCCTGT GGCTGATCGG ATCGTTGATA CGGGTCAGCG TCCGCCGCCT GAATGCCACT
CACGGCGCGC TCCTGGCGGC GGTGGCCGAG CAGAACTCCG TGGTGGCGAC CCTGGAAGCC
CTGCTGCAGC ACGCCCCCGT GGGCTTTGCC TTCTTCGACC GCCAGCTGCG CTTCATTCGG
GTCAACGAGA CCCTGGCGCG GATGGTGGGC ATTCCGGCCG GGGAACACGT GGGCCGTTCC
CTCGCCGACA TGCTGCCCCA GCTCTCCGGC GCCATCACTC CCGGTCTGGA GCAGGTGCGC
GCCACCGGCG CCGTCCTCGC CGACGTGGAG GTGGAAGGCG CCACCCCGGC GGCTCCGGGG
GTGTGGCGCC ACTTCCTGGT CAGCTTCTTC CCGGTCCGCA CCCAGGGAGA GGCCATCGGC
CTCGTGGGCA TGATCGTCAC CGAGATCACC GGGCGCAAGA CCGCCGAGAA GGCCCTGGCC
GAGAGCGAGC AACGCTACCG CCTGCTCGCC GAGGCCCTGC CCAAGATGGT CTGGACCGCG
ACCCCGGACG GCAAGGGCGA CTACTACAAT CACCGCTGGA GCGAATATAC CGGCGTCACC
CCGCCGACCG GCGAGGTGTC GGAGTGGCAC ACCCACCTCC ATCCGGAAGA CCAGGCCGCC
GCCCTCGATG AATGGAAGGG CAGCCTGGAA TCGGGCAAGC CCTATTCGCG CGAATGCCGG
TTCCGCGCCG GTGACGGCAG CTATCGCTGG TTCCTGTGCC GCGCGGTCCC GGTGCGCGAC
GACGACGGAC GCATCGACCG CTGGTACGGC AGCTGCACCG ACATTTCCGA GATCGTCGCC
GCCCGCGAGG CGCTCGCCCG CACCAACGAG GATCTGGAGC GGCTCGCCAG CGCCCGCACC
ATGGAACTGG CCCGCGCCAA CGCCCTGCTC AAGCAGGAGA TGGAAGACCG CCTGAAGGCC
GAGGCCCAGC TGCGGCAGGC CCAGAAGATG GAGGCGGTGG GCCAGCTCAC CGGCGGCATC
GCCCACGATT TCAACAATCT GCTCACCGTC ATCATCGGCA ACCTGGAGGC GGCCGAGCGG
CGCGTGCCCA GGGACGACAC CAACAAGGAC GCCACCGACA TCCGGCGCTT CCTCGATTAC
GGCCGCCAGG GCGCGCTTCG GGCCGCCACC CTGACCCAGC AGCTGCTCGC CTTTTCGCGC
CGTCAGCCGC TGGACCCGCG CCCCACCGAC ATCAACAAGC TGATCACCGG CATGTCCGAC
ATGCTGCGCA GCGCGCTGGG CGAGAAGGTG ACGGTGGAAA CCGTGCTGGC CGGCGGCCTC
TGGTGCGCCG AGATCGACCA CAACCAGCTG GAGAACGCCA TCCTCAACCT CGGGGTCAAC
GGCCGCGATG CCATGCCCGC GGGCGGCACG CTGACCATTG AGACCGCCAA TGCCTATCTG
GACGAGGCCT ATTGCGCCGC CCACGAGGAC CTGGAGCCGG GCCAGTATGT GGCGGTCTTC
GTCTGCGACA CCGGCTCCGG CATGGCGGAG GAGGTGCGGG CGCGGGCGTT CGAGCCGTTC
TTCACCACCA AGGGCCCGCG CGAGGGGACC GGGCTCGGCC TCAGCCAGGT CTACGGCTTC
GTCAAGCAGT CGGGCGGCCA CGTGATGATC TACAGCGCCC CCGGCGAGGG CACCACGGTG
AAGCTCTACC TGCCGCGTCA CCCCGACGAC GTGGCGGGCG AGCCGGTCGA CCCCGACGCG
GACCACGCGC CCCATACCGG CGCCGCCCGC GTGCTGCTAG TGGAGGACGA CGCCGCCCTC
CGCGCGCTCT CGACCAAGGC CCTGCGCGAT GCCGGGCACA CGGTGGTGGA GGCCGCCGAC
GCCGCCTCCG CCCTCGCGAC GCTGGAGGAC GGCGCCGTGC CCGACCTGCT CCTCACCGAC
CTGCGCCTCG GCACCGACGC CCAGCGCATG GACGGACGCC ATCTGGCGGA CGAGGTGCGG
CGGCGGCTCA CCACGGTAAG GGTGCTATTT ACGGCCGCAT ATGCGAAAAA TGCTGCCAGT
GAGAACGGAC GGCTGGACCA TGGGGTGCGC CTCCTGACCA AGCCGTTCAC CCAGGCAGAG
TTGGTCACGA AGGTGAAAGA CGTGCTTGAG GCGCCGGGAC ATCGCGGCAC TGTGCTGCTG
GTGGAGGACG AGCCATTCGT CGCCATGGTG GCGCGGCAGA TCCTGGAGGA TCACGGCTTC
GAGGTCACGG TGGCGTCCCA TGGCCACGCG GCGCTGGCCC ATGCCGAGGC GTCGGTGCCC
GACCCGTCGC GCAACGCCCT GGTGCTGGCG GTGGTGGACG TGGGCCTTCC GGACATGAAC
GGGGACGAAG TGGTGCGCCG GCTCGGCGCC ATCGCGCCCG GCCTGCCGGT GATCATCGCC
ACCGGCTACG GCACCCAGGA ACTGGAAGCG GAATTCGGCG CCTCGCCCAG GATCGCGCTC
ATGGGCAAGC CCTATGACGG CGCCACCCTG CGCAATGGCC TGCGCAAGCT CGGCTTCAAC
ATCGAGAGCG AGTGA
 
Protein sequence
MNLSSRGLVG LFLKIISEPR SLASRLWFAG VVLAGAFGLR YGLSLSLREL APFLFQLPAI 
IVVTLISGAR VGFGAVVALA LANAALLPSH AWSNLGGGLG APADGGFGAV AGLFSFTVNA
LGLWLIGSLI RVSVRRLNAT HGALLAAVAE QNSVVATLEA LLQHAPVGFA FFDRQLRFIR
VNETLARMVG IPAGEHVGRS LADMLPQLSG AITPGLEQVR ATGAVLADVE VEGATPAAPG
VWRHFLVSFF PVRTQGEAIG LVGMIVTEIT GRKTAEKALA ESEQRYRLLA EALPKMVWTA
TPDGKGDYYN HRWSEYTGVT PPTGEVSEWH THLHPEDQAA ALDEWKGSLE SGKPYSRECR
FRAGDGSYRW FLCRAVPVRD DDGRIDRWYG SCTDISEIVA AREALARTNE DLERLASART
MELARANALL KQEMEDRLKA EAQLRQAQKM EAVGQLTGGI AHDFNNLLTV IIGNLEAAER
RVPRDDTNKD ATDIRRFLDY GRQGALRAAT LTQQLLAFSR RQPLDPRPTD INKLITGMSD
MLRSALGEKV TVETVLAGGL WCAEIDHNQL ENAILNLGVN GRDAMPAGGT LTIETANAYL
DEAYCAAHED LEPGQYVAVF VCDTGSGMAE EVRARAFEPF FTTKGPREGT GLGLSQVYGF
VKQSGGHVMI YSAPGEGTTV KLYLPRHPDD VAGEPVDPDA DHAPHTGAAR VLLVEDDAAL
RALSTKALRD AGHTVVEAAD AASALATLED GAVPDLLLTD LRLGTDAQRM DGRHLADEVR
RRLTTVRVLF TAAYAKNAAS ENGRLDHGVR LLTKPFTQAE LVTKVKDVLE APGHRGTVLL
VEDEPFVAMV ARQILEDHGF EVTVASHGHA ALAHAEASVP DPSRNALVLA VVDVGLPDMN
GDEVVRRLGA IAPGLPVIIA TGYGTQELEA EFGASPRIAL MGKPYDGATL RNGLRKLGFN
IESE