Gene Xaut_4049 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagXaut_4049 
Symbol 
ID5424455 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameXanthobacter autotrophicus Py2 
KingdomBacteria 
Replicon accessionNC_009720 
Strand
Start bp4479387 
End bp4481279 
Gene Length1893 bp 
Protein Length630 aa 
Translation table11 
GC content71% 
IMG OID640883303 
Producthistidine kinase 
Protein accessionYP_001418928 
Protein GI154247970 
COG category[T] Signal transduction mechanisms 
COG ID[COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.431231 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.418791 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCCCCG ATCTCGCCAC CGCAGCGCGC GCCATGCTCC TCGCCGTTCT CGGCGCGGCA 
GCACTGTTCG CCGTGGTGAA GGCGGGGGAG ATGGCGGAGG CCCGCGTCGG CGAGCGCCTC
GCCAGCGAGG CCCGCCACCG CAGCGAGATC TATGCCCAGA GCCTGGAGGG CGCCATCGAG
CGGTTCGGCT ATCTGCCCGC CGCCGCCGCC CTCGACCTCA ATGTGCGCAG CGCCCTCGTC
CAGCCCGACG ACCCGAGGCT GATCGCCACC GTCAACACCT ATCTGGAAAC CCTCAACCGC
GCTGCCGGGG CCAGCGTGCT CTATCTCATC GACCCCTCGG GCATGACCAT CGCCGCGTCC
AACTGGGACA CGCGGGAGAC CTATGTGGGC GTGGACTTCA GCTATCGCCC CTATTTCACC
GAGGCCATGG CGGGCCAGGC CGGCCGGTTC TACGCCATCG GCACGCTGAC CGGCGTGCCC
GGCTATTTCA TCAGCGCCCC GGTGGCGATC GACGGCAGGA TCCGGGGCGT GGTGGTCACC
AAGGTCAATC TCGACCCCTT GGAGGCGGTG TGGCAGGAGG CCGCCGACCG GGTGCTGGTG
ACCGACCCGA ATGGCATCGT CTTCCTCGCC TCCGATACGC GCTTCAAGTT CCACGCTACC
CGCCCCATCG ATGCCGCCAC CGCGCAGGAA TTGAAAAAGA CTCGCCAATA CGGGCGCAAG
ACCTATCCGC TGCTCAATCT CGGCACGGTG GAGACGGTGG GCGGCGTGCC TCTGATCGCG
GCCTCGGACG TGGCCCCTCA CGGCCGTGTC ATCGCCGACG ACAAGCCCCT TGTCCCCTAT
GACTGGCACC TGCTGCTGTT TGCAGACGCC GCCGCGCTGG AGCTGGCCGG GCGCACCGCC
CGCGTGGGCA TGACGCTGGG CTTCGGCGTG CTCGGCCTCA TCGGCCTCTA TTGGTGGCAG
CACCTCAAGC GCGCCCGCGA GAGCCTCGCC GCCCGCGCGG CACTGGAGGC GGCGCAGCGC
GAGCTGGAAG CCAAGGTGGA AGCCCGCACC GCCGACCTCA CCGCCGCCAA CGGCCAGCTC
GCGGGCGAGA TCGAGGAACG CCGCCGCGCC GAGGCGGAGC TGCGCGCCGC GCAGGACGAG
CTGGTGCAGG CCGCCAAGAT GGCGACCCTC GGCCAGATGG CGGCCGGCAT CACCCATGAG
CTGAACCAGC CCCTCACCGC CTTGCGCGGC CTCGCCGACA ATGCCGGCAA GCTTTTGGAG
CAGGGGCGCG AGGAGGAGGC GCAGGCCAAT CTCGGCCGCA TCACCGCCAT GGTGGACCGG
CTCGGCAAGA TCACCGGGCA GTTGCGCGCC TTCGCCCGCA AGAGCAGCAG CGAGACGCTG
CCGGTGGACG TGGCGGCGGC GGTGGCCGAG AGCCTCGCCA TCCTCGCCCC GCGCATCCGC
GTCGCCGGCA TCGGCGTCCA GACCGACCTC GACGCGGCCG CTGCCACCGT GCGGTTCGAA
CCCATCCGCC TCAGCCAGGT TCTGGTGAAC CTCGTGGGCA ACGCGCTGGA CGCGGTGAAG
GGTCGGCCTA GCGCCTGGGT CGGCCTTTCC ACCCGGCGCG AGGGCTCCCG CATCGTCCTC
AGCGTGGAGG ACAACGGCCC CGGCCTGCCG GAGGCCACCC TCGCGCGCAT CTTCGATCCC
TTCTTCACCA CCAAGCCGGC GGGCGAGGGG CTGGGGCTGG GCCTGCCCAT CTCGCTCGCC
ATCGCCCGCG AATTCGGCGC CACCCTCACC GCCCGCGCGC GAACAGGCGG CGGCCTCGCC
TTCGATCTCG TCATGGAGGC TGCGGAGAGC GCGGACGCCG CCCCGCCCCA GCCGGAAAGA
TTGCGCCACG AAAGGACGCA CCATGTGTGC TGA
 
Protein sequence
MRPDLATAAR AMLLAVLGAA ALFAVVKAGE MAEARVGERL ASEARHRSEI YAQSLEGAIE 
RFGYLPAAAA LDLNVRSALV QPDDPRLIAT VNTYLETLNR AAGASVLYLI DPSGMTIAAS
NWDTRETYVG VDFSYRPYFT EAMAGQAGRF YAIGTLTGVP GYFISAPVAI DGRIRGVVVT
KVNLDPLEAV WQEAADRVLV TDPNGIVFLA SDTRFKFHAT RPIDAATAQE LKKTRQYGRK
TYPLLNLGTV ETVGGVPLIA ASDVAPHGRV IADDKPLVPY DWHLLLFADA AALELAGRTA
RVGMTLGFGV LGLIGLYWWQ HLKRARESLA ARAALEAAQR ELEAKVEART ADLTAANGQL
AGEIEERRRA EAELRAAQDE LVQAAKMATL GQMAAGITHE LNQPLTALRG LADNAGKLLE
QGREEEAQAN LGRITAMVDR LGKITGQLRA FARKSSSETL PVDVAAAVAE SLAILAPRIR
VAGIGVQTDL DAAAATVRFE PIRLSQVLVN LVGNALDAVK GRPSAWVGLS TRREGSRIVL
SVEDNGPGLP EATLARIFDP FFTTKPAGEG LGLGLPISLA IAREFGATLT ARARTGGGLA
FDLVMEAAES ADAAPPQPER LRHERTHHVC