Gene YpsIP31758_4098 details


Gene Information

Locus tag: YpsIP31758_4098
Symbol: xylR
ID: 5386837
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pseudotuberculosis IP 31758
Kingdom: Bacteria
Replicon accession: NC_009708
Strand:
Start bp: 4621348
End bp: 4622541
Gene Length: 1194 bp
Protein Length: 397 aa
Translation table: 11
GC content: 49%
IMG OID: 640867128
Product: xylose operon regulatory protein
Protein accession: YP_001403042
Protein GI: 153949200
COG category: [K] Transcription
COG ID: [COG1609] Transcriptional regulators
        [COG2207] AraC-type DNA-binding domain-containing proteins
TIGRFAM ID:
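
The listed lengths can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a stop codon that is not counted in the protein length. Both assumptions are consistent with the values in the record but are not stated by it. The function names are illustrative only.

```python
# Coordinates copied from the record above.
# Assumption: 1-based, inclusive positions on the replicon.
START_BP = 4621348
END_BP = 4622541


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length, protein_length(length))  # prints "1194 397"
```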


Plasmid Coverage information

Num covering plasmid clones: 53
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTTGAAA AACGCTACCG GATCACCTTG TTGTTTAACG CTAACAAAGT GTATGACCGG 
CAGGTGGTAG AAGGCGTGGG CGAGTATTTA CAAGCCTCGC AATGTAATTG GGATATTTTT
ATTGAAGAGG ATTTTCGCTG CCGAATCGAC AATATTAAGG ATTGGCTGGG CGATGGTGTG
ATTGCGGATT TTGATGATCG GCAGATAGAG CAGCTACTGG CAAATGTGAA CGTACCGATT
GTCGGCGTCG GCGGCTCTTA TCATCAGTCG GAAGATTATC CATCGGTAGA TTATATCGCG
ACTGACAATA AGGCATTGGT CAATGCGGCA TTTATGCATT TGAAAGAGAA GGGATTAAAC
CGTTTTGCTT TCTATGGGTT GCCCGCCAGT TGCGGTATGC GCTGGGCACA GGAGCGGGAA
TATGCGTTTC GCCAATTAGT GTCTGCCGAA CAATATCAAG GCGTGGTTTA TCAAGGGATG
GCAACGGCTC CGGATAATTG GCAATACGCA CAAAACCGGC TGGCCGATTG GGTACAAACC
TTACCGCATC AGACGGGGAT TATCGCGGTG ACCGATGCAC GGGCACGTCA TTTATTGCAA
GTGTGTGAGC ATCTGGATAT TGCCGTACCA GAGAAACTGA GTGTGATCGG TATTGATAAT
GAAGAGTTAA CCCGTTATTT ATCGCGGGTG GCGCTCTCTT CGGTGGTTCA GGGAACCCGA
CAAATGGGGT ATCGGGCGGC CAAGCTACTC CATCAACGTC TCAAGCTACG GCAAAAACAG
CAAACAGACC CGCCCTTACA GCGTATTTTG GTCCCACCAG TGAAAGTCAT GGCCCGCCGC
TCTACGGACT TCCGCTCGTT ACGTGACCCG GCGGTTATTC AGGCGATGCA TTATATTCGC
CACCACGCTT GCAAGGGGAT CAAAGTTGAA CAGGTATTGG ATGCGGTAGG GATGTCGCGC
TCAAATCTGG AAAAGCGTTT TAAAGATGAG GTCGGCCAAA CCATTCATGG CGTGATTCAT
GAAGAAAAAC TCGATAGGGC GCGCAATTTA CTGGCGGCGA CATCACTCCC TATTAATGAG
ATATCACAGA TGTGCGGTTA TCCATCGCTA CAATACTTTT ATTCAGTGTT CAAAAAAGGT
TATTCCATCA CACCGAAGGA GCACCGTGAC AAATACGGCG AAGTGAGTTA TTGA
 
Protein sequence
MFEKRYRITL LFNANKVYDR QVVEGVGEYL QASQCNWDIF IEEDFRCRID NIKDWLGDGV 
IADFDDRQIE QLLANVNVPI VGVGGSYHQS EDYPSVDYIA TDNKALVNAA FMHLKEKGLN
RFAFYGLPAS CGMRWAQERE YAFRQLVSAE QYQGVVYQGM ATAPDNWQYA QNRLADWVQT
LPHQTGIIAV TDARARHLLQ VCEHLDIAVP EKLSVIGIDN EELTRYLSRV ALSSVVQGTR
QMGYRAAKLL HQRLKLRQKQ QTDPPLQRIL VPPVKVMARR STDFRSLRDP AVIQAMHYIR
HHACKGIKVE QVLDAVGMSR SNLEKRFKDE VGQTIHGVIH EEKLDRARNL LAATSLPINE
ISQMCGYPSL QYFYSVFKKG YSITPKEHRD KYGEVSY
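
The sequences above are printed in space-separated blocks of ten, so they need cleaning before any computation. The sketch below shows one way to strip the whitespace, compute GC content, and translate the gene sequence for comparison with the protein sequence. It assumes the amino acid assignments of translation table 11, which match the standard genetic code. It does not handle the alternative start codons that table 11 allows, which is harmless here because the gene starts with ATG. The helper names are hypothetical and are not part of any database tool.

```python
from itertools import product

# Codon table in the usual TCAG ordering (third base varies fastest).
# Assumption: table 11 uses the same amino acid assignments as the
# standard code; only its set of permitted start codons differs.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    seq = clean_sequence(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases and last 15 bases of the gene sequence above.
    print(translate("ATGTTTGAAA AACGCTACCG GATCACCTTG"))  # prints "MFEKRYRITL"
    print(translate("GAAGTGAGTTATTGA"))  # prints "EVSY"
```

Passing the full gene sequence to `translate` and comparing the result with the cleaned protein sequence is a quick consistency check on a downloaded record.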