Gene YpAngola_A4155 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A4155 
SymbolxylR 
ID5802635 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp4445429 
End bp4446622 
Gene Length1194 bp 
Protein Length397 aa 
Translation table11 
GC content48% 
IMG OID641341928 
Productxylose operon regulatory protein 
Protein accessionYP_001608431 
Protein GI162421168 
COG category[K] Transcription 
COG ID[COG1609] Transcriptional regulators
[COG2207] AraC-type DNA-binding domain-containing proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.664411 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.0516528 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTGAAA AACGCTACCG GATCACCTTG TTGTTTAACG CTAACAAAGT GTATGACCGG 
CAGGTGGTAG AAGGCGTGGG CGAGTATTTA CAAGCCTCGC AATGTAATTG GGATATTTTT
ATTGAAGAGG ATTTTCGCTG CCGAATCGAC AATATTAAGG ATTGGCTGGG CGATGGTGTG
ATTGCGGATT TTGATGATCG GCAGATAGAG CAGCTACTGG CAAATGTGAA CGTACCGATT
GTCGGCGTCG GCGGCTCTTA TCATCAGTCG GAAGATTATC CATCGGTAGA TTATATCGCG
ACTGACAATA AGGCATTGGT CAACGCGGCA TTTATGCATT TGAAAGAGAA GGGATTAAAC
CGTTTTGCTT TCTATGGGTT GCCCGCCAGT TGCGGTATGC GCTGGGCACA GGAGCGGGAA
TATGCGTTTC GCCAATTAGT GTCTGCCGAA CAATATCAAG GCGTGGTTTA TCAAGGGATG
GCAACGGCTC CGGATAATTG GCAATACGCA CAAAACCGGC TGGCCGATTG GGTACAAACC
TTACCGCATC AGACGGGGAT TATCGCGGTG ACCGATGCAC GGGCACGTCA TTTATTGCAA
GTGTGTGAGC ATCTGGATAT TGCCGTACCA GAGAAACTGA GTGTGATCGG TATTGATAAT
GAAGAGTTAA CCCGTTATTT ATCGCGGGTG GCGCTCTCTT CGGTGGTTCA GGGAACCCGA
CAAATGGGGT ATCGGGCGGC CAAGCTACTC CATCAACGTC TCAAGCTACG GCAAAAACAG
CAAACAGACC CGCCCTTACA GCGTATTTTG GTCCCACCAG TGAAAGTCAT GGCCCGCCGC
TCTACGGACT TCCGCTCGTT ACGTGACCCG GCGGTTATTC AGGCGATGCA TTATATTCGC
CACCACGCTT GCAAGGGGAT CAAAGTTGAA CAGGTATTGG ATGCGGTAGG GATGTCGCGC
TCAAATCTGG AAAAGCGTTT TAAAGATGAG GTTGGCCAAA CCATTCATGG CGTGATTCAT
GAAGAAAAAC TCGATAGGGC GCGTAATTTA CTGGCGGCGA CATCACTCCC TATTAATGAG
ATATCACAGA TGTGCGGTTA TCCATCGCTA CAATACTTTT ATTCAGTGTT CAAAAAAGGT
TATTCCATCA CACCGAAGGA GCACCGTGAT AAATACGGCG AAGTGAGTTA TTGA
 
Protein sequence
MFEKRYRITL LFNANKVYDR QVVEGVGEYL QASQCNWDIF IEEDFRCRID NIKDWLGDGV 
IADFDDRQIE QLLANVNVPI VGVGGSYHQS EDYPSVDYIA TDNKALVNAA FMHLKEKGLN
RFAFYGLPAS CGMRWAQERE YAFRQLVSAE QYQGVVYQGM ATAPDNWQYA QNRLADWVQT
LPHQTGIIAV TDARARHLLQ VCEHLDIAVP EKLSVIGIDN EELTRYLSRV ALSSVVQGTR
QMGYRAAKLL HQRLKLRQKQ QTDPPLQRIL VPPVKVMARR STDFRSLRDP AVIQAMHYIR
HHACKGIKVE QVLDAVGMSR SNLEKRFKDE VGQTIHGVIH EEKLDRARNL LAATSLPINE
ISQMCGYPSL QYFYSVFKKG YSITPKEHRD KYGEVSY