Gene YpsIP31758_4101 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpsIP31758_4101 
SymbolxylF 
ID5386948 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis IP 31758 
KingdomBacteria 
Replicon accessionNC_009708 
Strand
Start bp4625542 
End bp4626537 
Gene Length996 bp 
Protein Length331 aa 
Translation table11 
GC content42% 
IMG OID640867131 
ProductD-xylose transporter subunit XylF 
Protein accessionYP_001403045 
Protein GI153946825 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4213] ABC-type xylose transport system, periplasmic component 
TIGRFAM ID[TIGR02634] D-xylose ABC transporter, substrate-binding protein 


Plasmid Coverage information

Num covering plasmid clones45 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAATTTA AGAACATTTT ACTCTCCGCA TGCGCTGCAC TGGTAATGTT TAGTCAGCCT 
GGATTTAGCA AAGAAATTAA AATCGGTATG GCGATTGATG ACTTACGTCT TGAGCGCTGG
CAAAAAGACC GCGATATTTT TGTTAACAAG GCTGAATCCC TCGGTGCTAA AGTTTTTGTT
CAATCGGCCA ATGGCAATGA AGAAACACAA ATGGCGCAGA TTGAAAATAT GATTAACCGT
GGTGTCGATG TCCTGGTTAT TATTCCCTAC AACGGGCAAG TATTAAGTAA TGTGATCGCT
GAAGCAAAAC GGGAAGGCAT AAAAGTGTTG GCTTATGATC GCATGATAAA TAACGCTGAC
ATTGATTTCT ATATCTCCTT TGACAATGAA AAAGTAGGTG AATTACAAGC TAAAAATTTG
GTTGAACGGG TACCTCAGGG CAATTATTTC CTGATGGGCG GTTCACCAGT GGATAATAAT
GCTAAATTAT TCCGTCAGGG GCAGATGACT GTTCTTAATC CATTGATAAA GGACGGTAAA
ATCAAGATTG TAGGTGACCA ATGGGTTGAT GCCTGGCTAC CAGAAAATGC GTTAAAAATC
ATGGAAAATG CCTTAACGGC AAACAATAAC AATATTGATG CTGTGGTGGC CTCTAATGAT
GCAACCGCCG GTGGTGCCAT TCAGGCTTTA GCCGCACAAG GGCTAGCCGG TAAAGTGGCT
ATTTCTGGTC AAGACGCAGA TTTGGCCGCT ATCAAGCGTA TTGTTGCCGG TACGCAAACC
ATGACGGTAT ACAAGCCCAT CAGTAAATTG GCCAATGATG CGGCTGAGAT CGCCGTGACA
TTGGGTAATG GTGAGCAACC GAAAGCAAAC AGCACGTTAA ATAACGGTAT GAAAGATGTT
CCTGCTTATT TGTTAACACC TATTCAGGTC GATAAAAATA ATATTGATAG CACCATCATT
GCTGACGGGT TCCACAAAAA AGCAGATATT TACTAA
 
Protein sequence
MKFKNILLSA CAALVMFSQP GFSKEIKIGM AIDDLRLERW QKDRDIFVNK AESLGAKVFV 
QSANGNEETQ MAQIENMINR GVDVLVIIPY NGQVLSNVIA EAKREGIKVL AYDRMINNAD
IDFYISFDNE KVGELQAKNL VERVPQGNYF LMGGSPVDNN AKLFRQGQMT VLNPLIKDGK
IKIVGDQWVD AWLPENALKI MENALTANNN NIDAVVASND ATAGGAIQAL AAQGLAGKVA
ISGQDADLAA IKRIVAGTQT MTVYKPISKL ANDAAEIAVT LGNGEQPKAN STLNNGMKDV
PAYLLTPIQV DKNNIDSTII ADGFHKKADI Y