Gene YpsIP31758_4100 details

Gene Information

Locus tag: YpsIP31758_4100
Symbol: xylG
ID: 5388296
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pseudotuberculosis IP 31758
Kingdom: Bacteria
Replicon accession: NC_009708
Strand:
Start bp: 4623802
End bp: 4625334
Gene Length: 1533 bp
Protein Length: 510 aa
Translation table: 11
GC content: 49%
IMG OID: 640867130
Product: xylose transporter ATP-binding subunit
Protein accession: YP_001403044
Protein GI: 153948579
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1129] ABC-type sugar transport system, ATPase component
TIGRFAM ID: [TIGR02633] D-xylose ABC transporter, ATP-binding protein
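
The coordinate and length fields in this record are mutually consistent, which a short sketch can confirm. It assumes that the start and end positions are 1-based and inclusive, and that the gene length counts the stop codon; the record states neither convention, and the function names are illustrative only.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(4623802, 4625334)
print(length)                     # 1533
print(protein_length_aa(length))  # 510
```

Under those assumptions the result matches the listed gene length of 1533 bp and protein length of 510 aa.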


Plasmid Coverage information

Num covering plasmid clones: 46
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCCTACC TACTAGAAAT GAAAGATATT ACCAAGCAGT TCGGCGTCGT CAAAGCCGTA 
GATAATATCA GCCTAACGCT GGAAGCGGGG CAGGTATTAT CGTTGTGCGG TGAAAATGGG
TCTGGAAAAT CCACGCTAAT GAAAGTGCTA TGTGGTATTT ATCCCGCAGG TTCCTATCAG
GGAGAAATAA TATTTTCCGG CGAAACCTTA CAGGCAAAAA ATATCCGCGA AACAGAACAA
AAAGGCATTG CGATTATTCA TCAAGAATTG GCACTGGTGA AACAAATGTC AGTGCTGGAG
AACATGTTCC TCGGCTCCGA ATGGGGCCGT TTCGGTATCA TGGATTACGA CGCCATGTAT
TTACGCTGCC AACGGATGCT GGCGCAGGTC AAACTGGTGG TTGACCCCCA TACACCGGTC
AGTGAATTGG GCCTTGGGCA GCAACAATTG GTCGAAATTG CAAAAGCATT AAATAAACAA
GTGCGGCTGC TGGTACTGGA TGAACCAACG GCATCACTGA CAGAAAGTGA AACTGCCATT
TTACTGGATA TTATTCGTGA CCTGCGTAAC CACGGTATTG CCTGCATCTA TATTTCTCAC
AAATTGAATG AAGTAAAAGA GATATCAGAT CATATCTGTG TGATCCGCGA TGGTCGTCAT
ATCGGCACCC GCCCGGCATC GACCATGAGC GAGGATGACA TTATCGCCAT GATGGTAGGG
CGTGAGCTAA AAGAACTCTA TCCCCACGAA GCCCATCACA TTGGCGAGGA GATTCTACGG
GTTGAAAACC TCTGTGCCTG GCATCCGGTG AATCGGCATA TTCGCCGGGT CGATGATGTT
TCTTTCTCAT TGAAACGCGG TGAAATTCTC GGGATCGCCG GTTTGGTCGG TTCAGGGCGG
ACGGAAACGG TTCAGTGCCT GTTTGGGGTA TATCCGGGCC GCTGGCAGGG CGATATCTTT
ATTAAAGGGC AAGCCGCGAC TATTCGGACG TGCCAGCAAG CGATGAAATT GGGTATCGCG
ATGGTGCCGG AAGATCGCAA AAAAGACGGC ATCGTGCCCG TGATGGGGGT TGGCGCTAAT
ATCACACTGG CGGCACTGGA TGATTTTACT GGCGCTTTCA GTTTGCTGGA TGATGCGAAA
GAACAATCGA TAATTGTACA GTCTTTGGCC CGGTTGAAAG TGAAAACGTC TTCTTCAGAG
CTGGCCATTG CCCGCCTGAG TGGGGGCAAT CAGCAAAAAG CTATTTTGGC TAAGTGCCTG
CTATTAAACC CACAAATATT GATCCTCGAT GAACCGACAC GTGGTATCGA CATCGGTGCA
AAATACGAAA TCTACAAACT TATCAATCAA CTGGTCCAAC AGGGGATCGC GGTCATTGTG
ATTTCCTCTG AACTGCCAGA GGTCTTGGGA TTAAGTGATC GGGTGCTGGT CATGCATCAG
GGGCGCATCA AAGCCGATCT TATCAACCAT AACCTGACTC AAGAAAAGGT CATGGAAGCC
GCACTCAGGA GTGAAACCCA TGTCACAAGC TAA
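
The GC content figure in the gene information can in principle be recomputed from the gene sequence. The sketch below shows one way to do so, assuming GC content means the fraction of G and C bases over all bases after removing the spaces and line breaks used for display. The function name is hypothetical, and the example runs only on the first ten-base block rather than the full gene.

```python
def gc_fraction(sequence: str) -> float:
    """Fraction of G/C bases in a DNA string, ignoring whitespace and case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First ten-base block of the gene sequence: 6 of its 10 bases are G or C.
first_block = "ATGCCCTACC"
print(gc_fraction(first_block))  # 0.6
```

Passing the complete gene sequence to the same function and rounding to a whole percent would give a value to compare with the figure in the record.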
 
Protein sequence
MPYLLEMKDI TKQFGVVKAV DNISLTLEAG QVLSLCGENG SGKSTLMKVL CGIYPAGSYQ 
GEIIFSGETL QAKNIRETEQ KGIAIIHQEL ALVKQMSVLE NMFLGSEWGR FGIMDYDAMY
LRCQRMLAQV KLVVDPHTPV SELGLGQQQL VEIAKALNKQ VRLLVLDEPT ASLTESETAI
LLDIIRDLRN HGIACIYISH KLNEVKEISD HICVIRDGRH IGTRPASTMS EDDIIAMMVG
RELKELYPHE AHHIGEEILR VENLCAWHPV NRHIRRVDDV SFSLKRGEIL GIAGLVGSGR
TETVQCLFGV YPGRWQGDIF IKGQAATIRT CQQAMKLGIA MVPEDRKKDG IVPVMGVGAN
ITLAALDDFT GAFSLLDDAK EQSIIVQSLA RLKVKTSSSE LAIARLSGGN QQKAILAKCL
LLNPQILILD EPTRGIDIGA KYEIYKLINQ LVQQGIAVIV ISSELPEVLG LSDRVLVMHQ
GRIKADLINH NLTQEKVMEA ALRSETHVTS
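
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The following is a minimal sketch of that translation, not the pipeline that produced this record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in which start codons are allowed), and it assumes the input is an in-frame coding sequence. The names `CODON_TABLE` and `translate` are illustrative.

```python
# Codon table built from the NCBI-style amino-acid string, with bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(sequence: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = "".join(sequence.split()).upper()  # drop display spaces and line breaks
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three ten-base blocks of the gene sequence.
print(translate("ATGCCCTACC TACTAGAAAT GAAAGATATT"))  # MPYLLEMKDI
```

The output matches the first ten residues of the protein sequence, and the final codons of the gene (`GTC ACA AGC TAA`) translate to the closing residues `VTS` followed by a stop.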