Gene YPK_1998 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYPK_1998 
SymbolaraG 
ID6089192 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis YPIII 
KingdomBacteria 
Replicon accessionNC_010465 
Strand
Start bp2220530 
End bp2222101 
Gene Length1572 bp 
Protein Length523 aa 
Translation table11 
GC content50% 
IMG OID641597065 
ProductL-arabinose transporter ATP-binding protein 
Protein accessionYP_001720738 
Protein GI170024233 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1129] ABC-type sugar transport system, ATPase component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCAGCAC CCCATTCTGC GTTACAAGCC GAGTTGGACG CCGCACAGTC ACCTTATCTG 
GCTTTTCGTG GCATCGGAAA AAGTTTCCCC GGTGTTCTGG CGCTGGATGA TATCAGTTTC
ACCTGTCAGG CGGGCCAGAT CCATGCGCTG ATGGGCGAGA ATGGCGCGGG GAAATCAACC
CTGTTAAAGA TCCTCAGTGG TAACTACACC CCGACACAGG GTGAAATCCA CATTAAAGGG
AAAGCCGTTA ACTTTACCAA TACTACGGAT GCGTTGGATG CTGGTGTGGC GATCATTTAT
CAGGAACTGC ATTTGGTGCC TGAAATGACA GTGGCAGAAA ACATCTATCT GGGCCAATTA
CCCACCAAGA TGGGTATGGT TGATCGAAAA TTGCTGCGTT ATGAATCTCG CATACAGCTA
TCACATCTGG GGTTGGACAT TGATCCCGAT ACCCCACTGA AATATCTCTC CATCGGCCAA
TGGCAGATGG TGGAAATTGC CAAAGCATTA GCCCGCAATG CAAAAATAAT CGCCTTTGAT
GAACCCACCA GTTCGCTCTC TGCCCGAGAA ATTGAGCAAC TGTTCCGCGT GATCCGCGAG
TTACGGGCCG AAGGGCGGGT CATCTTGTAT GTCTCCCATC GAATGGAAGA AATTTTTGCC
CTGAGTGATG CCATTACGGT GTTTAAAGAT GGCCGCTATG TTCGTACGTT TGATGATATG
ACCCAAGTGA ATAATGCGTC ACTGGTGCAA GCTATGGTAG GGCGTAATTT AGGGGATATC
TATGGTTATC AGCCCCGAGA GATAGGTTCT GAACGCTTAA CGCTACAAGC GGTGAAGGCC
ATCGGTGTGG CCTCGCCGAT CAGCTTGACT GTACACCAAG GGGAAATTGT GGGGCTGTTT
GGGTTAGTGG GGGCCGGGCG TAGTGAACTG CTCAAGGGGC TGTTTGGTGA CACCAAACTG
ACCAGTGGGA AACTCTTGCT TGATGGCCAA CCACTGACTA TCCGTTCGCC GATTGACGCT
ATTTCTGCTG GGATCATGTT GTGTCCAGAA GATCGAAAAG CGGATGGCAT CATTCCTGTT
CACTCGGTAC AGGACAATAT CAATATCAGT GCCCGCCGCA AAACATTAAC CGCAGGCTGT
CTGATTAACA ACCGCTGGGA AGCGGAGAAT GCGTTGCTGC GTATTCAGTC TCTGAATATT
AAAACGCCAG GCCCCCAACA ACTCATTATG AATCTATCCG GGGGGAATCA GCAGAAAGCC
ATTTTAGGGC GCTGGTTGTC CGAGGACATG AAAGTGATCC TGTTGGATGA ACCGACCCGT
GGTATTGACG TCGGGGCCAA ACATGAAATC TATAACGTGA TTTATCAACT GGCGAAACAG
GGCATTGCGG TGCTGTTTGC TTCCAGTGAT TTGCCGGAAG TGCTTGGGCT GGCAGATCGT
ATTGTGGTGA TGCGTGAGGG CGCTATCTCT GGTGAGCTAG ACCATGAATA TGCCACTGAA
GAGCAAGCCT TAAGTCTGGC AATGTTACGC ACCCCGAATA TTGCCACCAA TACCGCGTCT
GCGGTTGCCT GA
 
Protein sequence
MSAPHSALQA ELDAAQSPYL AFRGIGKSFP GVLALDDISF TCQAGQIHAL MGENGAGKST 
LLKILSGNYT PTQGEIHIKG KAVNFTNTTD ALDAGVAIIY QELHLVPEMT VAENIYLGQL
PTKMGMVDRK LLRYESRIQL SHLGLDIDPD TPLKYLSIGQ WQMVEIAKAL ARNAKIIAFD
EPTSSLSARE IEQLFRVIRE LRAEGRVILY VSHRMEEIFA LSDAITVFKD GRYVRTFDDM
TQVNNASLVQ AMVGRNLGDI YGYQPREIGS ERLTLQAVKA IGVASPISLT VHQGEIVGLF
GLVGAGRSEL LKGLFGDTKL TSGKLLLDGQ PLTIRSPIDA ISAGIMLCPE DRKADGIIPV
HSVQDNINIS ARRKTLTAGC LINNRWEAEN ALLRIQSLNI KTPGPQQLIM NLSGGNQQKA
ILGRWLSEDM KVILLDEPTR GIDVGAKHEI YNVIYQLAKQ GIAVLFASSD LPEVLGLADR
IVVMREGAIS GELDHEYATE EQALSLAMLR TPNIATNTAS AVA