Gene YpsIP31758_1884 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpsIP31758_1884 
SymbolaraG 
ID5387024 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis IP 31758 
KingdomBacteria 
Replicon accessionNC_009708 
Strand
Start bp2183306 
End bp2184877 
Gene Length1572 bp 
Protein Length523 aa 
Translation table11 
GC content50% 
IMG OID640864868 
ProductL-arabinose transporter ATP-binding protein 
Protein accessionYP_001400859 
Protein GI153950595 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1129] ABC-type sugar transport system, ATPase component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value0.326668 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCAGCAC CCCATTCTGC GTTACAAGCC GAGTTGGACG CCGCACAGTC ACCTTATCTG 
GCTTTTCGTG GCATCGGAAA AAGTTTCCCC GGTGTTCTGG CGCTGGATGA TATCAGTTTC
ACCTGTCAGG CGGGCCAGAT CCATGCGCTG ATGGGCGAGA ATGGCGCGGG GAAATCAACC
CTGTTAAAGA TCCTCAGTGG TAACTACACC CCGACACAGG GTGAAATCCA CATTAAAGGG
AAAGCCGTTA ACTTTACCAA TACTACGGAT GCGTTGGATG CTGGTGTGGC GATCATTTAT
CAGGAACTGC ATTTGGTGCC TGAAATGACA GTGGCAGAAA ACATCTATCT GGGCCAATTA
CCCACCAAGA TGGGTATGGT TGATCGAAAA TTGCTGCGTT ATGAATCTCG CATACAGCTA
TCACATCTGG GGTTGGACAT TGATCCCGAT ACCCCACTGA AATATCTCTC CATCGGCCAA
TGGCAGATGG TGGAAATTGC CAAAGCATTA GCCCGCAATG CAAAAATAAT CGCCTTTGAT
GAACCCACCA GTTCGCTCTC TGCCCGAGAA ATTGAGCAAC TGTTCCGCGT GATCCGCGAG
TTACGGGCCG AAGGGCGGGT CATCTTGTAT GTCTCCCATC GAATGGAAGA AATTTTTGCC
CTGAGTGATG CCATTACGGT GTTTAAAGAT GGCCGCTATG TTCGTACGTT TGATGATATG
ACCCAAGTGA ATAATGCGTC ACTGGTGCAA GCTATGGTAG GGCGTAATTT AGGGGATATC
TATGGTTATC AGCCCCGAGA GATAGGTTCT GAACGCTTAA CGCTACAAGC GGTGAAGGCC
ATCGGTGTGG CCTCGCCGAT CAGCTTGACT GTACACCAAG GGGAAATTGT GGGGCTGTTT
GGGTTAGTGG GGGCCGGGCG TAGTGAACTG CTCAAGGGGC TGTTTGGTGA CACCAAACTG
ACCAGTGGGA AACTCTTGCT TGATGGCCAA CCACTGACTA TCCGTTCGCC GATTGACGCT
ATTTCTGCTG GGATCATGTT GTGTCCAGAA GATCGAAAAG CGGATGGCAT CATTCCTGTT
CACTCGGTAC AGGACAATAT CAATATCAGT GCCCGCCGCA AAACATTAAC CGCAGGCTGT
CTGATTAACA ACCGCTGGGA AGCGGAGAAT GCGTTGCTGC GTATTCAGTC TCTGAATATT
AAAACGCCAG GCCCCCAACA ACTCATTATG AATCTATCCG GGGGGAATCA GCAGAAAGCC
ATTTTAGGAC GCTGGTTGTC CGAGGACATG AAAGTGATCC TGTTGGATGA ACCGACCCGT
GGTATTGACG TCGGGGCCAA ACATGAAATC TATAACGTGA TTTATCAACT GGCGAAACAG
GGCATTGCGG TGCTGTTTGC TTCCAGTGAT TTGCCGGAAG TGCTTGGGCT GGCAGATCGT
ATTGTGGTGA TGCGTGAGGG CGCTATCTCT GGTGAGCTAG ACCATGAATA TGCCACTGAA
GAGCAAGCCT TAAGTCTGGC AATGTTACGC ACCCCGAATA TTGCCACCAA TACCGCGTCT
GCGGTTGCCT GA
 
Protein sequence
MSAPHSALQA ELDAAQSPYL AFRGIGKSFP GVLALDDISF TCQAGQIHAL MGENGAGKST 
LLKILSGNYT PTQGEIHIKG KAVNFTNTTD ALDAGVAIIY QELHLVPEMT VAENIYLGQL
PTKMGMVDRK LLRYESRIQL SHLGLDIDPD TPLKYLSIGQ WQMVEIAKAL ARNAKIIAFD
EPTSSLSARE IEQLFRVIRE LRAEGRVILY VSHRMEEIFA LSDAITVFKD GRYVRTFDDM
TQVNNASLVQ AMVGRNLGDI YGYQPREIGS ERLTLQAVKA IGVASPISLT VHQGEIVGLF
GLVGAGRSEL LKGLFGDTKL TSGKLLLDGQ PLTIRSPIDA ISAGIMLCPE DRKADGIIPV
HSVQDNINIS ARRKTLTAGC LINNRWEAEN ALLRIQSLNI KTPGPQQLIM NLSGGNQQKA
ILGRWLSEDM KVILLDEPTR GIDVGAKHEI YNVIYQLAKQ GIAVLFASSD LPEVLGLADR
IVVMREGAIS GELDHEYATE EQALSLAMLR TPNIATNTAS AVA