Gene YpAngola_A0800 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A0800 
Symbol 
ID5799262 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp819071 
End bp820471 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content47% 
IMG OID641338797 
Productmajor facilitator transporter 
Protein accessionYP_001605375 
Protein GI162420128 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.00491576 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.0752982 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTCAAG ATCACAATTC ATCTCAAGTC GGCTCATCCC AAATCGACTC ATCTCAACAT 
CATGTTGATC AACAACATAA TCATAATCGT CGATTGAATA AGCAAGATTA CAAAACACTC
ACCTTAGCCG CATTAGGTGG AGCGTTAGAG TTCTATGATT TTATTATTTT TGTCTTCTTT
GCCGCGGTTA TTGGTGATCT TTTCTTCCCG GCTGACATGC CTGAATGGCT TCGTCAGGTG
CAGACGTTCG GTATTTTTGC TGCGGGTTAC TTGGCTCGCC CATTGGGGGG GATCATCATG
GCACATTTTG GTGATTTGGT TGGGCGTAAA AAGATGTTTA CTCTTAGCAT CTTATTAATG
GCGTTGCCAA CACTGGCTAT TGGCATGTTG CCGACATATG CCACTATCGG GATCACCGCA
CCTTTACTTC TACTATTAAT GCGGGTATTA CAAGGTGCTG CCATTGGTGG CGAAGTTCCC
GGTGCATGGG TCTTTGTGGC AGAACATGTG CCACGTAAAC GTATTGGCAT TGCCTGCGGT
ACCTTAACGG CCGGTCTGAC TGCTGGGATT TTGTTAGGAT CGCTGGTTGC TACAGTCATG
AATACTACGC TGGGCCATCA GGCGATTTTG GAGGGGGGAT GGCGTATACC GTTCTTCTTG
GGTGGAATTT TCGGTTTGTT TGCCATGTAC TTACGCCGTT GGTTGCAGGA AACTCCGATC
TTTAAAGAAA TGCAGGCGCG TAAAACATTG GCTGAAGAAT TGCCGCTGAA ATCGGTGGTG
GTAAACCATA AGAAAGAAGT GGTTGTTTCG ATGCTGCTGA CTTGGTTGCT CTCTGCTGGC
ATCGTCGTCG TTATTTTGAT GACACCAACC TATCTACAGA AGCAGTTTAA TGTACCGCCA
GAGTTGGCAT TGCAGGCAAA CAGTTTGGCG ATTATCGCAT TGGTTATCGG CTGTGTGGTT
GCCGGATTGG CAATTGACCG CTTTGGGGCC AGTAAAACCT TTATCGTTGG CAGCCTGATG
CTGGCGATGT CGACATGGTC GTTTTATCAC ACCAATCTGA CTAATCCATC TCAGTTATTT
CCCCACTATA TGTTGGCTGG CTTCTGTGTC GGTATTGTTG GTGCAGTGCC TTATGTCATG
GTGCGTGCCT TCCCGGCAGA AGTCCGCTTC ACGGGCATCT CTTTCTCGTA CAATGTGGCC
TATGCCATTT TTGGTGGTTT AACACCTATT CTGGTGACCT TATGGATGAA GTCATCTGCC
ATGGCTCCGG CGTATTACAT GCTGGTGCTA TCGCTGGTGC TATCGCTGGT GGGATTGTTA
TTGGGTATTT ATCTGCGTAA CGACATTAAT AGTGAAGTGA AAGTTCAGAT GCCTAAAAGG
GTGATGAACG GAGTTAATTA A
 
Protein sequence
MSQDHNSSQV GSSQIDSSQH HVDQQHNHNR RLNKQDYKTL TLAALGGALE FYDFIIFVFF 
AAVIGDLFFP ADMPEWLRQV QTFGIFAAGY LARPLGGIIM AHFGDLVGRK KMFTLSILLM
ALPTLAIGML PTYATIGITA PLLLLLMRVL QGAAIGGEVP GAWVFVAEHV PRKRIGIACG
TLTAGLTAGI LLGSLVATVM NTTLGHQAIL EGGWRIPFFL GGIFGLFAMY LRRWLQETPI
FKEMQARKTL AEELPLKSVV VNHKKEVVVS MLLTWLLSAG IVVVILMTPT YLQKQFNVPP
ELALQANSLA IIALVIGCVV AGLAIDRFGA SKTFIVGSLM LAMSTWSFYH TNLTNPSQLF
PHYMLAGFCV GIVGAVPYVM VRAFPAEVRF TGISFSYNVA YAIFGGLTPI LVTLWMKSSA
MAPAYYMLVL SLVLSLVGLL LGIYLRNDIN SEVKVQMPKR VMNGVN