Gene YpAngola_A1443 details


Gene Information

Locus tag: YpAngola_A1443
Symbol:
ID: 5799910
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pestis Angola
Kingdom: Bacteria
Replicon accession: NC_010159
Strand:
Start bp: 1497614
End bp: 1498768
Gene Length: 1155 bp
Protein Length: 384 aa
Translation table: 11
GC content: 50%
IMG OID: 641339397
Product: major facilitator transporter
Protein accession: YP_001605961
Protein GI: 162421391
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
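
The reported gene and protein lengths can be cross-checked against the start and end coordinates. The sketch below assumes the coordinates are 1-based and inclusive (an assumption, though it is consistent with the values listed above) and that the CDS is unspliced and ends in a single stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(1497614, 1498768)
    print(length, protein_length(length))  # prints: 1155 384
```

Both results agree with the Gene Length (1155 bp) and Protein Length (384 aa) fields in this record.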


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.00314587
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 44
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCACCA AAATACACCA ACAAGCAGTA CAGCCCGGTA TTAGCCAACA AGTTTCTACC 
CGGTTAGCTT TTTTTATTGC CGGGTTAGGC ATGGCCGCTT GGGCACCACT TGTTCCCTTT
GCAAAAGCGC GCATTGGTCT TAATGATGCC TCATTGGGTT TATTACTGTT ATGCATTGGT
ATTGGATCGA TGCTGGCGAT GCCGCTCACT GGCGTGCTTA CCGCGAAGTG GGGCTGTCGG
GCCGTCATTT TACTGGCAGG CGCAGTGCTC TGTTTAGATT TGCCTTTACT CGTATTGATG
AATACTCCCG CGACGATGGC TATCGCACTA TTAGTATTCG GTGCAGCTAT GGGCATAATA
GATGTGGCGA TGAACATTCA GGCTGTCATT GTTGAAAAAG CCAGTGGCCG GGCGATGATG
TCTGGCTTCC ACGGTTTATT CAGTGTCGGT GGGATTGTTG GTGCAGGAGG TGTCAGTGCT
CTATTGTGGC TAGGCCTCAA CCCACTGACA GCGATTATGG CTACCGTAGT ACTCATGATT
ATTTTGCTGC TGGCAGCCAA TAAGAATCTG TTACGTGGCA GCGGTGAACC CCATGATGGG
CCATTGTTTG TTTTTCCCCG TGGCTGGGTG ATGTTCATCG GCTTTTTATG TTTTGTCATG
TTTTTGGCAG AAGGCTCGAT GCTTGACTGG AGTGCCGTCT TCCTGACGAC GCTACGCGGC
ATGTCGCCAT CACAAGCAGG TATGGGCTAC GCCGTATTCG CCATCGCTAT GACACTTGGC
CGCCTAAACG GTGATCGGAT TGTCAATGGG CTGGGCCGTT ACAAGGTCTT ATTAGGTGGC
AGTTTATGTT CTGCCATCGG GATTATTATC GCAATCAGTA TTGATAGCTC AATGGCTGCC
ATTATTGGCT TCATGTTAGT GGGTTTCGGC GCATCGAATG TGGTACCGAT CTTGTTTACC
GCCGCAGGTA ATCAAACCGT TATGCCTGCC AACCTGGCGG TTGCGTCAAT TACAACGATC
GGTTACGCGG GAATTTTGGC TGGCCCGGCA GCTATCGGCT TTATTGCACA ATTAAGTAGT
CTATCGGTTG CTTTTGGCTG TGTAGCACTT CTGTTATTAA CCGTTGCTGC CAGCGCCAGA
GCCGTCACGC GCTAA
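
The gene sequence is printed in blocks of 10 bases, so the spacing and line breaks have to be removed before any analysis. The following is a minimal sketch of that cleanup together with a GC-fraction calculation that can be compared against the GC content field of this record. The helper names `clean_sequence` and `gc_fraction` are illustrative only.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in a (possibly block-formatted) DNA sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


if __name__ == "__main__":
    # Last line of the gene sequence above, used as a small example.
    tail = "GCCGTCACGC GCTAA"
    print(clean_sequence(tail))          # GCCGTCACGCGCTAA
    print(round(gc_fraction(tail), 3))   # 0.667
```

Passing the full gene sequence to `gc_fraction` gives the value for the whole CDS; the example deliberately uses only the final line.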
 
Protein sequence
MSTKIHQQAV QPGISQQVST RLAFFIAGLG MAAWAPLVPF AKARIGLNDA SLGLLLLCIG 
IGSMLAMPLT GVLTAKWGCR AVILLAGAVL CLDLPLLVLM NTPATMAIAL LVFGAAMGII
DVAMNIQAVI VEKASGRAMM SGFHGLFSVG GIVGAGGVSA LLWLGLNPLT AIMATVVLMI
ILLLAANKNL LRGSGEPHDG PLFVFPRGWV MFIGFLCFVM FLAEGSMLDW SAVFLTTLRG
MSPSQAGMGY AVFAIAMTLG RLNGDRIVNG LGRYKVLLGG SLCSAIGIII AISIDSSMAA
IIGFMLVGFG ASNVVPILFT AAGNQTVMPA NLAVASITTI GYAGILAGPA AIGFIAQLSS
LSVAFGCVAL LLLTVAASAR AVTR
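
The protein sequence corresponds to the gene sequence translated with translation table 11. As a hedged illustration, the sketch below builds the codon table from the standard NCBI amino-acid string, which table 11 shares with table 1 for elongation codons, and translates up to the first stop codon. It does not handle the alternative start codons that table 11 permits, which is sufficient here because this gene begins with ATG. The `translate` function is a hypothetical helper written for this example.

```python
from itertools import product

# Codons in NCBI order (first base varies slowest, bases ordered T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    first_line = "ATGAGCACCA AAATACACCA ACAAGCAGTA CAGCCCGGTA TTAGCCAACA AGTTTCTACC"
    print(translate(first_line))  # MSTKIHQQAVQPGISQQVST
```

The output for the first line of the gene sequence matches the first 20 residues of the protein sequence listed above.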