Gene YpAngola_A2102 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A2102 
SymbolybtX 
ID5800572 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp2193768 
End bp2195048 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content59% 
IMG OID641340014 
Productyersinabactin region putative transporter YbtX 
Protein accessionYP_001606560 
Protein GI162420704 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones57 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGATG TTCAGTCGAA TGTGAAACCG CTGACGTTGA CGACCGGGCG GGTGATTTTT 
GCTATTGCCG GCGTCTATGT GACGCAGAGT CTGGTATCGG CGCTGTCTAT GCAGTCCTTA
CCCGCGCTGG TGCGCGCTGC TGGCGGTTCG CTGGCGCTTG CCGGTGCGAC AACCCTGTTT
ATGCTGCCCT GGGCGCTGAA GTTTATTTGG GCGCCGTGGA TCGAGCGCTG GCGGCTTCCG
CCCGGTAGCC AGGAACGCCG TTCCCGCATG TTAATCCTGC GTGGTCAGGT CGCGCTAGCG
GCGATCCTGA CTATTGCCGC AGCGATTGGC TGGTTTGGGC GAGAAGGGGG ATTCCCCGAT
ACGCAAATCG TCGCGTTATT TGTTCTGTTT ATGGTGGCAG GCACGGTCGC CTCCACCATT
GATATCGCCA GCGACGGCTT TTGCGTCGAT CAACTGACCC GCACGGGTTA CGGCTGGGGA
AACAGCGTGC AGGTCGGCGG CAGCTATCTG GGAATGATGT GCGGCGGCGG GGTGTTCCTG
ATGCTGTCGG CGGCATCCGG CTGGCCTGTC GCCATGCTGA TGATGGCGGT GCTGATTATG
GCGCTGTCAC TCCCGCTGTG GCGCATTACG GAGCCGACGC GAACAGCGAC TATCCCGCAT
GTTCCGGCGT TAGGTTATGC GCTAAGGAGG AAGCAGGCGC GCCTGGGCTT ACTGCTGGTA
TTGATGCTGA ATTCAGGCAT GCGGTTTGTG CTGCCTCTTC TGGCACCGCT GTTGTTGGAT
CATGGGTTGA GCATGTCCGC ATTGGGCGCG CTGTTCAGCG GCGGCAATAT TGCAGCGGGC
ATAGCAGGAA CGCTGGCCGG CGGATTGCTG ATGAAATACA CCTCACCCGG CAGAGCGCTG
TTGACGGCTT ATGGCGTCCA GGGGATCGCG CTGCTGGCGG TGGTGATGAC GCTCATGATG
GCGCCGGGTC ATCTGCTGCT GCCGATTCTC CAGTGTCTGG TCATTGTCCA GTCCATTTCG
CTGGCCTGCG CGCTGGTCTG TCTTTACGCC ACGCTGATGT CGCTTTCATC GCCTTTGCAG
GCCGGTGTCG ACTTCACCCT CTTTCAATGT ACTGACGCGG CAATAGCCAT TCTGGCAGGC
GTTATCGGTG GCGTTGTTGC TCAACATTTT GGCTATGCGG CCTGCTTCCT GTTTGCCGGG
GCATTCACGT TGCTGGCGGC GTGGGTTGCT TATATCCGGC TGCATTCGGC AAGAGAACTG
ATGACAAGCG CAATTGATTG A
 
Protein sequence
MSDVQSNVKP LTLTTGRVIF AIAGVYVTQS LVSALSMQSL PALVRAAGGS LALAGATTLF 
MLPWALKFIW APWIERWRLP PGSQERRSRM LILRGQVALA AILTIAAAIG WFGREGGFPD
TQIVALFVLF MVAGTVASTI DIASDGFCVD QLTRTGYGWG NSVQVGGSYL GMMCGGGVFL
MLSAASGWPV AMLMMAVLIM ALSLPLWRIT EPTRTATIPH VPALGYALRR KQARLGLLLV
LMLNSGMRFV LPLLAPLLLD HGLSMSALGA LFSGGNIAAG IAGTLAGGLL MKYTSPGRAL
LTAYGVQGIA LLAVVMTLMM APGHLLLPIL QCLVIVQSIS LACALVCLYA TLMSLSSPLQ
AGVDFTLFQC TDAAIAILAG VIGGVVAQHF GYAACFLFAG AFTLLAAWVA YIRLHSAREL
MTSAID