Gene YpAngola_A2131 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A2131 
Symbol 
ID5800601 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp2226545 
End bp2228134 
Gene Length1590 bp 
Protein Length529 aa 
Translation table11 
GC content52% 
IMG OID641340039 
Productmajor facilitator transporter 
Protein accessionYP_001606584 
Protein GI162420978 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value0.025245 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCGATGA TTACCATCGC GCTGGCCCAG ATCCTGATGT CATTCAACGT GGCCTCCCTA 
CCTGTTGCGT TGGGCGGGAT GGTAAAGAGC TTTAATGTGC CACCAACCAC TATTGCCACG
GCGATAGTCA TGTACTCATT GTCGGTTGCA GGCTTTGTGA TGTTGGGTGC CAAACTCAAC
CAACGCTTTG GGCCATTGAT AGTATTCCGC TGTACAGTTC TGTTATTCGG CCTGGCTCAG
ACCATGATGA CATTCAGCCC GAATGTCACT GTAATGATCG GTGCGCAGGC ACTGAGTGGT
CTGGCAGGTG CGGCATTGGT ACCGGCACTG GTGGCGTTAA TTGCTGAAAA CTACCGTGGG
CCTCAACAGG CCACCGCACT GGGGGCATTA GGTTCTGCTC GCGCAGGGGC GGGTGTTGCT
GCGTTCCTGA TCGGCGGTAT TTTGGGAACC CATATCGGCT GGCGTCCAGC GTTCGGTATT
TTGATTGTGT TGTCTGTTAT CGTTTTTGTA CTGAGTTTCC GTCTGAAAGC AGATAAAGGC
CGCCCAGAAG TGGGTATTGA TGTTATCGGT GTCGTTTTAG CTGCATCTGC GATTATTTTG
TTGTCGTTCG GCTTTAATAA CCTGAACCGT TGGGGCTTTG GTCTGGTTCG TGACGGCGCG
CCGTTTGACC TGCTCGGTTT CTCACCAGCG CCGTTTATGA TCGTGTTGGG TATCGTCTTG
GGTCAGGCAT TTGTGGTCTG GACCCGCCGC CGTCAGGAGC AAGGTAAAAC GCCATTACTG
GCGCTAGACG TGATTACATC ACCCAGTGAG CGGGCGGCGG TATTTGCCAT GTTTGCGGTG
GTTGCGCTGG AAGCCATGCT GAACTTTTCT GTTCCGCTGT ATATACAAAT CGTGCAGGGC
AGTTCGCCGA TGGCAACAGC TATCGCTATG ATGCCGTTTA ACCTATCGGT ATTCTTCTCT
GCGATGCTGA TTGTACGTTT CTACAAGAAA CTGACTCCGC GTAAAATTGG TCGTTACGGT
TTCATCACTT GTACTTTGGC GCTGTTGTGG CTGGCATTCG TGGTACGTAA TAACTGGAGT
GAGTGGTCGG TCTTGATTGG TTTGGTGGTG TTCGGTATCG GACAAGGCTC ACTCGTCACC
TTGCTGTTCA ACGTACTGGT CAGTGCATCA CCAAAAGAAT TGGCAGGCGA TGTGGGTTCT
TTACGTGGTA CGACCAACAA CCTGGCAAGC GCCATCGGTA CAGCGGTTGC AGGTGCCTTG
CTGGTGGGCT TGTTAAGTGC CAACGTGATG CGTGGTGTCG CTGAAACGCC GATTCTGACG
GATGAGATCC AAGCTCAGGT CAATATGGAT AGCATCAACT TCGTCAGTAA TGATCGCCTG
AACAGTGTAT TGGCTCAAAC CTCCGCGACC CCAGAACAGG TTGCCGAAGC GGTGCGGGTG
AATGAAGAAG CACGGTTACG TGCGCTGAAA TTCGGTTTAC TGGTTATGGC GCTGCTATCG
CTGTTGGCTA TCTTCCCTGC TGGCCGCTTA CCTGACTATC TGCCGGGTGA ACTGCCTGCT
GATAATCTGG ATAAAAAAGC CAGCAAGTAA
 
Protein sequence
MPMITIALAQ ILMSFNVASL PVALGGMVKS FNVPPTTIAT AIVMYSLSVA GFVMLGAKLN 
QRFGPLIVFR CTVLLFGLAQ TMMTFSPNVT VMIGAQALSG LAGAALVPAL VALIAENYRG
PQQATALGAL GSARAGAGVA AFLIGGILGT HIGWRPAFGI LIVLSVIVFV LSFRLKADKG
RPEVGIDVIG VVLAASAIIL LSFGFNNLNR WGFGLVRDGA PFDLLGFSPA PFMIVLGIVL
GQAFVVWTRR RQEQGKTPLL ALDVITSPSE RAAVFAMFAV VALEAMLNFS VPLYIQIVQG
SSPMATAIAM MPFNLSVFFS AMLIVRFYKK LTPRKIGRYG FITCTLALLW LAFVVRNNWS
EWSVLIGLVV FGIGQGSLVT LLFNVLVSAS PKELAGDVGS LRGTTNNLAS AIGTAVAGAL
LVGLLSANVM RGVAETPILT DEIQAQVNMD SINFVSNDRL NSVLAQTSAT PEQVAEAVRV
NEEARLRALK FGLLVMALLS LLAIFPAGRL PDYLPGELPA DNLDKKASK