Gene YpAngola_A3914 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A3914 
SymbolmalE 
ID5802392 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp4155604 
End bp4156815 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content49% 
IMG OID641341705 
Productmaltose ABC transporter periplasmic protein 
Protein accessionYP_001608215 
Protein GI162421834 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2182] Maltose-binding periplasmic proteins/domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTCGCA GCTTCACGAA ATCCGGCATT GGTAAGACCG CACGTGTTTT GGCTCTTTCG 
GCGTTAACCA CGCTGGTGCT CTCTTCCTCT GCTTTTGCCA AAATTGAAGA AGGTAAACTG
GTTATCTGGA TCAATGGCGA TAAAGGTTAT AACGGATTGG CAGAGGTCGG TAAGAAATTT
GAGAAAGATA CCGGCATCAA AGTGACCATC GAGCATCCAG ATAAATTGGA AGAAAAATTC
CCACAAGTCG CCGCAACCGG TGATGGCCCG GACATTATCT TCTGGGCCCA TGACCGTTTT
GGTGGCTATG CACAATCAGG TTTACTGGCT GAACTGACCC CATCGAAAGC CTTCCAGGAA
AAATTGTTCC CATTCACTTG GGATGCCGTT CGCTTTAATG GCAAGCTGAT TGGTTACCCT
GTTGCAGTCG AAGCGCTGTC ACTGATTTAC AACAAAGATC TGGTGAAAGA AGCACCAAAA
ACGTGGGAAG AGATCCCTGC ACTGGATAAA ACACTGCGTG CTAATGGCAA AAGCGCCATT
ATGTGGAACC TACAAGAACC GTACTTCACT TGGCCCGTTA TCGCCGCTGA TGGCGGTTAT
GCATTCAAGT TTGAAAACGG GGTTTATGAT GCGAAGAACG TGGGCGTAAA TAATGCGGGC
GCCCAAGCTG GCCTACAATT TATTGTCGAT CTCGTTAAGA ATAAGCACAT CAATGCCGAT
ACTGATTACT CCATCGCAGA AGCAGCCTTT AATAAAGGTG AAACCGCGAT GACCATTAAT
GGCCCATGGG CATGGTCCAA TATCGATAAG AGTAAAATTA ATTACGGCGT AACCCTGCTG
CCAACCTTCC ATGGCCAGCC ATCTAAACCC TTTGTCGGTG TACTGACCGC CGGTATTAAC
GCAGCGAGTC CAAATAAAGA ACTGGCAACG GAATTCCTGG AAAACTATCT GATCACCGAC
CAAGGCCTGG CTGAGGTCAA CAAAGATAAA CCACTGGGTG CCGTAGCGCT GAAATCATTC
CAGGAGCAAC TGGCAAAAGA TCCGCGGATT GCAGCAACAA TGGATAACGC CACCAACGGC
GAAATCATGC CAAACATTCC GCAAATGGCA GCCTTCTGGT ATGCCACCCG TAGTGCGGTA
CTGAACGCCA TCACTGGGCG TCAAACCGTT GAAGCGGCAC TGAACGATGC GGCAACCCGT
ATCACGAAGT AA
 
Protein sequence
MTRSFTKSGI GKTARVLALS ALTTLVLSSS AFAKIEEGKL VIWINGDKGY NGLAEVGKKF 
EKDTGIKVTI EHPDKLEEKF PQVAATGDGP DIIFWAHDRF GGYAQSGLLA ELTPSKAFQE
KLFPFTWDAV RFNGKLIGYP VAVEALSLIY NKDLVKEAPK TWEEIPALDK TLRANGKSAI
MWNLQEPYFT WPVIAADGGY AFKFENGVYD AKNVGVNNAG AQAGLQFIVD LVKNKHINAD
TDYSIAEAAF NKGETAMTIN GPWAWSNIDK SKINYGVTLL PTFHGQPSKP FVGVLTAGIN
AASPNKELAT EFLENYLITD QGLAEVNKDK PLGAVALKSF QEQLAKDPRI AATMDNATNG
EIMPNIPQMA AFWYATRSAV LNAITGRQTV EAALNDAATR ITK