Gene YpAngola_A3883 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A3883 
SymbolmalE1 
ID5802361 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp4118408 
End bp4119643 
Gene Length1236 bp 
Protein Length411 aa 
Translation table11 
GC content48% 
IMG OID641341675 
Productmaltose/maltodextrin ABC transporter periplasmic maltose/maltodextrin-binding protein 
Protein accessionYP_001608185 
Protein GI162419935 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2182] Maltose-binding periplasmic proteins/domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value0.265603 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATCA ATAAAATCAC TGCGCTTGTT TTAACCGCAC TGGCAGTCAC CCAGTTTGCT 
GGTTTCGCTG CCCACGCTGC TACACAGCAA TTAACAGTTT GGGAAGACAT CAAAAAATCT
GCCGGTATTA AAGAGGCTAT CGCTGATTTT GAAAAACAGC ATCAGGTTAA GGTCAATGTG
CTGGAAATGC CTTACGCACA ACAAATTGAA AAACTCCGCC TTGATGGCCC TGCAGGTATC
GGCCCTGATG TGTTGGTGAT TCCCAATGAT CAGTTAGGTG GTGCGGTAGT GCAAGGTTTG
CTGACACCGC TCAGCGTTGA TCCAACCATA GTCACTACTT TTACTAAACC TTCTATCGCT
GCCTTCACCA TGGATAATGC CCTCTACGGT TTACCGAAAG CCGTGGAAAC GCTGGTGATG
ATCTACAACA AAGACATGCT GCCAACGCCG TTAGCTACCT TGGATGAGTA CGCCGCATTC
TCTAAGAAAC AACGCGCAGA AAATAAATAT GGTCTGTTGG CGAAGTTCGA TCAGATCTAT
TACAGCTGGG GAGCGATCGA GCCAATGGGC GGTTACATCT TTGGTAAAGA TGCTAACGGT
AGCTTGAAGG CTAACGATAT CGGGCTAAAT ACACCAGGGG CTGTTGAGGC CGTAACCTAT
TTGAAAACAT TCTATGCTAA CGGTCTGTTT CCAATTGGCA CCATCGGTGA TAACGGCTTG
AATGCTATTG ACTCATTATT CACTGAGAAA AAAGCGGCTG CGGTAATTAA CGGGCCATGG
GCATTCCAAC CGTATGAAGC CGCTGGTATT AACTTTGGTG TGTCACCACT GCCAGCATTG
CCGAACGGCA AAGATATGAG CTCCTTCCTC GGTGTGAAAG GGTATGTCGT TTCTACCTGG
AGCAAAGATA AGGCACTCGC CCAGCAGTTC ATCGAATTTA TTAACCAACC GCAATACGTG
AAAACCCGCT ATCAGGTCAC CAAAGAGATC CCCGCGTTGA CGGCCATGAT TGACGATCCA
TTGATTAAAA ATGATGAAAA AGCCAGTGCG GTAGCCATTC AGGCAAGCCG TGCCTCTGCG
ATGCCTGGTA TTCCAGAAAT GGGCGAAGTG TGGGGACCTG CTAACTCAGC ATTAGAGCTA
AGCGTAACGG GCAAACAGGA GCCTAAAGTC GCTCTCGATA ACGCCGTTAA GCAGATCAAT
ATGCAAATCG AGGCCATGCA GGCCAGTAAT CAGTAA
 
Protein sequence
MKINKITALV LTALAVTQFA GFAAHAATQQ LTVWEDIKKS AGIKEAIADF EKQHQVKVNV 
LEMPYAQQIE KLRLDGPAGI GPDVLVIPND QLGGAVVQGL LTPLSVDPTI VTTFTKPSIA
AFTMDNALYG LPKAVETLVM IYNKDMLPTP LATLDEYAAF SKKQRAENKY GLLAKFDQIY
YSWGAIEPMG GYIFGKDANG SLKANDIGLN TPGAVEAVTY LKTFYANGLF PIGTIGDNGL
NAIDSLFTEK KAAAVINGPW AFQPYEAAGI NFGVSPLPAL PNGKDMSSFL GVKGYVVSTW
SKDKALAQQF IEFINQPQYV KTRYQVTKEI PALTAMIDDP LIKNDEKASA VAIQASRASA
MPGIPEMGEV WGPANSALEL SVTGKQEPKV ALDNAVKQIN MQIEAMQASN Q