Gene YpAngola_A3248 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A3248 
SymbollysA 
ID5801726 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp3449509 
End bp3450771 
Gene Length1263 bp 
Protein Length420 aa 
Translation table11 
GC content51% 
IMG OID641341078 
Productdiaminopimelate decarboxylase 
Protein accessionYP_001607600 
Protein GI162418960 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0019] Diaminopimelate decarboxylase 
TIGRFAM ID[TIGR01048] diaminopimelate decarboxylase 


Plasmid Coverage information

Num covering plasmid clones46 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.0000168986 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCCACGCG CACTGAATGA CCGTTCTTCA GCACTGACTG CCCAAAATCT GATTGCATTG 
CCTGAGCGTT TTGGTTGCCC GGTGTGGGCC TATGATGGTG ATATTATTGC TGAGCGTATT
AATCAATTAC GCCATTTTGA TGTTATCCGC TTCGCGCAAA AAGCCTGTTC GAATATCCAT
ATACTGCGCC TGATGCGTGA GCAAGGTGTT AAGGTTGATT CGGTATCTTT GGGTGAAATC
GAGCGTGCCT TGCACGCGGG TTATCAACCG GGGCAGGAAC CTGCCGAAAT TGTCTTCACG
GCTGATTTAC TGGATCAGGC GACCTTATTG CGCGTTACTG AATTGAATAT CCCGGTCAAT
GCCGGGTCTA TCGATATGTT GGATCAACTG GGGCAACAGG CTCCGGGTCA CCCTGTTTGG
CTGCGCGTCA ATCCGGGCTT TGGTCATGGG CATAGCCAGA AAACCAATAC CGGCGGTGAA
AACAGCAAAC ACGGCATTTG GCATGAAGAA CTCCCTCGCG CACTCAAGAA AATTGAGCAC
TATGGCTTAA CACTGGTGGG GATTCATATG CATATTGGCT CGGGTGTCGA TTATCAACAC
CTTGAGCAAG TCTGCGATGC CATGGTTCAG CAAGTGATTA CCTTAGGGCA CGATATCAGT
GCTATCTCCG CTGGCGGTGG GTTATCGATC CCCTATCAGT TTGGCGATGA TGTGATTGAT
ACTGAACACT ATTATGGGTT ATGGAATAGC GCCAGAGAGC GAATTGCTGC TCATTTGGGC
CACCCGGTTA GCCTTGAAAT TGAACCTGGC CGCTTCCTGG TAGCAGAGTC TGGTGTATTG
ATCGCTCAGG TTCGTGCTGT TAAGGATATG GGCCGCCGGC ATTATGTCTT GGTTGATGCC
GGGTTTAACG ATCTGATGCG TCCAGCGATG TATGGCAGTT ATCACCATAT TTCATTGCTT
CCTGCGGATG GGCGCGATCT GACGTCAGCT CCGCTGATTG ATACCGTGGT TGCTGGCCCG
CTTTGCGAAT CTGGCGATGT CTTCACTCAG CAAGAGGGCG GAGGTGTGGA GACCCTTGCT
CTGCCTGCTG CGGTTATTGG CGATTATCTG GTCTTCCATG ATACGGGGGC TTATGGCGCG
TCGATGTCAT CTAACTACAA CAGCCGCCCG TTATTACCTG AAGTACTGTT TGAAAAGGGC
CAACCGCGCT TGATCCGCCG CCGTCAAACC ATTGAAGAAT TGATTGACCT GGAACGCGTT
TAA
 
Protein sequence
MPRALNDRSS ALTAQNLIAL PERFGCPVWA YDGDIIAERI NQLRHFDVIR FAQKACSNIH 
ILRLMREQGV KVDSVSLGEI ERALHAGYQP GQEPAEIVFT ADLLDQATLL RVTELNIPVN
AGSIDMLDQL GQQAPGHPVW LRVNPGFGHG HSQKTNTGGE NSKHGIWHEE LPRALKKIEH
YGLTLVGIHM HIGSGVDYQH LEQVCDAMVQ QVITLGHDIS AISAGGGLSI PYQFGDDVID
TEHYYGLWNS ARERIAAHLG HPVSLEIEPG RFLVAESGVL IAQVRAVKDM GRRHYVLVDA
GFNDLMRPAM YGSYHHISLL PADGRDLTSA PLIDTVVAGP LCESGDVFTQ QEGGGVETLA
LPAAVIGDYL VFHDTGAYGA SMSSNYNSRP LLPEVLFEKG QPRLIRRRQT IEELIDLERV