Gene YpAngola_A3052 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A3052 
Symbollon 
ID5801525 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp3225364 
End bp3227718 
Gene Length2355 bp 
Protein Length784 aa 
Translation table11 
GC content48% 
IMG OID641340889 
ProductDNA-binding ATP-dependent protease La 
Protein accessionYP_001607418 
Protein GI162418498 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0466] ATP-dependent Lon protease, bacterial type 
TIGRFAM ID[TIGR00763] ATP-dependent protease La 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000101207 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones47 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCCTG AGCGTTCCGA ACGCATAGAA ATCCCCGTAT TGCCTCTGCG CGATGTGGTG 
GTTTATCCGC ATATGGTGAT CCCACTATTT GTTGGCCGGG AAAAGTCGAT TCGGTGCCTG
GAAGCTGCGA TGGACCATGA TAAAAGAATC ATGCTGGTTG CGCAGAAAGA AGCTTCGACC
GACGAACCTG GTATCAACGA TCTGTTTTCG GTGGGTACCG TAGCCTCGAT TTTGCAAATG
CTGAAGTTGC CTGACGGCAC AGTAAAAGTG TTGGTTGAAG GTTTACAGCG TGCGCGTATC
ACCACGCTTT CTGACAGTGG CGAGCATTTT GCTGCCCAAG CGGAATACCT TGAATCACCC
GTGATGGATG ATCGCGAGCA AGAAGTCTTG GTGCGTACCG CGATTAATCA GTTTGAAGGT
TATATCAAAC TGAACAAAAA AATTCCGCCG GAAGTGCTGG CTTCGCTGCA CAGTATTGAT
GATGCGGCAC GTCTTGCTGA TACCATCGCT GCACATATGC CATTGAAGTT AAATGATAAA
CAAGCTGTTC TGGAAATGTT CGATATCACC GAACGTCTGG AATACTTGAT GGCGATGATG
GAGTCGGAAA TCGATCTGTT ACAGGTCGAA AAGCGGATCC GTAATCGTGT TAAAAAACAG
ATGGAAAAGA GCCAGCGCGA GTACTATCTG AATGAGCAAA TGAAAGCTAT TCAGAAAGAA
CTGGGCGAGA TGGACGATAC GCCAGACGAG CATGAAGCGC TGAAGCGTAA AATTGAAGCG
GCTAAAATGC CGAAAGATGC ACGTGAAAAA ACCGAAGCGG AACTGCAAAA ACTGAAAATG
ATGTCGCCAA TGTCTGCGGA AGCAACCGTG GTACGTGGTT ACATCGACTG GATGTTGCAG
GTTCCTTGGA ATAGCCACAG CAAAGTTAAA AAAGATCTGG TTAAAGCACA AGAAGTTCTG
GATACCGACC ACTACGGTTT AGAGCGTGTT AAAGATCGTA TCTTGGAATA TCTCGCAGTC
CAGAGCCGGG TCAGCAAAAT TAAAGGGCCA ATCCTCTGCT TGGTTGGGCC TCCTGGGGTC
GGTAAAACCT CTCTAGGGCA GTCAATTGCT AAGGCAACGG GCCGCCAGTA TGTGCGTATG
GCATTGGGTG GGGTGCGTGA TGAAGCTGAA ATCCGTGGTC ACCGTCGGAC GTATATTGGT
TCTATGCCGG GTAAATTGAT CCAGAAGATG GCAAAAGTGG GTGTGAAAAA TCCACTCTTC
CTATTGGATG AGATCGATAA AATGGCATCG GATATGCGCG GAGATCCTGC TTCTGCGTTA
CTGGAGGTGC TGGATCCAGA ACAAAACGTT GCATTTAACG ATCACTACCT GGAAGTGGAT
TACGATCTCT CGGATGTGAT GTTTGTGGCG ACCTCTAACT CCATGAATAT TCCAGCCCCG
TTGCTGGATC GTATGGAAGT TATTCGTCTG TCCGGCTATA CCGAAGATGA GAAACTCAAT
ATTGCTAAAC AGCATTTGCT GCCAAAACAA TTTGAGCGTA ATGCCATCAA GAAAGGTGAG
TTGACCATTG ATGACAGCGC CATTATGAGC ATCATCCGTT ACTACACCCG TGAAGCTGGG
GTGCGTAGTT TGGAACGTGA AATTTCTAAA CTGTGTCGTA AGGCGGTAAA AAATCTGCTG
ATGGACAAAA CGGTTAAGCA CATTGAAATC AACGGGGATA ACCTAAAAGA TTTCCTTGGC
GTTCAGAAGG TTGACTATGG TCGTGCCGAT ACTGAAAACC GCGTAGGTCA GGTAACGGGT
CTAGCGTGGA CTGAAGTGGG TGGTGACTTA CTTACTATCG AGACCGCTTG TGTTCCAGGT
AAAGGCAAGT TGACTTATAC CGGCTCACTG GGTGAAGTGA TGCAAGAGTC GATTCAGGCG
GCTCTAACCG TAGTGCGTGC GCGTGCGGAT AAATTGGGTA TCAATCCTGA TTTCTATGAA
AAACGCGACA TCCACGTGCA TGTGCCGGAA GGGGCGACAC CTAAAGATGG CCCAAGCGCA
GGTATTGCAA TGTGCACAGC ACTGGTTTCT TGTCTGACGG GTAACCCCGT TCGTGCTGAT
GTTGCAATGA CGGGTGAGAT AACCTTACGC GGCTTAGTAT TGCCGATTGG CGGTTTGAAA
GAGAAATTAC TGGCCGCTCA CCGTGGTGGG ATCAAAGTGG TGTTGATTCC AGATGATAAC
AAACGTGATC TGGAAGAGAT TCCTGACAAT GTTATCGCTG ATCTGGAGAT CCACCCGGTT
AAACGAATTG ATGATGTTTT AGCCATTGCG TTGGAACACC CGGCCTTTGG TGCCCAGCCA
GTAGCGCCAA AATAG
 
Protein sequence
MNPERSERIE IPVLPLRDVV VYPHMVIPLF VGREKSIRCL EAAMDHDKRI MLVAQKEAST 
DEPGINDLFS VGTVASILQM LKLPDGTVKV LVEGLQRARI TTLSDSGEHF AAQAEYLESP
VMDDREQEVL VRTAINQFEG YIKLNKKIPP EVLASLHSID DAARLADTIA AHMPLKLNDK
QAVLEMFDIT ERLEYLMAMM ESEIDLLQVE KRIRNRVKKQ MEKSQREYYL NEQMKAIQKE
LGEMDDTPDE HEALKRKIEA AKMPKDAREK TEAELQKLKM MSPMSAEATV VRGYIDWMLQ
VPWNSHSKVK KDLVKAQEVL DTDHYGLERV KDRILEYLAV QSRVSKIKGP ILCLVGPPGV
GKTSLGQSIA KATGRQYVRM ALGGVRDEAE IRGHRRTYIG SMPGKLIQKM AKVGVKNPLF
LLDEIDKMAS DMRGDPASAL LEVLDPEQNV AFNDHYLEVD YDLSDVMFVA TSNSMNIPAP
LLDRMEVIRL SGYTEDEKLN IAKQHLLPKQ FERNAIKKGE LTIDDSAIMS IIRYYTREAG
VRSLEREISK LCRKAVKNLL MDKTVKHIEI NGDNLKDFLG VQKVDYGRAD TENRVGQVTG
LAWTEVGGDL LTIETACVPG KGKLTYTGSL GEVMQESIQA ALTVVRARAD KLGINPDFYE
KRDIHVHVPE GATPKDGPSA GIAMCTALVS CLTGNPVRAD VAMTGEITLR GLVLPIGGLK
EKLLAAHRGG IKVVLIPDDN KRDLEEIPDN VIADLEIHPV KRIDDVLAIA LEHPAFGAQP
VAPK