Gene YpAngola_A1958 details


Gene Information

Locus tag: YpAngola_A1958
Symbol: comEC
ID: 5800428
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pestis Angola
Kingdom: Bacteria
Replicon accession: NC_010159
Strand:
Start bp: 2036045
End bp: 2038336
Gene Length: 2292 bp
Protein Length: 763 aa
Translation table: 11
GC content: 47%
IMG OID: 641339881
Product: hypothetical protein
Protein accession: YP_001606431
Protein GI: 162420003
COG category: [R] General function prediction only
COG ID: [COG2333] Predicted hydrolase (metallo-beta-lactamase superfamily)
TIGRFAM ID: [TIGR00360] ComEC/Rec2-related protein; [TIGR00361] DNA internalization-related competence protein ComEC/Rec2
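
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; both are assumptions about this record's conventions, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 2036045
end_bp = 2038336
gene_length_bp = 2292
protein_length_aa = 763


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons in the CDS minus one for the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(start_bp, end_bp)
computed_protein_length = expected_protein_length(computed_gene_length)

print(computed_gene_length == gene_length_bp)        # coordinates agree with Gene Length
print(computed_protein_length == protein_length_aa)  # CDS length agrees with Protein Length
```

Under these assumptions, 2038336 - 2036045 + 1 = 2292 bp, and 2292 / 3 - 1 = 763 aa, which agrees with the listed values.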


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0001218
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value: 0.195131
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTTTTTA TCGCGTTATC AACAGATCGG GTCGCCGCTG CTGTCATTGT CGGGATCTTG 
CCTCTTATTT TTCTGCATCA ACTACCAGGG CCCACTATCA TTGGTTCTCT ATTAGCCCTA
AGTGCATTTT TGTGGTTAAG CCGCAATCGT TACTGCCAAT TTTTAGCGCT GATTGTGATT
AGCTTCCTGT GGGGGGTTTG GCATAGCAAT GGGATACTGA TGCAAACAGA AGCATTAACC
CAGGGGGATC AGCAGATAGT TGCGACAATA AACAGTTCAT CTCTTTCATG GGATGACGGC
CAGAAAGTTG TGATAAGTAT TCAGAAAATT AATGAAAAAC GGGTTTTCCC TCCAATAGCA
GTCAACGTTA AGTGGCCAGA ACGGTTGGAT CAATACTGTG CAGGGCAACG CTGGGCGTTT
AGGTTACGTA TGAGGGCAGT ACATAGCGTA CTGAATGAGG GCGGGTTTGA CAGTCAGCGC
TGGGCTATTG CTAACAGACG TCCACTACAA GGGCGCATCA TTGAGGCTAA ATTACTGGAT
GCAGAGTGTA ATTTTCGTCA GCAAATTATC AGTAGCATAG AGCAGCAACT GGTGGGATAT
GATCAGCGGC GGATCATGCT AGCACTGGCC TTTGGTGAAA GATCACAACT GAACAAAGAA
GAGTGGTCAC TCTTACGCTA CACCGGCACT GCGCACCTGA TGGCGATTTC CGGTTTACAT
ATTGCACTGG CGGCCTTATT TGGTGGGATG CTTGCCCGAT TAGTACAGCT CCTTTTTCCT
GTCAGTTGGA TTGGGCCTTT GCTACCGCTA CTGATTGGCT GGCTGATTGC TATGATTTAT
GTCTGGCTGG CAGGAGCAAA CTCACCAGCA ATCCGGGCGG CCATTGCGTT AACGCTGTGG
CTGCTGCTAC GTTTGTTCGG TATTTTATGT AGCCCATGGC AGGTGTGGAG GTGGGCTTTG
GGGCTAATTT TAGTCAGCGA CCCGCTAGCC GTATTATCAG ACAGCTTTTG GCTCTCCTGC
CTGGCGGTAT TTAGCCTGAT ATGTTGGTTT CACTTGGCCC CTGTTTCTTC TCGTTTCATT
ACTGGTTGGT ATGGCTTGGT TATCCGTTGG TTTCATTTGC AGTTTGGTAT GATGCTACTG
CTGATGCCGT TACAGATAGG GTTATTCCAC GGTATAAGTT TATTTTCTAT ACCCGCCAAT
CTGTGGGCGG TCCCAATAGT CTCATTGTTC ACCGTACCCT GCGTATTATT AGCATTGGCT
TTAGCATTGC TCCCTGCTGT TGCTGATATA TTTTGGTTCT TGGCTGATAT CTCGTTGACT
GTGGTGCTAT TCCCGCTCAA TCAGCTAAAA GAAGGCTGGT TACACACTGG CATGGCTTCT
GTTGCGATCG GTTATGGTGG TTGGCTGGCA TTATTTATCT GGCGCTTTCA ATGGTGGCGC
AGTCATCCTC TTGGTGTCAT TGTGCTGTGT ATGAACATGG TATTACTGAC TCAACGGCGT
GATGAGTATC ACTGGCGTGT GGATATGCTG GATATCAGGC ATGGTCTGGC TGTGGTGATT
GAGCGTGAGG GGAAAGCCAT TATCTATGAC ACTGGCAATC ATTGGTCTAC AGGTAATATG
GCCGCTATTG TCGTTTTACC GCTCCTTAAA TGGCGTGGCA TTACTGTTGA ACAGATTATC
CTTAGCCATG ACCATCAGGA CCATACTGGC GGTTTGGCTG TACTCTTGGA TGCTTTTCCG
CAGGCAACGG TACGTGCGCC TTTCTCTGTA AAAAACGTAG CCAATACTCT GCCTTGTAAA
CAGGGGGAGA GATGGCAGTG GCAAGGTTTA GATTTTGACG TGTTATGGCC GAAAGAACAG
GTGGTTAATG CTCAGAATAA TGACTCATGT GTCATTCGCA TTAATGATGG TAAACATAGT
GTATTGTTGA CCGGGGATCT TGAGTCCCAA GGAGAACGGC AGTTGGTGAG CGATATCCGG
GGAGAATTAA CATCAACGGT GCTGCAAGTG CCCCATCACG GCAGTAATAC TTCTTCAACC
GCGCCTTTTC TACGGGCAGT TAGCCCAGAA TTGGCCCTCG CTTCTGTTGC TCGTTATAAC
CAATGGCGAC TACCTGCGAA AAAAGTGATC AATCGCTATC AAAAAAATGG CATTATTTGG
CGTGATACAT CAGTATCAGG GCAATTAAGT GTATATTTTC ACTGCGATAC TTGGTTCGTT
AAAGGCTATC GGGAACAATT AAAACCACGT TGGTATCACC AGCGGTTTGG CGTTAGAGGT
CATAATGAGT AG
 
Protein sequence
MVFIALSTDR VAAAVIVGIL PLIFLHQLPG PTIIGSLLAL SAFLWLSRNR YCQFLALIVI 
SFLWGVWHSN GILMQTEALT QGDQQIVATI NSSSLSWDDG QKVVISIQKI NEKRVFPPIA
VNVKWPERLD QYCAGQRWAF RLRMRAVHSV LNEGGFDSQR WAIANRRPLQ GRIIEAKLLD
AECNFRQQII SSIEQQLVGY DQRRIMLALA FGERSQLNKE EWSLLRYTGT AHLMAISGLH
IALAALFGGM LARLVQLLFP VSWIGPLLPL LIGWLIAMIY VWLAGANSPA IRAAIALTLW
LLLRLFGILC SPWQVWRWAL GLILVSDPLA VLSDSFWLSC LAVFSLICWF HLAPVSSRFI
TGWYGLVIRW FHLQFGMMLL LMPLQIGLFH GISLFSIPAN LWAVPIVSLF TVPCVLLALA
LALLPAVADI FWFLADISLT VVLFPLNQLK EGWLHTGMAS VAIGYGGWLA LFIWRFQWWR
SHPLGVIVLC MNMVLLTQRR DEYHWRVDML DIRHGLAVVI EREGKAIIYD TGNHWSTGNM
AAIVVLPLLK WRGITVEQII LSHDHQDHTG GLAVLLDAFP QATVRAPFSV KNVANTLPCK
QGERWQWQGL DFDVLWPKEQ VVNAQNNDSC VIRINDGKHS VLLTGDLESQ GERQLVSDIR
GELTSTVLQV PHHGSNTSST APFLRAVSPE LALASVARYN QWRLPAKKVI NRYQKNGIIW
RDTSVSGQLS VYFHCDTWFV KGYREQLKPR WYHQRFGVRG HNE
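
The gene and protein sequences above can be checked against each other by translating the nucleotide sequence. The following is a minimal sketch, not the database's own method: it builds the codon table by hand (translation table 11 assigns the same amino acids to codons as the standard table, differing only in which start codons it permits) and does not handle alternative start codons, which does not matter here because the gene starts with ATG. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical. Only the first and last blocks of the gene sequence are used as inputs.

```python
from itertools import product

# Codon table in TCAG order; amino acid assignments shared by tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used in the record layout; uppercase."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq):
    """Fraction of G and C bases in the sequence (0.0 for an empty sequence)."""
    seq = clean(seq)
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases and last 12 bases of the gene sequence, as printed above.
first_residues = translate("ATGGTTTTTA TCGCGTTATC AACAGATCGG")
last_residues = translate("CATAATGAGT AG")

print(first_residues)  # MVFIALSTDR, the start of the protein sequence
print(last_residues)   # HNE, the end of the protein sequence
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the listed GC content.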