Gene YpAngola_A2834 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A2834 
SymbollacZ 
ID5801306 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp2973072 
End bp2976224 
Gene Length3153 bp 
Protein Length1050 aa 
Translation table11 
GC content50% 
IMG OID641340686 
Productbeta-D-galactosidase 
Protein accessionYP_001607216 
Protein GI162421202 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3250] Beta-galactosidase/beta-glucuronidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.677484 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value0.179493 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCAACTGA GTCTTCCCCA GATTTTGTCC CGCCGAGATT GGGAAAACCC GCAGATCACA 
CAGTATCATC GCCTGGAGGC CCACCCGCCT TTTCACAGTT GGCGTGATGT AGAATCTGCC
CAGAAGGATC GTCCTTCACC ACAGCAACAA ACACTCAATG GGCTATGGTC ATTCAGCTAT
TTCACACAAC CTGAAGCGGT ACCCGAGCAC TGGGTGAGGT GCGATTTAGC CGAGGCAAAG
CCGCTCCCTG TACCGGCTAA CTGGCAACTT CATGGTTATG ACGCACCAAT TTACACCAAT
ATACAATACC CTATTCCCGT CAACCCACCA CGGGTCCCGG ATCTAAATCC AACGGGTTGC
TATTCCCGTG ATTTCACGTT AGAGCCAAGC TGGTTGGCAT CGGGTAAGAC TCGCATTATT
TTTGACGGTG TCAGTTCTGC ATTTTATCTG TGGTGTAATG GGCAATGGGT AGGTTATTCA
CAAGACAGCC GCCTACCTGC TGAGTTCGAT CTCACCCCCT ATTTGCAGGC TGGCAGTAAC
CGTATCGCAG TTTTAGTTCT GCGCTGGAGT GATGGGAGTT ATCTTGAAGA TCAAGATATG
TGGCGTATGA GCGGAATTTT TCGTGATGTG AAATTGTTGC ATAAACCCGA GATTCACTTA
CGGGATATCC ACATCATGAC GCATCTATCC CCTGAATTCA CCTCTGCAAA TTTAGAGGTA
ATGGCGGCCG TCAATATCCC CTCTCTACAG CTCAATGATC CGCAGGTGAC CGGATCCTAT
CAGCTCCGTG TACAACTTTG GTTAGCCGAT AAATTGGTCG CCAGTTTACA ACAGCCTTTA
GGCACCCAAG CCATTGATGA ACGAGGTCCT TATACTGATC GTACCCAGCT AGTATTGCGA
ATAGATCAGC CTCTGCTCTG GAGTGCCGAG CAGCCGACGC TATACCGAGC CGTGGTTTCC
TTGCTCAATC ATCAGCAAGA ATTGATTGAG GCCGAAGCCT ATGACGTGGG TTTCAGGCAA
GTGGCAATCC ATCAAGGCTT GCTTAAAATC AATGGCAAAG CGGTGCTGAT CAGAGGGGTG
AATCGACATG AACATCACCC GCAAACAGGT CAGGCCATTG ATGAAGAGAG TCTGTTGCAA
GACATTTTAT TAATGAAACA GCATAATTTT AATGCTGTGC GCTGCTCCCA CTATCCCAAT
CATCCTTTAT GGTACCGCCT TTGTGACCGC TATGGTTTGT ATGTGGTTGA TGAAGCGAAT
ATTGAGACAC ACGGTATGCA GCCCATGAGC AGGCTGTCCG ATGACCCAAG CTGGTTTTCA
GCTTTCAGTG AACGGGTGAC GCGGATGGTT CAGCGAGATC GCAACCATCC ATGCATTATT
ATCTGGTCAC TGGGCAATGA ATCAGGCCAT GGCGCAACCC ATGATGCCCT CTATCGTTGG
ATAAAAACCA ATGACCCCAC CCGCCCTGTG CAATATGAAG GGGGCGGTGC CAACACCTTA
GCGACCGACA TTCTGTGTCC GATGTATGCC CGTGTTGATG AAGACCAGCC CTTTCCTGCC
GTCCCCAAGT GGTCAATCAA AAAATGGATT GGCTTACCGA ATGAATCTCG CCCCTTGATC
CTATGTGAAT ACGCCCATGC GATGGGCAAT AGCTTCGGTG GATTTGCCCG CTATTGGCAG
GCATTTCGTC AGTACCCGCG CTTACAGGGC GGGTTTATTT GGGACTGGGT AGACCAAAGT
CTGACTCATC ATAATGACCA TGGTCAGCCT TATTGGGCGT ATGGGGGTGA TTTTGGTGAT
ACTCCCAATG ACCGCCAGTT CTGCATGAAC GGATTAGTCT TCCCTGACCG CAGCCCGCAC
CCGAGCCTTT ATGAAGCGCA GTGCGCACAG CAATTCTTCC AATTTTCGTT GCTGAGTACG
ACCCCGTTGG TGATCAACAT TACCAGTGAA TATTTGTTCC GAGAGAGTGA TAACGAACAA
TTATATTGGC GGATAATGTT AGAGGGAGAA TCCGTGTTGG AGGGTAGCCA ACCCCTGAAT
TTGTCGCCTG AAAGCTCACA GTGCTACAGG TTGGCAGAGA AATTACCCAC GCTTAATAAA
CCTGGGCAGC TATGGCTGAA TGTTGAGATA AGGCAACCAA AAGAAACCCC GTGGTCCCCT
GCTCAACATC GCAGTGCCTG GCATCAATGG CGCTTACCAC AACCACTCTT TTCGCCGTCC
AGTGATCTGA CCAATGCTAC AGCGCATTAT GCCCCTCAAC TGCAACATAA CCTTCAACTA
CAACATGACC TTCAACTGCA GCAAGATGAA CAGCATATTA AGGTGACTTA TCAGCAACAA
TGCTGGCAAT TCAGTCGTCA AACGGGGCGG TTGGCGCAAT GGTGGGTGGC GGATAAACCG
ATGCTACTGC GCCCACTACA AGATCAATTT GTGCGTGCGC CGCTGGATAA CGATATCGGT
ATCAGCGAAG CTACGCATAT TGACCCCAAT GCTTGGGTTG AGCGCTGGAA GAAAGCCGGA
ATGTATCAAC TCCAGCAACG CTGCCTCTCT CTACACGTAG ATCATTTATC CCATTCAGTA
CAAATCAGTG CCGAATACGG TTATGAATTC GAGCAAGAGC CCTTGCTACA CAGCCATTGG
GTATACCGTT TTGACCGACA TGGCCGTATG ACCATTGATG TTAACGTCCG TATCGCTACC
TCACTCCCTG CGCCAGCCAG AATTGGCATG TGTTGCCAAC TGGCTGATAT CTCACCTACG
GTTGAATGGC TAGGGTTGGG GCCACATGAA AACTACCCTG ATCGGCAGCT TGCAGCACAA
TATGGGCACT GGTCCCTGCC ATTAGAGCAG ATGCACACCG CGTATATTTT CCCCAGTGAG
AATGGCTTGC GCTGCAATAC CCATACGCTG AATTATGGCC GCTGGACGTT AACGGGCGAT
TTCCACTTTG GTATAAGTCG CTACAGCACC CAGCAACTGA TGGTGACCTC CCATCAACAT
CTATTGGAAC CCGAAGAGGG CACCTGGCTC AATATTGATG GTTTCCATAT GGGGGTGGGC
GGTGATGATT CATGGAGCCC GAGTGTTCAC ATTGATGACA TACTCACCCG TGAAACCTAT
CAGTACCAAA TCTGTTGGCA ATACAAGGTG TAA
 
Protein sequence
MQLSLPQILS RRDWENPQIT QYHRLEAHPP FHSWRDVESA QKDRPSPQQQ TLNGLWSFSY 
FTQPEAVPEH WVRCDLAEAK PLPVPANWQL HGYDAPIYTN IQYPIPVNPP RVPDLNPTGC
YSRDFTLEPS WLASGKTRII FDGVSSAFYL WCNGQWVGYS QDSRLPAEFD LTPYLQAGSN
RIAVLVLRWS DGSYLEDQDM WRMSGIFRDV KLLHKPEIHL RDIHIMTHLS PEFTSANLEV
MAAVNIPSLQ LNDPQVTGSY QLRVQLWLAD KLVASLQQPL GTQAIDERGP YTDRTQLVLR
IDQPLLWSAE QPTLYRAVVS LLNHQQELIE AEAYDVGFRQ VAIHQGLLKI NGKAVLIRGV
NRHEHHPQTG QAIDEESLLQ DILLMKQHNF NAVRCSHYPN HPLWYRLCDR YGLYVVDEAN
IETHGMQPMS RLSDDPSWFS AFSERVTRMV QRDRNHPCII IWSLGNESGH GATHDALYRW
IKTNDPTRPV QYEGGGANTL ATDILCPMYA RVDEDQPFPA VPKWSIKKWI GLPNESRPLI
LCEYAHAMGN SFGGFARYWQ AFRQYPRLQG GFIWDWVDQS LTHHNDHGQP YWAYGGDFGD
TPNDRQFCMN GLVFPDRSPH PSLYEAQCAQ QFFQFSLLST TPLVINITSE YLFRESDNEQ
LYWRIMLEGE SVLEGSQPLN LSPESSQCYR LAEKLPTLNK PGQLWLNVEI RQPKETPWSP
AQHRSAWHQW RLPQPLFSPS SDLTNATAHY APQLQHNLQL QHDLQLQQDE QHIKVTYQQQ
CWQFSRQTGR LAQWWVADKP MLLRPLQDQF VRAPLDNDIG ISEATHIDPN AWVERWKKAG
MYQLQQRCLS LHVDHLSHSV QISAEYGYEF EQEPLLHSHW VYRFDRHGRM TIDVNVRIAT
SLPAPARIGM CCQLADISPT VEWLGLGPHE NYPDRQLAAQ YGHWSLPLEQ MHTAYIFPSE
NGLRCNTHTL NYGRWTLTGD FHFGISRYST QQLMVTSHQH LLEPEEGTWL NIDGFHMGVG
GDDSWSPSVH IDDILTRETY QYQICWQYKV