Gene YpAngola_A2654 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A2654 
Symbologl 
ID5801126 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp2776051 
End bp2777217 
Gene Length1167 bp 
Protein Length388 aa 
Translation table11 
GC content46% 
IMG OID641340521 
Productoligogalacturonate lyase 
Protein accessionYP_001607060 
Protein GI162418416 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0823] Periplasmic component of the Tol biopolymer transport system 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.619254 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value0.0112854 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAAAAG GTAAACAAAT CCCCCTGACA TATCACACTT ATCAGGATGC CTCTACCGGC 
GCACAAGTGA CCCGACTGAC GCCCCCTGAT GTCACTTGTC ACCGTAACTA TTTCTATCAG
AAATGTTTCA CCCGTGATGG TAGTAAACTG TTGTTCGGTG GTGCTTTTGA TGGCCCATGG
AATTACTATT TACTGGATCT TAACAGCCGG GTCGCAACAC AACTGACTGA AGGGCGCGGT
GATAATACCT TTGGGGGTTT CCTCTCACCA GAAGATGATG CGCTGTTCTA CGTGAAGGAT
GGCCGTAATT TGATGCGTGT TGATCTCGCC ACATTAGAAG AGAACGTGGT TTATCAGGTG
CCAGAAGAGT GGGTCGGTTA TGGAACGTGG GTAGCAAATT CTGATTGTAC TAAATTGGTC
GGTATTGAGA TTAAGCGGGA AGACTGGCAA CCGTTGACAG ATTGGAAGAA ATTCCACGAA
TTTTATTTCA CCAAACCGTG TTGCCGCCTG ATGCGCGTTG ATTTGAAAAC TGGCGAATCA
GCGGTTATCC TGCAAGAAAA TCAGTGGCTT GGTCACCCTA TTTATCGCCC TTATGACGAT
AGCACCGTGG CGTTCTGCCA TGAAGGGCCA CACGATCTGG TTGATGCACG TATGTGGCTC
ATCAATGAAG ATGGCAGCAA TATGCGTAAA GTGAAAACCC ATGCAGAAGG TGAGAGCTGT
ACGCATGAAG TCTGGGTTCC AGATGGTTCA GCATTGGTTT ATGTCTCTTA TTTGAAAGGC
AGCTCCGATC GTTTTATTTA TAGTGCTGAT CCAGTGACAT TGAAAAATCG TCAATTGACT
TCTATGCCTG CCTGCTCACA TTTAATGAGT AATTATGATG GCACACTGAT GGTAGGTGAT
GGGTCTGACG CACCGGTTGA TGTGCAAGAT AACAGCGGTT ATAAGATCGA AAATGATCCT
TTCTTGTATG TCTTCAATAT AAAGAATGGG ACACAGCATC GTATTGCTCG ACATGATACA
TCATGGCAGG TATTTGAAGG TGATCGCCAG GTTACGCATC CTCATCCTTC TTTTACGCCT
GATGATAAGC AGGTCCTGTT TACGTCAGAC GTCCACGGCA AGCCCGCATT ATATATGGTG
ACATTGCCTG AGTCTGTTTG GCAATAG
 
Protein sequence
MAKGKQIPLT YHTYQDASTG AQVTRLTPPD VTCHRNYFYQ KCFTRDGSKL LFGGAFDGPW 
NYYLLDLNSR VATQLTEGRG DNTFGGFLSP EDDALFYVKD GRNLMRVDLA TLEENVVYQV
PEEWVGYGTW VANSDCTKLV GIEIKREDWQ PLTDWKKFHE FYFTKPCCRL MRVDLKTGES
AVILQENQWL GHPIYRPYDD STVAFCHEGP HDLVDARMWL INEDGSNMRK VKTHAEGESC
THEVWVPDGS ALVYVSYLKG SSDRFIYSAD PVTLKNRQLT SMPACSHLMS NYDGTLMVGD
GSDAPVDVQD NSGYKIENDP FLYVFNIKNG TQHRIARHDT SWQVFEGDRQ VTHPHPSFTP
DDKQVLFTSD VHGKPALYMV TLPESVWQ