Gene YpAngola_A0300 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A0300 
Symbolgcp 
ID5798764 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp317808 
End bp318821 
Gene Length1014 bp 
Protein Length337 aa 
Translation table11 
GC content52% 
IMG OID641338311 
Productputative DNA-binding/iron metalloprotein/AP endonuclease 
Protein accessionYP_001604917 
Protein GI162419198 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0533] Metal-dependent proteases with possible chaperone activity 
TIGRFAM ID[TIGR00329] metallohydrolase, glycoprotease/Kae1 family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000000060043 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones61 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGAGTAT TGGGTATAGA AACGTCCTGC GATGAAACCG GAATTGCAGT CTATGACGAT 
AAAGCCGGTC TGTTAGCTAA CCAATTGTAC AGTCAGGTAA AATTACACGC TGACTACGGT
GGTGTCGTAC CTGAACTGGC CTCCCGTGAT CACGTGCGTA AGACCGTACC CCTGATTCAG
GCGGCGTTGA AAGAAGCCAA TTTGAGCGCC AAAGATATCG ATGCGGTGGC TTATACAGCA
GGCCCTGGTT TGGTGGGGGC GTTGTTAGTG GGGGCGACCA TTGGTCGGGC ATTAGCTTTC
GCATGGGGAG TCCCTGCGGT ACCGGTTCAT CATATGGAAG GCCATTTGTT GGCACCAATG
CTGGAAGAAA ATGCACCAGA GTTCCCGTTT GTTGCGCTGC TGGTATCCGG CGGCCATACC
CAATTGATTA GTGTGACCGG TATTGGTGAA TATTTGCTGT TGGGCGAATC TGTTGATGAT
GCGGCAGGCG AGGCCTTTGA TAAGACAGCA AAACTACTAG GGCTGGATTA CCCCGGTGGG
CCAATGTTGT CGCGTATGGC TCAACAAGGT ACTGTGGGGC GTTTTACCTT TCCGCGCCCA
ATGACTGATC GCCCTGGGCT GGATTTTAGT TTTTCCGGCC TGAAAACGTT CGCGGCGAAT
ACTATTCGTG CTAATGGTGA TGATGACCAA ACCCGTGCCG ATATCGCCCG AGCATTCGAA
GATGCGGTGG TTGATACGTT GGCGATCAAG TCTAAACGTG CATTAGATCA GACGGGTTTT
AAACGCTTAG TCATTGCCGG GGGCGTGAGC GCTAACCAGA CACTGCGATT AAAACTGGCC
GATATGATGC AAAAACGAGG CGGTGAAGTA TTCTATGCCC GACCCGAATT TTGCACTGAC
AATGGCGCGA TGATTGCCTA TGCCGGGATG GTCAGGTTGC GAAGTAACCT GAACAGTGAG
TTGAGTGTGT CGGTGCGGCC GCGGTGGCCG TTATCTGAGT TACCAAAAGT CTGA
 
Protein sequence
MRVLGIETSC DETGIAVYDD KAGLLANQLY SQVKLHADYG GVVPELASRD HVRKTVPLIQ 
AALKEANLSA KDIDAVAYTA GPGLVGALLV GATIGRALAF AWGVPAVPVH HMEGHLLAPM
LEENAPEFPF VALLVSGGHT QLISVTGIGE YLLLGESVDD AAGEAFDKTA KLLGLDYPGG
PMLSRMAQQG TVGRFTFPRP MTDRPGLDFS FSGLKTFAAN TIRANGDDDQ TRADIARAFE
DAVVDTLAIK SKRALDQTGF KRLVIAGGVS ANQTLRLKLA DMMQKRGGEV FYARPEFCTD
NGAMIAYAGM VRLRSNLNSE LSVSVRPRWP LSELPKV