Gene YpAngola_0039 details


Gene Information

Locus tag: YpAngola_0039
Symbol:
ID: 5798367
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pestis Angola
Kingdom: Bacteria
Replicon accession: NC_010158
Strand:
Start bp: 28285
End bp: 29370
Gene Length: 1086 bp
Protein Length: 361 aa
Translation table: 11
GC content: 54%
IMG OID: 641337939
Product: Ser/Thr protein phosphatase family protein
Protein accession: YP_001604556
Protein GI: 162417843
COG category: [L] Replication, recombination and repair
COG ID: [COG0420] DNA repair exonuclease
TIGRFAM ID:
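
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon. Both assumptions agree with the values in this record but are not stated by it.

```python
# Hypothetical helpers; not part of any database API.

def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(28285, 29370)
print(length)                     # 1086
print(protein_length_aa(length))  # 361
```

Under these assumptions, 29370 - 28285 + 1 gives 1086 bp, and 1086 / 3 - 1 gives 361 aa, matching the Gene Length and Protein Length fields.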


Plasmid Coverage information

Num covering plasmid clones: 74
Plasmid unclonability p-value: 1.16094e-34
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 467
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACATTGC CATACGGGGT GATATCAGAT CCCCATTATC ATCGTTGGGA TGCTTTTGCG 
ACAACAAACG CTGACGGGCT GAACTCTCGA CTGGAGATCC AACTGGATGC CACGAAAGAA
GCTGCCAAAG CCATGAAAGC TGCGGGCTGC AAGCACATGC TGGTGGCTGG TGATACTTTC
CATGTTCGTG GTGCTATATC GCCTTCCGTC CTGCATTTCG TGACCGAAAC TTACGAGTGG
ATCATCAAAG AGTTGGGCCT CGAAGTGGTT ATGCTGGCCG GCAACCACGA CCTCGAAACC
AACGATTCCG TATACAGCGC CAATGCAGCG GCCTCTCTGC GCTCAATCGG TGTGGAAATC
GTCTGCGGCA AACGTCCTCA CTCCATCAAA ATTGGCGACG TTACCGTCCA TCTGATTAGC
TGGCGCAATA ACCACGCAGA GCTTATCAGC GACCTCAAAA CACTGCGTTC CGGGCTGGAT
GGCGACAATC ACGATGTCGT TGTGCATACC TCGATCAACA AAGCGATCCC TACCATGCCT
GATGTCGGCA TCGACGCACA GGAACTGAAA GATATCGGCT TCCGTTTGTT GTTGTCCGGA
CACTACCACA ACCACAAAGA AGTGCTGCCT GGGGTGGTTA GCATCGGGGC GCTGACGCAC
CAGAATTGGG GTGATGTTGG CTCGCTGGCT GGCTTCATGA TCGTCAACCC TGACGGCACA
TTCACCCACC ACGAAACCTC TGCACCCAAG TTCGTGAACC TTGAGGACGA TGTGGAAGAC
GATCAAATTC GCGGTAACTA CGTGCGCTTT CGTGCCGTTG TTGAGAACGA TGAAGAAGGC
ATCAAACTCC AGAACGTCCT GAAAACAATG GGCGCGAAGG GTGTCGTCTG CAACTTCATC
CGCAAGGCAT CGATGATGGA AGGCTCTGCC AGTACTGCGG AGACCAGCAA AATAGACAGC
CTGGGCGAGT CCGTCGCGGC GTACTGCAAG ATCGTTCACG ACACTGATGG CGGCTTCGAC
CTGAGCAAGC TGGACATGCT GTGTCAGGAA ATCCTGACCG AAGCGGAGAG TGCGGAGGCA
GTGTGA
 
Protein sequence
MTLPYGVISD PHYHRWDAFA TTNADGLNSR LEIQLDATKE AAKAMKAAGC KHMLVAGDTF 
HVRGAISPSV LHFVTETYEW IIKELGLEVV MLAGNHDLET NDSVYSANAA ASLRSIGVEI
VCGKRPHSIK IGDVTVHLIS WRNNHAELIS DLKTLRSGLD GDNHDVVVHT SINKAIPTMP
DVGIDAQELK DIGFRLLLSG HYHNHKEVLP GVVSIGALTH QNWGDVGSLA GFMIVNPDGT
FTHHETSAPK FVNLEDDVED DQIRGNYVRF RAVVENDEEG IKLQNVLKTM GAKGVVCNFI
RKASMMEGSA STAETSKIDS LGESVAAYCK IVHDTDGGFD LSKLDMLCQE ILTEAESAEA
V
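
The protein sequence is the translation of the gene sequence under translation table 11, and the GC content is the fraction of G and C bases in the gene sequence. The following is a minimal sketch of both calculations in plain Python. The helper names (`clean`, `gc_content`, `translate`) are hypothetical. The codon table is built from the compact amino acid string for the standard genetic code, whose codon-to-amino-acid assignments are the same in table 11 (the two tables differ in permitted start codons). The sketch assumes the sequence contains only A, C, G and T, apart from the spaces and line breaks used in the 10-base blocks above.

```python
# Hypothetical helpers for illustration; not taken from the source database.

BASES = "TCAG"
# Amino acids in TCAG codon order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for 10-base blocks; uppercase."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon.

    Alternative start codons are not special-cased; this gene starts with ATG.
    """
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]  # KeyError on non-ACGT bases
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGACATTGC CATACGGGGT GATATCAGAT"))  # MTLPYGVISD
```

Passing the full gene sequence to `translate` and `gc_content` is one way to cross-check the Protein sequence and GC content fields of this record.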