Gene YpAngola_0076 details


Gene Information

Locus tag: YpAngola_0076
Symbol:
ID: 5798372
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pestis Angola
Kingdom: Bacteria
Replicon accession: NC_010158
Strand:
Start bp: 50960
End bp: 52216
Gene Length: 1257 bp
Protein Length: 418 aa
Translation table: 11
GC content: 54%
IMG OID: 641337971
Product: hypothetical protein
Protein accession: YP_001604588
Protein GI: 162417876
COG category:
COG ID:
TIGRFAM ID: [TIGR01547] phage terminase, large subunit, PBSX family
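
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the gene length includes one terminal stop codon; the function names are illustrative and not part of the record.

```python
def gene_length_bp(start_bp, end_bp):
    """Feature length, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_length):
    """Residue count, assuming the CDS ends with a single stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return gene_length // 3 - 1


length_bp = gene_length_bp(50960, 52216)
length_aa = protein_length_aa(length_bp)
print(length_bp, length_aa)  # 1257 418
```

Both values agree with the Gene Length and Protein Length fields of this record.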


Plasmid Coverage information

Num covering plasmid clones: 476
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 619
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCATCC CGTCGTCTCT GAGTCTCGTA CAGCTGCATT CTGGGCAGAT GCAAGTCTTC 
CAGTCGCCAC ATCGTTTCAA AGTAGTGTGT GCGGGTCGAC GCTGGGGTAA ATCCCGGTTG
TCGATTTCCA CCATCATTCG CGCGGCAGCC AAAGAGAAGA AGCAACGTGT CTGGTATGTC
GCACCGACGT ACCAGATGGC TCGCCAGATC TTGTGGGATG ACCTGCAGGA AGTTCTGCCG
CGTAAATGGG TTCGTAAGAA AAACGACACC ACGATGACCA TCGTGCTGAA GAACGGCTCT
GAAATCGCGC TGAAAGGTGC GGATAAGCCC GATACGCTTC GTGGTGTGGC ACTGCACTTT
GTGGTGCTCG ATGAATTTCA GGATATGAAG CCGGATACCT GGTACAAGGT ACTTCGTCCG
ACACTGTCCT CAACCCGTGG CGGTGCGCTG ATCATCGGTA CGCCAAAAGG CTTCTCCGAG
TTCCACAAGC TGTGGACTAT CGGTCAGAAC AAAGATTTGC AACGCAAAGG GCAGTGGAAG
AGCTGGCAGT TCGTTACGGC CGATTCTCCG TTCGTACCGA GCGCGGAAAT CGAAGCGGCG
AAGAACGATA TGGACCCTAA ATCGTTCGCA CAGGAATACC TGGCCAGCTT CGAAAACATG
TCCGGACGCG TTTACTACCC GTTCGACCGC AATGTGCATG TGAAGCCACT CCAGTTCAAT
CCGAAACTGC CGATCTGGGT TGGTCAGGAC TTCAACATCG ACCCTATGTC ATCGGTCATC
CTGCAGCCGC AGCCAAATGG TGAGTTGTGG GCCGTGGACG AGGTTGTGCT GTTCTCTTCC
AACACGGCTG AAGTGTGTGA TGAGCTGGAG CGCCGTTTCT GGCGCTGGAA GTCTCAGGTC
ACTATCTTCC CTGACCCGGC TGGTGCGTAT CGCCAGCACG CACGCGGCGA ATCTGACGTC
GATATATTCA AGGAAAAAGG TTTCCTCCGA GTCGATTATC CGAAGAAGCA CCCGCCTATC
GCAGACCGTG TGAACGCCGT GAACCGGATG TTGATGAGTG CCTCGGGCGA AACCCGGTTG
TACATCGATC CGAAGTGCAA ACATCTCATC GACTCGCTGG AGAAGGTGAT CTACAAGCCA
GGCTCACGCG ATATGGATAA GACTGGCGGC ATCGAACACA GTGCGGATGC GTTGGGTTAT
CCGGTTCATC GTAGGTATCC GGTGAAAAAT CGTGTTATTC TTGGTGGATC TAGATAA
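
A GC content figure such as the one in the Gene Information fields can be recomputed from a sequence listed in this 10-base block layout. This is a minimal sketch: `gc_percent` is a hypothetical helper, and it assumes the sequence contains only A, C, G and T plus whitespace. The demonstration uses only the first two blocks of the gene sequence, not the full gene.

```python
def gc_percent(sequence):
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First two 10-base blocks of the gene sequence above.
print(gc_percent("ATGGCCATCC CGTCGTCTCT"))  # 60.0
```

Passing the complete gene sequence to the same function gives the whole-gene value to compare with the recorded GC content.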
 
Protein sequence
MAIPSSLSLV QLHSGQMQVF QSPHRFKVVC AGRRWGKSRL SISTIIRAAA KEKKQRVWYV 
APTYQMARQI LWDDLQEVLP RKWVRKKNDT TMTIVLKNGS EIALKGADKP DTLRGVALHF
VVLDEFQDMK PDTWYKVLRP TLSSTRGGAL IIGTPKGFSE FHKLWTIGQN KDLQRKGQWK
SWQFVTADSP FVPSAEIEAA KNDMDPKSFA QEYLASFENM SGRVYYPFDR NVHVKPLQFN
PKLPIWVGQD FNIDPMSSVI LQPQPNGELW AVDEVVLFSS NTAEVCDELE RRFWRWKSQV
TIFPDPAGAY RQHARGESDV DIFKEKGFLR VDYPKKHPPI ADRVNAVNRM LMSASGETRL
YIDPKCKHLI DSLEKVIYKP GSRDMDKTGG IEHSADALGY PVHRRYPVKN RVILGGSR
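
The relationship between the gene sequence and the protein sequence can be illustrated by translating codons directly. The sketch below builds the standard codon-to-amino-acid mapping. It assumes this mapping matches the amino acid assignments of translation table 11, which to the best of our knowledge differs from the standard table only in its permitted start codons. The `translate` function is a hypothetical helper, not a library API.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    residues = []
    for i in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence.
print(translate("ATGGCCATCC CGTCGTCTCT GAGTCTCGTA"))  # MAIPSSLSLV
# Last 27 bases of the gene sequence, ending in the TAA stop codon.
print(translate("CGTGTTATTC TTGGTGGATC TAGATAA"))  # RVILGGSR
```

The two outputs match the first ten and the last eight residues of the protein sequence listed above.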