Gene YpAngola_A0423 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A0423 
Symbol 
ID5798887 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp438928 
End bp441348 
Gene Length2421 bp 
Protein Length806 aa 
Translation table11 
GC content39% 
IMG OID641338430 
Producthypothetical protein 
Protein accessionYP_001605029 
Protein GI162420804 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0374953 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.0190212 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGATATA ATCGTAACCA TATGCTAACG TTAATGTTCT CTTTATTAAT TTTTGATGCC 
TATGCAAATG AACCCAAAGT CTGTTTTTAT ATGGATGATG ACTATAATGG TGAGTCTCTC
TGCGCGGCTC AAGGGAACTC AGTTGCCAGT ATTCCGGATA AATGGAATGA TCGCATATCC
TCAATTTCCA TTCCCCACGG TTTAGTTGTC ACCGTCTATG AAGATGTCGA TTTTTTGGGC
GCTTCTCGGT CTTTTGAAGC CGATGTGGAT TTGGTTTCAG ATAAGGATTT GATATATTTA
AATGATAATA TAAGTGCATT TAAAATAAAA AAGGCAGTTT GTTTTTATGG CGAAAGTAAC
TTTACAGGTG ACTCGTTGTG CTTATCTAGT GGTGAGCAAT TTGACTTGTA TCGTGGTAAC
TATCCTGAAA GGAAGCAATC TCATTTGGTA AATCCCTTAA ATGATGAGGT ATATTCAATA
AAAATCCCTC CAGGAATGCA AACTACAGTC TATGAGGACG ATGATTATAA TGGTAAATAC
TTTGTTTTGA CAGAGGATTA CACACCTGAT GATTTGTTGA TAATAAGAAT GAACAATAAA
ATCAGTAGCA TGCGAGTTTC TCAAGATGAA GATTTCATTT GTGATCGTTT CTGTTCTATA
AAAAAATCAA TTAAATTTAA AATACAAAAA AGTTTTGGTT CCTATTGGAT TGACCCACGA
ATAAAATACA GAGATATATT GCTCGATTTT CAGTTATCTG ACACTGATGA TTATTCAATA
AAATTTTTTG ATGAGGGTCT TATAAAAGTT AAAAATTATA AATTATTTTT CTTTGATGGT
AATGAAAAAC ATGGTGAGGG TTTTTTGTTT AATTTAAGTG ACAAAAGTAG TAATTTGTCA
TTTTTATTGC GTTTTAACGG TGTTTTTTTT CAAATTCAAT TTATTGAATC ATTAAATAAT
GAAATTGTTT ATTCTTCTCC TTTGATTGGA ACGTTATTTG ATGATGCTAA CTCAGATGTT
ATTTTTGATA TCAATAATGT AGATGCAAAT AGCCCAATCG TTATTGATAA AATGGTATTA
ACCGTTGGTA AGGAGCATAA CCGGGCTGAA CGTAGTACTC TGGGGCTAGC GACTTGTTGG
GCAATACCCA TACTGAGTAT TTATAACTAT ATTGTTCAGG GGCATTGCAA TCAGGCAGAT
AAGTTTGTCC ATAATGCTGC CGATTTTTTT GGTAGCCCGG CAGATAAGAT ATTACAAATA
TTTGGTTCCT CTTTACCCTT ACCCCCAAAA AATAGCAATG AAACATTAGA TGACGAATCT
GCATTAATTA ATTTTATGTC TGATGTTAAA GGGACGTTAA CCTATATTAC GACCAATATG
GGCGAACACG CATTAACCGT ACCGGCAACG GCACTCGCGT GCAAAGCCTC AATGCGAGAG
AAAATCCTGC CTCATCTGCG TAACCGACGT GATCTTGCTC CAGGGTGTAT CGATTGGACT
TTGAGTATTT TAACCGATTT TACCTTGTTG TTTGGTGCCA GCATTGAGTA TTGGAATAGC
AAAAATTTTG GGCGAGTAAT AGAAAATATT GTTCGCGAAG GGGAGATCGG TTCTGCTGTG
GCAGATGTGG ACACAGCATC ACGTTTAATC GAGTCAGTTC AGGCCCATGT GGCTGAAGCG
ACAGAGGTTG CAGATATCAT TCATCTCAAA ACGGCATTCG ACTTTTCTCA ACTCAGTTAT
GTCAGTTATG TACGTCATAC CGAAGAAGAA GTGGTATCGC CACAAAGCGC GCAGGCGCTA
CCCTTAGGGC GTTATGAACT GGCACTGGTA GATTACCATT TTATCGAGAC GGTACCCAGA
ATTCGTCAGG ATGGCCGTTG GGTAGAACGT CCAGATCTGC ATTTTGATAT TGAAGTGATT
AGCGGGGTTC CCGAAGAAAC TGATGCGAGC CGGCAAGTGC TTTTGCCGAT TATCGAAGCG
TGGCAGCATA GCTATAGCCA ACAAGAATTA CTTTTTCACG CCTCGGATAT TCTTCCAGAA
AGTAGCCTGG CTCGATATGA AATTGAGCCG TTGTTAGAGG CTTCACGGCT GACCAGCGAA
GTTGCAATCA GTTGGTTAAG GACTAGCCGT GATGATTTTG TTTACGTCGT AGTCCGTTTG
GAGGGCGAGG TTATTAGCCT CACCATGGCA GTCGATATCA ATATTAATGA TGTGGGGGTT
GCCGGGTCAT TAACCAATCC AGATTATGTA ATAACACCTA ATGCAGAAGG GGCCGTCAGA
GGCGCGGGTA CGGCGGCGAT CCGAGCATTA GCTGACCACT TTAAACGTAA AGGGAAAATA
TCCTTGGTCT CTTCTGTTAT TAGTCAGCCA TCCGCTATTG TGAAGAAAAA AGTGGGTTTT
AGATTTATTG AAGAACTTTA A
 
Protein sequence
MRYNRNHMLT LMFSLLIFDA YANEPKVCFY MDDDYNGESL CAAQGNSVAS IPDKWNDRIS 
SISIPHGLVV TVYEDVDFLG ASRSFEADVD LVSDKDLIYL NDNISAFKIK KAVCFYGESN
FTGDSLCLSS GEQFDLYRGN YPERKQSHLV NPLNDEVYSI KIPPGMQTTV YEDDDYNGKY
FVLTEDYTPD DLLIIRMNNK ISSMRVSQDE DFICDRFCSI KKSIKFKIQK SFGSYWIDPR
IKYRDILLDF QLSDTDDYSI KFFDEGLIKV KNYKLFFFDG NEKHGEGFLF NLSDKSSNLS
FLLRFNGVFF QIQFIESLNN EIVYSSPLIG TLFDDANSDV IFDINNVDAN SPIVIDKMVL
TVGKEHNRAE RSTLGLATCW AIPILSIYNY IVQGHCNQAD KFVHNAADFF GSPADKILQI
FGSSLPLPPK NSNETLDDES ALINFMSDVK GTLTYITTNM GEHALTVPAT ALACKASMRE
KILPHLRNRR DLAPGCIDWT LSILTDFTLL FGASIEYWNS KNFGRVIENI VREGEIGSAV
ADVDTASRLI ESVQAHVAEA TEVADIIHLK TAFDFSQLSY VSYVRHTEEE VVSPQSAQAL
PLGRYELALV DYHFIETVPR IRQDGRWVER PDLHFDIEVI SGVPEETDAS RQVLLPIIEA
WQHSYSQQEL LFHASDILPE SSLARYEIEP LLEASRLTSE VAISWLRTSR DDFVYVVVRL
EGEVISLTMA VDININDVGV AGSLTNPDYV ITPNAEGAVR GAGTAAIRAL ADHFKRKGKI
SLVSSVISQP SAIVKKKVGF RFIEEL