Gene YpAngola_A1120 details


Gene Information

Locus tag: YpAngola_A1120
Symbol:
ID: 5799584
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pestis Angola
Kingdom: Bacteria
Replicon accession: NC_010159
Strand:
Start bp: 1156054
End bp: 1158123
Gene Length: 2070 bp
Protein Length: 689 aa
Translation table: 11
GC content: 50%
IMG OID: 641339095
Product: hypothetical protein
Protein accession: YP_001605666
Protein GI: 162421317
COG category: [R] General function prediction only
COG ID: [COG3107] Putative lipoprotein
TIGRFAM ID:
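
The gene and protein lengths listed above can be cross-checked against the start and end coordinates. The following is a minimal sketch, not part of the original record; it assumes the coordinates are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 1156054
end_bp = 1158123

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: an unspliced CDS whose final codon is a stop codon,
# so the protein has one residue fewer than the number of codons.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 2070 0 689
```

Under these assumptions the result agrees with the listed Gene Length (2070 bp) and Protein Length (689 aa).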


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 67
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGAAAT TACTGCTCGA TCGTGTTGAT TCATTGTTTC ATCCGAATTA CCGATTTAAT 
ATTGAGCATC TTGAAAAAAA CATCACTGGA TACAGTATGC TTTCCTCAAC GTTCGTTCGT
TCTAAAGCAG GGCTTGTTCC TGTCATTCTG GCTGCTCTTA TTTTAGCGGC CTGTACAGGC
GATGCGCCAC AAACGCCGCC CCCCGTAAAT ATACAGGACG AAGCAAGCGC TAACTCTGAT
TATTATCTGC AACAGTTGCA GCAGAGCAGT GATGATAACA AGGCTGACTG GCAATTACTT
GCCATTCGTG CCCTATTACG TGAGGCAAAA GTGCCTCAGG CCGCCGAACA ACTCAGCACT
CTCCCTGCAA ACCTGAGCGA TACACAGCGC CAGGAACAGC AATTGCTGGC GGCTGAACTG
TTGATCGCGC AGAAAAATAC GCCAGCGGCG GCTGATATTC TTGCCAAATT AGAGGCAACT
CAACTCTCAG CTAACCAAAA AGTACGCTAC TATCAGGCCC AAATTGCCGC CAATCAGGAT
AAAGCCACCC TGCCATTGAT TCGTGCATTT ATCGCTCAGG AACCATTACT GACAGATAAA
GCCCATCAAG ATAATATTGA TGGCACTTGG CAGTCACTGT CCCAACTGAC ACCACAAGAA
TTAAATACCA TGGTGATCAA CGCAGACGAA AATGTGCTGC AAGGCTGGCT GGATTTACTG
CGTGTTTATC AAGATAACAA GCAAGACCCA AAGCTACTGA AAGCCGGGAT TAAAGACTGG
CAAACCCGTT ACCCACAAAA CCCGGCAGCG AAAAATCTGC CAACTGCATT AACTCAGATC
AGTAATTTCA GCCAGGCATC CACCGCCAAG ATTGCTCTGC TGCTGCCATT AAGTGGCCCG
GCACAAGTAT TCGCCGATGC CATCCAGCAA GGTTTTACTG CCGCCCAAAA TGGCTCAGCG
GTAACAGCTT CAGTACCAGT AACGCCAAAT GTGACGGAAA GCAGCCCAAC GGATACTGCT
GCGGTTGTTT CGGATGATAC CCCGGCCACC CTTCCGGCCC CAGTGCCCCC CCCCGTCGTC
ACCAACGCCC AAGTGAAAAT CTACGATACC AACACTCAAC CACTGGCAGC GCTATTGGCT
CAAGCCCAGC AAGATGGTGC AACACTGGTC GTTGGCCCTC TGCTAAAACC CGAAGTTGAG
CAACTCAGTG CCACCCCAAG CACATTGAAT ATTCTGGCGT TGAACCAACC AGAAGCCAGT
AATAACAGCC CAAACATCTG TTACTTTGCC CTATCGCCAG AAGATGAAGC CCGTGATGCA
GCGCATCACC TGTGGGAACA GCAAAAAAGA ATGCCGCTGT TGCTGGTGCC TCGTGGTGCC
CTTGGTGAAC GCATTGCCAA AGCCTTCGCT GACGAGTGGC AAAAACAAGG TGGGCAAACG
GTATTACAAC AGAACTTCGG TTCAACCACT GAGTTGAAGC AATCCATCAA CAGTGGTGCC
GGTATCCGCC TGACCGGTAC CCCCGTTAGC GTTTCTAATG TAGCCGCCGC CCCGGCCTCC
GTCACTATTG CGGGCCTGAC CATTCCAGCA CCGCCAATCG ATGCACCGGT AGTGTCAACG
TCTTCGAGCG GTAACATTGA TGCGGTCTAT ATCATTGCGA CGCCATCTGA ATTAACCCTG
ATTAAGCCAA TGATTGATAT GGCAACCAGT TCACGCAGTA AACCTGCGCT GTTTGCCAGT
TCACGTAGCT ACCAGGCTGG CGCTGGCCCA GATTACCGTC TGGAAATGGA AGGTATACAG
TTTAGTGATA TTCCGCTGAT GGCCGGCTCT AACCCCGCTT TGCTGCAACA AGCATCGGCT
AAATACGCTA ACGATTATTC TCTGGTACGC TTATACGCCA TGGGGATTGA TGCCTGGGCA
TTGGCAAATC ATTTTTCTGA AATGCGCCAA ATCCCTGGCT TCCAAGTCAA AGGGGTCACC
GGTGATTTAA CTGCATCATC AGATTGTGTT ATCACCCGCA AGCTACCTTG GTTACAATAT
CGCCAGGGAA TGGTGGTGCC ACTCGCATAA
 
Protein sequence
MQKLLLDRVD SLFHPNYRFN IEHLEKNITG YSMLSSTFVR SKAGLVPVIL AALILAACTG 
DAPQTPPPVN IQDEASANSD YYLQQLQQSS DDNKADWQLL AIRALLREAK VPQAAEQLST
LPANLSDTQR QEQQLLAAEL LIAQKNTPAA ADILAKLEAT QLSANQKVRY YQAQIAANQD
KATLPLIRAF IAQEPLLTDK AHQDNIDGTW QSLSQLTPQE LNTMVINADE NVLQGWLDLL
RVYQDNKQDP KLLKAGIKDW QTRYPQNPAA KNLPTALTQI SNFSQASTAK IALLLPLSGP
AQVFADAIQQ GFTAAQNGSA VTASVPVTPN VTESSPTDTA AVVSDDTPAT LPAPVPPPVV
TNAQVKIYDT NTQPLAALLA QAQQDGATLV VGPLLKPEVE QLSATPSTLN ILALNQPEAS
NNSPNICYFA LSPEDEARDA AHHLWEQQKR MPLLLVPRGA LGERIAKAFA DEWQKQGGQT
VLQQNFGSTT ELKQSINSGA GIRLTGTPVS VSNVAAAPAS VTIAGLTIPA PPIDAPVVST
SSSGNIDAVY IIATPSELTL IKPMIDMATS SRSKPALFAS SRSYQAGAGP DYRLEMEGIQ
FSDIPLMAGS NPALLQQASA KYANDYSLVR LYAMGIDAWA LANHFSEMRQ IPGFQVKGVT
GDLTASSDCV ITRKLPWLQY RQGMVVPLA
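
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It is an illustration under stated assumptions, not the method used to produce this record. The codon table below uses the amino acid assignments of the standard genetic code, which translation table 11 (bacterial, archaeal and plant plastid code) shares; table 11 differs in its set of permitted start codons, which this sketch does not model. The names `CODON_TABLE` and `translate` are hypothetical helpers. Only the first and last lines of the gene sequence are used as input.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in NCBI order (bases cycle T, C, A, G,
# with the third codon position varying fastest). "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (such as the gaps between 10-base blocks) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three and last three 10-base blocks of the gene sequence.
first_blocks = "ATGCAGAAAT TACTGCTCGA TCGTGTTGAT"
last_blocks = "CGCCAGGGAA TGGTGGTGCC ACTCGCATAA"

print(translate(first_blocks))  # prints: MQKLLLDRVD
print(translate(last_blocks))   # prints: RQGMVVPLA
```

Both results match the corresponding ends of the protein sequence: the first ten residues (MQKLLLDRVD) and the last nine (RQGMVVPLA), with the final TAA codon read as the stop. The last line of the gene sequence begins at base 2041, so it is in the same reading frame as the start of the gene.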