Gene YpAngola_A1705 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A1705 
Symbol 
ID5800174 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp1756788 
End bp1759940 
Gene Length3153 bp 
Protein Length1050 aa 
Translation table11 
GC content46% 
IMG OID641339643 
ProductIg-like domain-containing protein 
Protein accessionYP_001606198 
Protein GI162420164 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.012785 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.00366104 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGTCTCT ATCGCATATC ATCACTACAT CAAGCCAAGC AGTTAAATAA AAACAAGCAG 
TTAAATAAAA CCCGAATTTC AAAATCAGTC GTCTGGGCAA ATATTGTAAT CCAAGCCATA
TTTCCTTTGA GTATTGCTTT TACCCCAGCG GTAATGGCGG CTGAAACCGT CGGAGCCTCG
GATGAAAAAC CGCGTTCGGC CTCACAGGCT GAACAGTCTA CCGCCAATGC AGCAACACGG
TTGGCATCAA TATTGACAAA TGATGACTCT GCCAAACAAG CGAGTTCCAT TGCGCGCGGC
ACCGCTGCCA ATGCAGGTAA TGAAGCATTG CAAAAGTGGT TTAATCAGTT TGGCAGTGCG
AAAGTACAAC TAAATCTAGA TGAGAAATTG AGCCTTAAAG GCAGCCAACT GGATGTATTG
CTGCCGCTGA CCGATAGCCC AGATCTACTC ACCTTTACTC AGTTGGGTGG CCGCTATATT
GATGACCGAG TGACATTGAA CGTTGGTTTG GGTCAGCGTC ATTTCTTTGC ACAGCAAATG
CTGGGCTACA ACCTGTTTAT TGATCATGAT GCCAGCTATA GCCATACCCG TATTGGCGTC
GGTGCTGAAT ATGGTCGTGA TTTTATTAAT CTGGCAGCTA ACGGCTACTT TGGTGTCAGT
GGTTGGAAAA ATTCGCCAGA TCTTGATAAG TATGATGAAA AAGTCGCAAA TGGCTTTGAT
TTACGCAGTG AAGCTTATCT GCCAACGTTG CCACAATTGG GGGGGAAACT GATATATGAA
CAATATTTTG GTGATGAAGT TGGCTTGTTT GGTGTGGATA ACCGTCAGAA AAACCCTCTT
GCGGTCACTC TGGGTGTGAA TTATACCCCA ATTCCTTTAT TTACTGTTGG TGTCGACCAT
AAAATGGGGC GCGCAGGAAT GAATGACACC CGGTTCAACC TTGGTTTTAA CTATGCATTT
GGCACTCCTC TGACACATCA GCTCGATTCG GATGCCGTCG CAATTAAACG TAGCTTAATG
GGTAGCCGCT ATAATCTGGT CGACCGTAAT AATCAGATTG TGATGAAATA CCGTAAGCAG
AATCGGGTTA CCCTAGAGCT GCCAGCACGT GTTAGTGGTG CGGCAAGACA AACAATGCCA
TTAGTGGCAA ATGCCACAGC ACAACAAGGT ATTGATCGTC TTGAATGGGA AGCCAGTGCC
TTAACGCTAG CGGGTGGAAA AATAACCGGT AGCGGCAATA ATTGGCAGAT AACATTGCCA
AGCTATTTGT CTGGTGGTGA GGGTAATAAC ACTTATCGTA TTAGTGCTAT CGCATACGAC
ACCCTTGGTA ATGCTTCTCC CGTTGCTTAC AGCGATCTCG TGGTAGATAG CCATGGTGTG
AATACTAACG CCTCAGGCTT GACTGCTGCG CCAGAAATTC TGCCGGCAAA TGCGAGTGCA
AGCAGTGTTA TTGAATTCAA TATTAAAGAT AATGCTAACC AGCCCATCAC GGGGATTGCC
GATGAACTGG CATTTTCTCT CGAACTGGTA GAGTTACCTG AAGAATTGGC TAAGGCTAAA
GCACGTTCAG TGCCATTGAA GACGGTGTCT CATACTCTAA CGAAGATTAC TGAATCTGCT
CCAGGTATTT ATCAGGCAAC ACTCACATCG GGAAGTAAGC CACAACTGAT TAATATTACC
GCCCAGATTA ATGGTGTGCC ATTAGCCGAT GTGCAAACCA AGGTGACGTT GATTGCCGAT
GAAAGCACGG CAACGTTACA AACGAGCAGC CTGCAAATCA TCACCAATGG TAGCCTGGCA
GATGATACTG ATGCTAACCA AATACGCGCC GTGGTGGTTG ATGCTTATGG CAATAAATTG
TCTGGTGTTC AAGTCAATTT CACTGTGGGC AATAATGCCA AAATAACAGA AACCACCTTG
AGCGACAAAC AAGGGGGAGT AACCGCAGCA ATCACCAGTA CCAAAGCGGG TACATATACG
GTCACAGCCG AACTTAATGG GGTAACACAA CAGATCGATG TTAACTTTAT CCCAGATGCT
GGCACTGCAA CACTGGATGA CAGTGATGAG TATAAATTGC AGTGGGTCAC TAATGGTCAG
GTAGCCGATG GTGAAAGCAC CAATAGCGTT CAACTGACGG TAGTCGATAA GTTTGGTAAC
ACCGTACCTG GTGTGGATGT CGCCTTTACC ACGGATATCG GGGCGATAAT TAGTGAAGTC
ACCCCAACGG ATGCTAATGG TGTCGCAACA GCAAAAATCA TCAGTAGTCA GGCTAAAAGC
CATACAGTGA AAGCAACGCT CAACCGCAAG GAACAAACCG TAGAGGTTAA CTTTATTGCC
GATACTGCCA CGGCAGAAAT TACGGCTAAT AACTTTACGG TAGAAGTCGA TGGTCAAGTC
GCTGGGAGCG GAACTAACCA AGTACAGGCC CTTGTTGTGG ATAAAAAGGG AAATCCTGTT
GCTAATATGA CCGTGAATTT TACCGCCACT AACGGCGTGG TCGTAGAGAC AACCTCAGCC
AAAACAGATG AGAATGGTAA AGTAACGACT AACCTCTCTA TGACCAATGT TGGTGGGACT
ATCAGTACGG TGACGGCAAC GATGATCAAT TCAGCGAACG TGACCAGTAC ACAAGATAAA
CCCGTCATCT TCTATCCAGA TTTCACTAAA GCCACGTTGA ATACGCCAGC GAATACTTAT
AGTGGCTTTA ATATCAACAG TGGTTTCCCA ACAACAGGAT TTAAAAATAC TCACTTCCAA
TTATCGCCAC ATGGTATTAC CGGCGCTAAC AGTGACTATG ATTGGGTAAG TAGTCATCCT
AACGTGAGTG TCAGTAACAC AGGTGCAATT ACGCTTCAGG ATAATCCTGG AGGGAAAGTG
ACCATTACGG CAACCTGGAA ACATGACAGC AGCAAAGTGT TCACTTATGA CTTTACGCTA
AATTATTGGG TAGGCCTCTA TAGCTCGACT AATCTGAGCT GGGCGCAGGC CAATGCGTCA
TGTATCAACG CCGGAATGAG ATTACCGACC AATAGAGAAG TATCGGCGGG TCAAGATGTT
CGTGGTGTAG GTTCGTTATT GAGCCTGTCC ATAATTCTGT GTAACTGCCA CCGTATTAAA
GGTGATCGCT CAGGCGGTCA CCGAACTCGA TAA
 
Protein sequence
MSLYRISSLH QAKQLNKNKQ LNKTRISKSV VWANIVIQAI FPLSIAFTPA VMAAETVGAS 
DEKPRSASQA EQSTANAATR LASILTNDDS AKQASSIARG TAANAGNEAL QKWFNQFGSA
KVQLNLDEKL SLKGSQLDVL LPLTDSPDLL TFTQLGGRYI DDRVTLNVGL GQRHFFAQQM
LGYNLFIDHD ASYSHTRIGV GAEYGRDFIN LAANGYFGVS GWKNSPDLDK YDEKVANGFD
LRSEAYLPTL PQLGGKLIYE QYFGDEVGLF GVDNRQKNPL AVTLGVNYTP IPLFTVGVDH
KMGRAGMNDT RFNLGFNYAF GTPLTHQLDS DAVAIKRSLM GSRYNLVDRN NQIVMKYRKQ
NRVTLELPAR VSGAARQTMP LVANATAQQG IDRLEWEASA LTLAGGKITG SGNNWQITLP
SYLSGGEGNN TYRISAIAYD TLGNASPVAY SDLVVDSHGV NTNASGLTAA PEILPANASA
SSVIEFNIKD NANQPITGIA DELAFSLELV ELPEELAKAK ARSVPLKTVS HTLTKITESA
PGIYQATLTS GSKPQLINIT AQINGVPLAD VQTKVTLIAD ESTATLQTSS LQIITNGSLA
DDTDANQIRA VVVDAYGNKL SGVQVNFTVG NNAKITETTL SDKQGGVTAA ITSTKAGTYT
VTAELNGVTQ QIDVNFIPDA GTATLDDSDE YKLQWVTNGQ VADGESTNSV QLTVVDKFGN
TVPGVDVAFT TDIGAIISEV TPTDANGVAT AKIISSQAKS HTVKATLNRK EQTVEVNFIA
DTATAEITAN NFTVEVDGQV AGSGTNQVQA LVVDKKGNPV ANMTVNFTAT NGVVVETTSA
KTDENGKVTT NLSMTNVGGT ISTVTATMIN SANVTSTQDK PVIFYPDFTK ATLNTPANTY
SGFNINSGFP TTGFKNTHFQ LSPHGITGAN SDYDWVSSHP NVSVSNTGAI TLQDNPGGKV
TITATWKHDS SKVFTYDFTL NYWVGLYSST NLSWAQANAS CINAGMRLPT NREVSAGQDV
RGVGSLLSLS IILCNCHRIK GDRSGGHRTR