Gene YpAngola_A2107 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A2107 
Symbol 
ID5800577 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp2199113 
End bp2200390 
Gene Length1278 bp 
Protein Length425 aa 
Translation table11 
GC content48% 
IMG OID641340017 
Producthypothetical protein 
Protein accessionYP_001606562 
Protein GI162421600 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0010339 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGTGT TATCTTCACT GTGCCTGCTC GGTTTTACCT TACCCAGTCA GGCACAATGT 
GTCTGGAAAG GGAGTGATAT TGGCGGGGAT AACTACGGTG CCTCACTGTA TTTAGGCAAT
ATCAATATCA CCAGTAATTA TATTCAACCC GTCGATTCAA TTTTGGCCTC TAACGTCATC
AGCTTAGTGC CCGCCCATCG GTGGCCAGAT CCAGAAGCCG TTATCTATGA ATGTGACATC
GCCGATAAAG ACAGCTTATT TGAAGTGTTT GCTACCAATG GTGACAGCAA TGTTGGCGGC
TACACCCATA TGGGTGATAA CTATTTTCAA ACGTTTTTCC CTTATACCGC GCTGAAACTG
ATTCATGTCG ATTCAGGTGT TGAATTCACC CGTATCTGGC AAGAAATCCC ATTAAAAAAG
TACGATATCC TTGACAATAA AATCCAGATA AAAGGTAAAC ACTTCAGCCA GATCCGCGCA
GAGTTGAAAA AAACAGGCTC AGTAGATCGC TCACCCGGCC CGACCAGTTG GGGTTGCCCT
GGACCGGCGG AGGACAACTA CTCAGGCGGC TATACCTGTA ATCAACCTAA TGGTTATGTG
GTATTCAAAG GACCCGGCAT GGCGGTGCCA GAAGCAGGAT ATGATTCAGC GACGAATTAC
CAAACTTGGG GAACGGGCCG CTATATGGCA TTTGGCATGA ACACATCCCC CATCACTATC
CTGACGCGTA AAAATACCTG TGTAGTACGC AATGTGACGC CTTATGTAGT TTTCCCGATC
ATTACAGTGA ATGAACTTAA TGATAACCAA ACACGTAGTG CTGAAATCAC AGTAGATATC
GAGTGTCAGT CAGGTACCGA ATCAGGGACA TCCAGTGGCC AAACCGCCAT TGGGATCCAA
ACCTCATTAC CGGGCTACCT TAAAGCGCTG GGATTAGGAT TGGTGAATAC GGCAGGAGGA
GTGAGTTACT TGCTCTCAGA CAGTTACGGT ACTGATAGCC GTATCGCCAC TGGCGTGGGT
ATCAGCCTAA GCGATAGCAA TGGCAGTACC ATGAATTTTG TTGGTTGGGG AGGATGTGCA
CAGACTCAGG ACTGTCTAAC TACCGCCGAT GCGGGCTGGT ACCCGATACT CACTGGGGCC
AGTGGTAATG GCAGCCACTC TGCAGGTTAC AACAATTATG TCCATCACTT CACCGCCACC
TTAAAAAAAC TGCCTAATGG TCACCCTACT GCGGGGAAGA TCGACGCCAC AGCTTACGTT
CTGGTGAAAA TACAATGA
 
Protein sequence
MSVLSSLCLL GFTLPSQAQC VWKGSDIGGD NYGASLYLGN INITSNYIQP VDSILASNVI 
SLVPAHRWPD PEAVIYECDI ADKDSLFEVF ATNGDSNVGG YTHMGDNYFQ TFFPYTALKL
IHVDSGVEFT RIWQEIPLKK YDILDNKIQI KGKHFSQIRA ELKKTGSVDR SPGPTSWGCP
GPAEDNYSGG YTCNQPNGYV VFKGPGMAVP EAGYDSATNY QTWGTGRYMA FGMNTSPITI
LTRKNTCVVR NVTPYVVFPI ITVNELNDNQ TRSAEITVDI ECQSGTESGT SSGQTAIGIQ
TSLPGYLKAL GLGLVNTAGG VSYLLSDSYG TDSRIATGVG ISLSDSNGST MNFVGWGGCA
QTQDCLTTAD AGWYPILTGA SGNGSHSAGY NNYVHHFTAT LKKLPNGHPT AGKIDATAYV
LVKIQ