Gene YpAngola_A1074 details


Gene Information

Locus tag: YpAngola_A1074
Symbol:
ID: 5799537
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pestis Angola
Kingdom: Bacteria
Replicon accession: NC_010159
Strand:
Start bp: 1102297
End bp: 1104582
Gene Length: 2286 bp
Protein Length: 761 aa
Translation table: 11
GC content: 55%
IMG OID: 641339058
Product: putative autotransporter protein
Protein accession: YP_001605630
Protein GI: 162419459
COG category:
COG ID:
TIGRFAM ID: [TIGR01414] outer membrane autotransporter barrel domain
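
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and assumes the gene length includes one terminal stop codon. Neither convention is stated in the record, although both agree with the listed values. The function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 1102297
END_BP = 1104582
GENE_LENGTH_BP = 2286
PROTEIN_LENGTH_AA = 761


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Residue-encoding codons, assuming the length includes one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))   # 2286
print(coding_codons(GENE_LENGTH_BP))   # 761
```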


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.633584
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 43
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATAACC ATAAGATCTG GCGTTTATCC GCCGTTGCTG TTGCTTTACT GATTAGCGGT 
AATAGCTATG CCGATCAGAC CCTCCACTTC GCTCAGCAAC CGCATAACAC GGGCGATCAT
ACTGTGATCG CCGCACCAGA GGTCAATAAC TTTTCATTGG CAGATCTGAC CGCCAACGCA
GAAGGGCAAG CAGTGGATGC GACCAACGCA GGCGCACTGT CGCTGCGGGT TGAACCCACT
CAAGGTGACG CTCAGGCGAA TGGCGTACTG GTCACCTCCG GCACCTTTAC TAATCAGGCT
CAGGGCTCGA TCACCGTGAG TGCCAGCAGT CAGGATCAAC ATGCTGTTGC CAGCGGGTTG
CAGGCAACGG ATGCCTATTG GTCCACTCTG CGTAATCAGG GTAGCATCCG TGCCACGAGC
AACAGCCTAC ACGGCAGTGC AGAGGTCTAT GGTATCCAGG CTGGCCTGGC CTTTTCCGAT
GCCGCCGGGG ATCACTACGG TGACCACATG ATGTTGAGCA ACGAGGGTGT GATGCAGGTG
AGTGCCACGG CCGAGCAGGA CGCCTTTGCC AGCGGTATCT ATACCAATGC CGGTAATGGC
GTGGTCAACA ATGGCCGTCT GGATGTCAGT GCTCATGCCA GTGCTGGCGA TGCACGGGCA
ACCGGCCTGC ATACCCTCTC TGCCAACCCA GGCGTCGATC CCGATCCATG GAACCCCGCT
AACCCCACTC ATCTGATGAG CAACAATGGC TCGTTGTCAG TCACGGCCAG CGGTAACAAT
GCGACCGCAA GCGCGATCAG TACCGAGGGC GAAGGGGTCT ACATCTACAA CAACGGGACA
CTGATCGTTG ATGCATTTAG CGAGAGTGCG CAGGGCGGTG CATCCGCCTA TGGTATTCAT
GTGCTCAACG GCAGTGCCAC GATCAACTCC AGCGGCAGAA TTTTGGCGAC AGCCACCGGC
GGAAGCCAAC AGCAAGCCTA TGAAGTGATG GCCGACGGTT CGGTAGTTAA TATTGAACGC
TATACACTCG CCTTGGGCAA TGAGACACCG TGGGCGGTCA GTAATGGCGG TTCGATCGTG
CTGGGTAACG GTACGCAAGG GGCCGATCTG GTGCTGATGG CTGGGCAGGC CAGTGAAGGT
TTCGCCTATG GTAAAGAGTA CAGCTTGGAC AATCTGGCTT ACGATACCAC CTCCGGCCAG
CAGAGTACGG TGGGGGGTTC TATTGCGGGG TTGTACGGTA TTACGCCTGA TATCAAGGTG
ATTCACAGCG GTAGCAGCAG TAGTTCGTTG GGATCTGCTG CACTGGTGTA TGCACCGGAT
GTCTCTCATG CCGGAGTCTC CGCATTGGCA CAGCGTTCTT CCATGGAGCA GGCATCCAAT
ATCATCTCCT CGCAACTGCA TAGTCAGCTG GTAAGTAACG CCGGTTGTAA CCAGAAGTTG
GAGAGTGCCG ATAGCTGCGT CTTTATCACA CCTTATGCTG GTGAGTACCG TCGTGATCCA
GTGACCTCCG GTTACAGCGG TCAGCGCTAT GGGGTATTGC TCGGTCAGAA CCAACATTTT
GGCGATTTCC AGCTTGGCTG GCACGGCGGT TATGAGAGTG CCAGCACCGA TTTTAATGGT
ACCTCTGTAG GGCGTAAGGA GGAGATTAAT ACGCTGATGC TGGGGGTTCA GGGCGGAATG
AAGCTCTCTG AGAGTCTGTT TATTGCAGCG ACCAGTACGT GGTTCAGCAG TGACACCGAT
TACAGTGACA GCAATACCTA CTACGGCGTG GGCAGCCAGA GCGGCAGCTA TGATAGCAGT
GGCCTCTACA CCGATGTCTC CGTGGGTAAT AGCTGGCAGT TGAATCGCAA CTATGCCATG
ACACCGATGG TCGGGCTGAC GCATATCTGG CAGCAACGCG ATGGCTATAC TGTCTCATCC
AATAACAAAA ACTATGATCT TATCGACACC CGCTATAGCA GCTACAGCGA CCATGCAGTC
GCTCTCCATG CTGGAATACG TCTCGATGGT CGCTATCCAC TGACCAACGA GACACTGCTT
AAGCCGTTCT TTAATGTCGG ATTCCAGCAA ATGTTGTATG GCGATGAGAT CACCATCGAT
CAGAGTATCC CTAATTCGCC GGTAGTTGGC GTAAGCACCA AGGATAAAAC CACTCAAGGA
ACCTTCGATC TGGGGATGGC ACTGGTAAGC GATAACGGTG TGTCTGCCAG CCTACAGCTA
TCGGGCATGG TCAATAGTGA TCGTCAGGAT TTCACCGGCT GGGCAAATCT GGGCTGGGCG
TTCTAG
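
The record lists a GC content of 55% for this gene. The following is a minimal sketch of how such a figure might be computed from a sequence printed in 10-base blocks like the one above. The database's exact method and rounding are not stated, and `gc_percent` is a hypothetical helper name.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First line of the gene sequence above; paste the full sequence to check
# the whole gene against the reported value.
first_line = "ATGAATAACC ATAAGATCTG GCGTTTATCC GCCGTTGCTG TTGCTTTACT GATTAGCGGT"
print(f"{gc_percent(first_line):.1f}%")
```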
 
Protein sequence
MNNHKIWRLS AVAVALLISG NSYADQTLHF AQQPHNTGDH TVIAAPEVNN FSLADLTANA 
EGQAVDATNA GALSLRVEPT QGDAQANGVL VTSGTFTNQA QGSITVSASS QDQHAVASGL
QATDAYWSTL RNQGSIRATS NSLHGSAEVY GIQAGLAFSD AAGDHYGDHM MLSNEGVMQV
SATAEQDAFA SGIYTNAGNG VVNNGRLDVS AHASAGDARA TGLHTLSANP GVDPDPWNPA
NPTHLMSNNG SLSVTASGNN ATASAISTEG EGVYIYNNGT LIVDAFSESA QGGASAYGIH
VLNGSATINS SGRILATATG GSQQQAYEVM ADGSVVNIER YTLALGNETP WAVSNGGSIV
LGNGTQGADL VLMAGQASEG FAYGKEYSLD NLAYDTTSGQ QSTVGGSIAG LYGITPDIKV
IHSGSSSSSL GSAALVYAPD VSHAGVSALA QRSSMEQASN IISSQLHSQL VSNAGCNQKL
ESADSCVFIT PYAGEYRRDP VTSGYSGQRY GVLLGQNQHF GDFQLGWHGG YESASTDFNG
TSVGRKEEIN TLMLGVQGGM KLSESLFIAA TSTWFSSDTD YSDSNTYYGV GSQSGSYDSS
GLYTDVSVGN SWQLNRNYAM TPMVGLTHIW QQRDGYTVSS NNKNYDLIDT RYSSYSDHAV
ALHAGIRLDG RYPLTNETLL KPFFNVGFQQ MLYGDEITID QSIPNSPVVG VSTKDKTTQG
TFDLGMALVS DNGVSASLQL SGMVNSDRQD FTGWANLGWA F