Gene YpAngola_A3513 details

Gene Information

Locus tag: YpAngola_A3513
Symbol:
ID: 5801989
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pestis Angola
Kingdom: Bacteria
Replicon accession: NC_010159
Strand:
Start bp: 3729458
End bp: 3733264
Gene Length: 3807 bp
Protein Length: 1268 aa
Translation table: 11
GC content: 53%
IMG OID: 641341330
Product: outer membrane autotransporter
Protein accession: YP_001607843
Protein GI: 162418506
COG category: [M] Cell wall/membrane/envelope biogenesis
              [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG3468] Type V secretory pathway, adhesin AidA
TIGRFAM ID: [TIGR01414] outer membrane autotransporter barrel domain
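
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene span includes a single stop codon that is not counted in the protein length. Both are assumptions about how this database reports coordinates, and the function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 3729458
END_BP = 3733264


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count, assuming the CDS span ends with one stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 3807 1268
```

Under these assumptions the computed values agree with the reported gene length (3807 bp) and protein length (1268 aa).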


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.273383
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value: 0.696988
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTGAAAC ATTTCAGACT GGGTCTTTTT ACCGCAAATG TTCAGGCGGA TACAGGGAGC 
CAACACTCCT TTAGCCGTGT TCCAGCAGCC GCGATTAATC CCATAACATT AGCGATTGTT
GTGGCGTTTT CTACTATTGC TTTACCACCA ATGGCATTAG CAGCATGCAC CAGCCCCGGG
GTGGGAACCT ATGTCTGCGA AGGTGAAAAT ACCGACGGCA TCATCCTGAG CGGCACGGAT
ATTGCGGTTG AGACCCAACC TGGTTTCAGC ATAATAGTTC CTGAAAATGG AGACCCTGCA
CTCTCGCTAG TCGGTTCTGG CACCATCAGC TACCTTGATA CCAACAGTTC TGCGCTTGAT
ACCACGGGTG CCGATTCTTT GTATATCCAG AATGATACGC ACACCACTGG GCAATCGTCG
TCCATTAATG TTCAGACAAA CGGTTCAATG GGTAGCGGTA TCAATATTAG CAACCATAGC
GGTGCCGACT CGACAGTACA GGTGGATGTT TCCGGCACAT TATTTGGTGA TCTAAATGGT
TCTCCCGCCC TTTTAATCTA TTCATCGGCG GAGAATAATT CCACTATTCT CATGAATATC
AATACTATTT TCGGCGACGT GGGTATTCGA AGCGATAACA CTTCATTCAA TGGAACAGCC
ATCACTAACG TTGATATTGC TAATGATATT AATGCTACAT ATTCGGGTGC CACTATCAAC
AACAGCGGTA GCGGCGGTAC CAGTGTGGTT AATTTCAACT CAAAAAGCAT CATTACAGAA
TTGGACGCCT TAAATATTTA TAATACCAAT TATTCAGGGG CGGTTATAAC CAATGTCGAT
ATCGATGGTG ATGTCATTTC AACGACTAGC CAGGCGACCT CTTTTTATAA CGATGCCTAC
AACGGCTCTG CAAACTTTAC GTTTCGTGCG AACAATGTCA CTGGTGAATA TTCCGGCATT
TCTATCAATA ATAGCAGTCA TAACAGTGCT AATAACAACC ATGATAATGC GGTGATAACC
GATATCCTGT TGACGGGGGA TCTTACGTCT ACTTCAGGAT CGGGCCTCCA GATAAACTCT
TATATGGACG ATGGGGATAT CAAGGTATCA GCCCAGTTGG AGAACATCTA TTCTTACTAT
GAAGCCCTCA GTGTGCGTGC GGATACCCTA ACTGGCAACC TGCAGTTTAA CATCGATGTT
TCGGGCAATA TTGTTGCTGA AAATGGCCTC GGTTTCATGG TGATGGGGGG CGCTTCCGAA
GGCAACTCCA CTATGATCAT CAATGCGAAT AACATTAGTT CTGGTAGTCA AGCACTCAAT
ATTTACAACT ATAGCGGTTT AGGTTCGGCA TTTACTGCCG TTACCGCAAC CGGGCATCTC
GTCTCTGAAC AGGGGATAGG GGCAATGTTC AGTACCTATG TCAGTCAAGG GGATGCTACC
GCTGTCATCA ATTTAAATGA CATCACTACT GCGGGCAGTG GTGTAAAGAT AGACACCATA
GCGAATGGGG GGAATTCAAC CACCTATCTC ACCGTGGTGG GCCAGATTAA CTCTAGTTTG
TATGATGGTA TTGATCTTCG TGCGACGGCT ACTGAAGGTA ATACCCTGGT CAGCATTGAC
GTCAATAATA TCGCCAGTGA ATACGATGCT ATCCACCTCG ATAACAAGAA CTACACCACA
GGCGCAGACA ACGGCACCTC GACCATTGAT CTGATCACCC GGGGCGCGCT GGTTTCGCAG
CAGGGCTACG GAATTAATAT TGAAACCAAT ACCGCAGACA CCTATGTCAC CGTGGGCGGC
TTGGTGCACG GCGGCAATGG CACCGCAATC GGCATTCATC GGCTTGATAA CATTCAAACA
TCGGCCACGT TAGAGCTGCA ATCTGGCTAT GCCCTTGAAG GCGTTACGCA GGCACTGGTC
TTCAATGGCA GTTATGCGGA GATCAATGAT GCCGCGCTGG ATCTGGCAAA CAGCCATCTG
GTGCTGGGCG GAGCAGGAGA CGCCGCTTTC GATCTCACGC GTATTGATAA CCGTGAAGAG
GCCATTCTGG ATGGCGACCC GAACCGGATC ACCGGCTTCG GTACCCTGAC CAAAACCAAC
AACAGCATCT GGACGTTAAC CGGCGCCAAT ATGGCCGACG GTGACGCCAA TGCCTTCCTG
TCGGCCAATA TCGCCGGGGG GATTTTGGTG CTGGATAACG CCACGCTGGG CCTGACACCT
GACGCTGGGG CACTCACTGG AGCCACAGTA AACCGCCTCA GTGCTGCCGA TATCGCCGCT
GACCCGACGC TAGTGGCTAC CGAAACCGGT GCATTAACCC TGGCTGAAGG CGGGGCGTTG
TCCTCGCTCG GTGACTCGGT TCTGAGCGGT AACCTTATCA GCGCCGGTGG GATCCTGCTG
TCAAACCACT ATACCGGCGG CAATGGTGCC GCTACCGACG ATCGGCTGAC CGTGACCGGG
ACTTATTTTG GTGAAAATAA CGGTTCCGGT GAAGGGGCCT GGCTGGCACT CGATACGGTG
CTGGGGGATG ACGATTCTGC CACCGACCGG TTAGTGATCA ACGGCGATGC CACCGGCACC
ACCTCGGTCC GGGTGAACAA TGCGGGCGGT CTGGGCGATA AAACCCTCAA TGGCATCAAC
CTGATCACCG TGGACGGTCT GGCGCAGGAT GACACCTTCC TACTGGCCGG GGACTATGTC
ACCACGGATG GCTATCAGGC GGTGGTGGGC GGGGCGTATG CCTACACCTT ACAGGCCGAC
GGGGAAGCCG CCACTGCGGG GCGCAACTGG TATCTCTCTT CAGAACTGAT GTTAACCGAG
GGGGTACGCT ATCAGGCGGG CGTGCCGCTG TATGAACAAT ATCCGCAGGT GCTGGCCGCC
CTGAATACCC TGCCGACGCT GCAACAGCGT GTCGGTAACC GTTACGGGGC GCCGGGCGCG
CTGGCAGACC TGGACTTTGA CAATAATCAA TGGGCCTGGG GCCGTATTGA AGGGAGCCAC
CAGGTCACCG ACCCGGCCCG CTCCACCAGT GGTTCACAAC GCGAGATTGA TGTGTGGAAG
TTGCAGACCG GCATTGATGT GCCGCTGTAT CAGAGCCAGG ACGGTTCACT GCTGACCGGC
GGGGTAAACT TCTCCTACGG TAAAGCCAAA GCGGATATCC ACTCATTCTT TGGTGATGGC
CGCATCAACA GCGCAGGTTA CGGTCTTGGC ACCAGCCTGA CCTGGTATGG CAATAACGGC
GTGTATGTGG ATGGCCAGTT GCAGACGATG TGGTTTGACA GCGACCTGAG CTCCCGTACC
GCAGGGCATG CAGTGGCCAG CGGTAACAAT GGTCGCGGGT ATACCTCGGC GATAGAAGCC
GGTAAAGGTT ACGCACTGGG TAACGGGTTG TCACTGACCC CGCAGATGCA GGTGACCTAC
TCGCGGGTCG ATTTCGATAC CTTCCGCGAT CCGTTTGATA GCGAAGTTTC GCTGCAAGAG
GGTGACAGCC TGCGGGGCCG CCTCGGTGTC TCACTGGATA AGGAAACGAC CTGGAGCGCG
AAAGACGGCA CCACCCGCCG CTCACACATT TACAGTCATC TCGATCTGCA CAATGAGTTC
CTCAATGGCA GTAAAGTGCA GGTCTCGGGG GTGGAGTTCG CCACCCGGGA TGAGCGTCAG
TCGGTGGGGT TAGGCGCGGG CGGCACCTAT GAATGGCAGA ATGGTCGCTA CGCGGTTTAC
GGCAATGTCA ATCTGCTGGG GGCTACACGG AATGTCAGTG ACAACTATGC CGTCGGCGGT
ACGATAGGTG CACGGGTGAG CTGGTAA
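
The gene sequence is laid out in space-separated blocks of 10 bases, so it has to be joined before its length or GC content can be checked against the record. A minimal sketch follows; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first line of the sequence is used here for brevity. Pasting the full block in the same way would give the whole-gene figure.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied from above.
first_line = "ATGTTGAAAC ATTTCAGACT GGGTCTTTTT ACCGCAAATG TTCAGGCGGA TACAGGGAGC"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"GC fraction of this line: {gc_fraction(seq):.2f}")
```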
 
Protein sequence
MLKHFRLGLF TANVQADTGS QHSFSRVPAA AINPITLAIV VAFSTIALPP MALAACTSPG 
VGTYVCEGEN TDGIILSGTD IAVETQPGFS IIVPENGDPA LSLVGSGTIS YLDTNSSALD
TTGADSLYIQ NDTHTTGQSS SINVQTNGSM GSGINISNHS GADSTVQVDV SGTLFGDLNG
SPALLIYSSA ENNSTILMNI NTIFGDVGIR SDNTSFNGTA ITNVDIANDI NATYSGATIN
NSGSGGTSVV NFNSKSIITE LDALNIYNTN YSGAVITNVD IDGDVISTTS QATSFYNDAY
NGSANFTFRA NNVTGEYSGI SINNSSHNSA NNNHDNAVIT DILLTGDLTS TSGSGLQINS
YMDDGDIKVS AQLENIYSYY EALSVRADTL TGNLQFNIDV SGNIVAENGL GFMVMGGASE
GNSTMIINAN NISSGSQALN IYNYSGLGSA FTAVTATGHL VSEQGIGAMF STYVSQGDAT
AVINLNDITT AGSGVKIDTI ANGGNSTTYL TVVGQINSSL YDGIDLRATA TEGNTLVSID
VNNIASEYDA IHLDNKNYTT GADNGTSTID LITRGALVSQ QGYGINIETN TADTYVTVGG
LVHGGNGTAI GIHRLDNIQT SATLELQSGY ALEGVTQALV FNGSYAEIND AALDLANSHL
VLGGAGDAAF DLTRIDNREE AILDGDPNRI TGFGTLTKTN NSIWTLTGAN MADGDANAFL
SANIAGGILV LDNATLGLTP DAGALTGATV NRLSAADIAA DPTLVATETG ALTLAEGGAL
SSLGDSVLSG NLISAGGILL SNHYTGGNGA ATDDRLTVTG TYFGENNGSG EGAWLALDTV
LGDDDSATDR LVINGDATGT TSVRVNNAGG LGDKTLNGIN LITVDGLAQD DTFLLAGDYV
TTDGYQAVVG GAYAYTLQAD GEAATAGRNW YLSSELMLTE GVRYQAGVPL YEQYPQVLAA
LNTLPTLQQR VGNRYGAPGA LADLDFDNNQ WAWGRIEGSH QVTDPARSTS GSQREIDVWK
LQTGIDVPLY QSQDGSLLTG GVNFSYGKAK ADIHSFFGDG RINSAGYGLG TSLTWYGNNG
VYVDGQLQTM WFDSDLSSRT AGHAVASGNN GRGYTSAIEA GKGYALGNGL SLTPQMQVTY
SRVDFDTFRD PFDSEVSLQE GDSLRGRLGV SLDKETTWSA KDGTTRRSHI YSHLDLHNEF
LNGSKVQVSG VEFATRDERQ SVGLGAGGTY EWQNGRYAVY GNVNLLGATR NVSDNYAVGG
TIGARVSW
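
The protein sequence can be reproduced from the gene sequence by translating codon by codon. The sketch below builds the codon table from the conventional TCAG ordering. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in which codons may act as start codons, which does not matter here because the gene begins with ATG. The function name `translate` is illustrative, and the sketch does not handle ambiguous bases such as N.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TTT, TTC, TTA, TTG, TCT, ... order ('*' = stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons of the gene sequence above.
print(translate("ATGTTGAAAC ATTTCAGACT G"))  # MLKHFRL
# Last four codons of the gene sequence above (ends with the TAA stop).
print(translate("GTGAGCTGGTAA"))  # VSW
```

Both fragments match the corresponding ends of the protein sequence listed above (MLKHFRL... and ...VSW).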