Gene YpsIP31758_0499 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpsIP31758_0499 
Symbol 
ID5386714 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis IP 31758 
KingdomBacteria 
Replicon accessionNC_009708 
Strand
Start bp575710 
End bp578724 
Gene Length3015 bp 
Protein Length1004 aa 
Translation table11 
GC content45% 
IMG OID640863470 
Productputative autotransporter protein 
Protein accessionYP_001399492 
Protein GI153948648 
COG category[M] Cell wall/membrane/envelope biogenesis
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3468] Type V secretory pathway, adhesin AidA 
TIGRFAM ID[TIGR01414] outer membrane autotransporter barrel domain 


Plasmid Coverage information

Num covering plasmid clones40 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGAATT CAAATAGATC ACCCAAAAAC ATATCAAGAA ATTTTAAACA TCATAAAGAA 
ACCTCTCTTT CAAGTAAAAA TAAAAGTCCT ATCGCAACAT GTGTCGCGGC GGCTCTCTTT
ATATTTGGCA GTTCATCAGT TATCGCGAAC CCGGACCATG AAGGTATCGT TGTGGGGAAA
TCAATCCTCA ATAAAAAGCA GAGTGCGGTA AATGCCATAA TAAATGAGGG TAACAGTTTA
GTACTCACTG ACAGTGCCAG CGCAGAGCAT ACGGCCGTTA ATACTGGCAG TATTTTTACG
TTAAAAGAGG ATAGTACAGC CGATATAACC TCGGTGACGG GGGGCTTTTT TAGTTTATCT
GGTAGCAGTA AAGCCAATAT AAATACGGTT CTTTCTGGAG GGTGGCTAGA GGTCAATGAC
GATGCAAGCA TCACCGAGAC AACGATCAGC TCAGATATCG AAAAAAAAAG TACTGTGCGC
TTATACCAAG ATGGAAGTGC CACAAAAACC ACTGTCGGTG ATAATGGTAT TCTCTATGTT
TCAGGTGATA GCCGTGCAGA AGAGACTCAT GTAACCAAAG GAGGAAAATT GATTGTTTAC
AGTGAGTCTC AGGGCCCGAC ACTGAAAAAC ACTCAAATCG CGGGAACATT AACCTTAAAA
AGTGATGTCA CACTGGAAGG TAAAACGGAA TTTGTCTCTA GCGCAACCAT AAAAACAACC
GGTCATTTAA TTGATAACCA GGGTCAACTT ATTTTCAATA GTGATAAAGA TATCGTTATT
GAAGCCATGA TCGATGGCCA AGGTTCATTA ACCAAAGAAA ATCCATTAAC CACATTAACG
CTATCTAGCG CGGGTGACGC TTGGGTTGCC AGTTATGTCT ATTCCGGTGA AACACATATC
AATGCAGGCA ATCTAAAACT CGCCAACACC CATTTTTTTG GTAGCCCGAT TAGTGGCAAT
CCAAATACAC GCCTTATTTT AGAAAAAAGC ACACTCGACA CCACCGTACA GGGCAGTAGT
GTATTTATCG ATAAACATAG CATATGGAAT ATGCAGGGTG ATTCTAACAT CCATCATTTA
GATATATTGG ATTCAGGGAG ACATGATTTA AATAACCCTG GAAAAACGGG CAATCAATTA
ATCATTAATG GCGATTACTT CAGTGATAAC GGCACGTTGA TTTTTCATAG TCAGTTAGCG
GGAGATGACT CCGTCACTGA CCACATACTG ATTAAAGGCA ATACGGGGGG CCATACCAAC
GTCCGAGTCA TTAATGTTAA CGGTGAAGGG AATAAAACAG ATTCAGGTAT CCAATTGATC
GAGGTGAGAG GGATATCGGA TGGGGAGTTT AGCCAGGTTG GCCGTATTAC CGCAGGGGCT
TATGAATACC GTTTAGGCCG TGGTAAGGAC GAACTCAGCA AAAACTGGTA TTTGAGCAGT
GATATTACGG ATTACTCTTC AGACGGTGTT CCCGAAGCAG AACTTCCAGG GATTTTGGTT
CTAAAATCAG ATAATGCCGC AGTCTTTTCA GCTAAATTAG CTGATTACGC GTTGCAAGCC
GTTGGCGGCG TGGTCGATTC CTTCACTCAA CCCGAGACTT CCCCCCCAAG TAACACCCCC
GAATTAACCG TTCCAGACCC AAAGCGCGCC AATGCCGCAG CCTTTTCAGC CAAATTAGCT
GATTACGCGT TGCAAACCGT TGGCGGCGTG GTCGATTCCT TCACTCAACC TGAGACTTCC
CCCCCAAGTA ACACCCCCGA ATTAACCGTT CCAGACCCAA AGCGCGCCAA TGCCGAAGCC
TTTTCAGCCA AATTAGCTGA TTACGCGTTG CAAGCTGTTG GCGGCGTGGT CGACACTTTC
ACCTCACCTG AGCAAACTAC TGCAACCCCA GACAGCACAC CTAAACCAAC CGATTTAGCA
GTGAATCCAG CAAGCACCCC CAAATCAACC GATTCAGCGG TGAAACCAGC AAGCACACCT
AAACCAACCG ATTTAGCAGT GAAACCCGTA AACACCGTGA TCAGCAGTGC CAAAACAGCC
GCACCCAAAC GCCAGACTTT AGTTCATACC CCCGAAAACG GGAGCTACAT TGCCAATATT
GCAATGGCAA GAAACCTGTT TACCACTCGT TTAGAAGATC GCACCGGCCA TTATCTCTAT
AAAGATATCG TTACTGGGCA ATGGCAGCCT ACCAGCATGT GGATGCATAC CCAAGGCGGC
AGAAGCCAGT TTGGTCATAC TGTTGAGCAA TTGAATATCA AAGGGAACTA CTACTCTGTT
CAACTTGGTG GGGATATTGC TCAATGGGCA ACCAATGAGC AAGACAGTGG GCGAATTGGC
GTGCTGGCAG GGTTAGGCAA AGCCACTAAC CACAGCCACT CTAAGGTGAC GAGCTATCAT
TCACGCGGTT CCGTTGATGG TTACAACCTT GGCATATACG CAACTTGGTT CGCCGATCAG
CAACATAATA CCGGCGTCTA TATTGACACT CTGGCGCAGT ATAGCTGGTT TAATAACGCC
GTTAATGGGC AAGATAAAGC AGAAGAAAAG TATAAATCAT CAGGTTTCAC TACCTCTATC
GAGAGCGGCT ATACCTTTAA TTTAGCGAAT AGCGATCAAC TGAGCTATTT CATCCAACCC
AACGCACAAA TTACTTGGGC AGGCATCAAC GCACAAACAC ATAAAACAGC TGACGGAGCG
GTGGTCAGTT ACCGCAATAA CGGGCATTTC ATCACTCGTA TCGGGGCTAA AGCGTATCTG
CAAACGCATG ATTCCCTGAA TACAAAATTC ACCCCTTTTG TTGCCGTAAA CTGGATTCAC
CAAAATCAAA ATACCGGCAC CATAATTTCA GGCCAAGGGA TTGATAATAA GATCCAAAAC
AGCACTGAAT TCAACGTCGG CGTTGAAAGC CAAATTGACC AACAACTGCA TATTTGGGCG
AATATAAATC ATCAAATGGG GCGTTATAAT TATACCGATA CCAACGCACT GGTTGGCGTG
AAATATCACT TCTAA
 
Protein sequence
MKNSNRSPKN ISRNFKHHKE TSLSSKNKSP IATCVAAALF IFGSSSVIAN PDHEGIVVGK 
SILNKKQSAV NAIINEGNSL VLTDSASAEH TAVNTGSIFT LKEDSTADIT SVTGGFFSLS
GSSKANINTV LSGGWLEVND DASITETTIS SDIEKKSTVR LYQDGSATKT TVGDNGILYV
SGDSRAEETH VTKGGKLIVY SESQGPTLKN TQIAGTLTLK SDVTLEGKTE FVSSATIKTT
GHLIDNQGQL IFNSDKDIVI EAMIDGQGSL TKENPLTTLT LSSAGDAWVA SYVYSGETHI
NAGNLKLANT HFFGSPISGN PNTRLILEKS TLDTTVQGSS VFIDKHSIWN MQGDSNIHHL
DILDSGRHDL NNPGKTGNQL IINGDYFSDN GTLIFHSQLA GDDSVTDHIL IKGNTGGHTN
VRVINVNGEG NKTDSGIQLI EVRGISDGEF SQVGRITAGA YEYRLGRGKD ELSKNWYLSS
DITDYSSDGV PEAELPGILV LKSDNAAVFS AKLADYALQA VGGVVDSFTQ PETSPPSNTP
ELTVPDPKRA NAAAFSAKLA DYALQTVGGV VDSFTQPETS PPSNTPELTV PDPKRANAEA
FSAKLADYAL QAVGGVVDTF TSPEQTTATP DSTPKPTDLA VNPASTPKST DSAVKPASTP
KPTDLAVKPV NTVISSAKTA APKRQTLVHT PENGSYIANI AMARNLFTTR LEDRTGHYLY
KDIVTGQWQP TSMWMHTQGG RSQFGHTVEQ LNIKGNYYSV QLGGDIAQWA TNEQDSGRIG
VLAGLGKATN HSHSKVTSYH SRGSVDGYNL GIYATWFADQ QHNTGVYIDT LAQYSWFNNA
VNGQDKAEEK YKSSGFTTSI ESGYTFNLAN SDQLSYFIQP NAQITWAGIN AQTHKTADGA
VVSYRNNGHF ITRIGAKAYL QTHDSLNTKF TPFVAVNWIH QNQNTGTIIS GQGIDNKIQN
STEFNVGVES QIDQQLHIWA NINHQMGRYN YTDTNALVGV KYHF