Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeAg_B3908 |
Symbol | |
ID | 6795367 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Agona str. SL483 |
Kingdom | Bacteria |
Replicon accession | NC_011149 |
Strand | + |
Start bp | 3800753 |
End bp | 3805171 |
Gene Length | 4419 bp |
Protein Length | 1472 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 642778028 |
Product | putative autotransporter |
Protein accession | YP_002148623 |
Protein GI | 197248710 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport [W] Extracellular structures |
COG ID | [COG5295] Autotransporter adhesin |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACAGAA TATTTAGAGT CCTCTGGAAT GCCGCTACGG GAACATTTGT TGTCACCAGC GAAACCGCAA AAAGCCGCGG CAAAAAAAGC GGCCGCAGAA AGCTGGCGGT TTCCGCACTC ATCAGTCTTA GCAGCATTAT GGTTTCTGCG GACGCACTGG CCAATGCTGG GAACGATACA GGGAAAGGGA TCGGCACTGA GAATGGATGG ATAGCCATTG GCGAAGGCGC CGAAGCTGAT TCAACAATAA CAACTAAAGA AAAGGTCGCA GAAGATCCGT CCAAAGGATA TGTAGCGGGT GCCGGTATTG CAATAGGTTA CTATAGTAAG GCATCTGGGG CTTCCAGCAC TGCGTTGGGC GGATATAGCC TTGCTGAGAG TATAGGCTCA GTGGCGCTAG GGGCAGGTGC ACATACCACT TCTACGGCTA AATACTCTAT GGCAGTAGGT ACAAACGCAG TTGCCTCAAA TCTTTACGCC ATCGCACTGG GAACAAATGC AAATGCCTCA GGGAAAAATT CAATAGCGCT TGGACGCGAT ACCGTATCAT CAGCAGAAAG AAGCCTGACG GTAGGATTAG GGGCGAATGC CGCATCTCTG GATAGCATGG CTTTCGGTTA CAATACATCC GTAACGTCAG ATGCGGCAAA TGGCATTGCC TTCGGTAGTG GGGCGGTAAC CTCTGCAAAA AATTCCGTCG CGATAGGGAG CAATTCTACA GCGACCGAAG AAAACGTAGT TTCTGTAGGG AGTGATGCCC TAAAGCGCAA AATTGTCAAC GTTGGTAACG GAGCCATATC AGAAAGCAGC ACCGACGCCG TTAACGGCTC TCAGCTATTT GCAACCAACG CCAATGTGAC GCAGAACACC ACTGATATTG CTGCCAATAC CGACAGCATT AACCAAAACA CAACCGATAT CGCCACCAAC ACCACCAATA TCAACAGCCT GAGCGACTCC GTCACCACGC TCACCGACGA TGCCCTGTTG TGGGATGCAG CCTCTGGCGC ATTCAGCGCT AAGCACAACG GAAGCGACAG CAAGTTAACC AATCTGGCGG CGGGTACCCT GGCCGCAGAC AGCACCGACG CCGTTAACGG CTCTCAGTTG TTTGATACAA ATGAGAAAGT GGATAAGAAC ACTGCTGATA TCGCCACCAA TACCGACAGC ATCAACCAAA ACACTGCCGA TATTACCGCT AATACCGACA GCATTAACCA GAACACAACC GATATCGCCG CCAATACGAC CAGCATCAAT CAGAACACCA CTGATATTGC CACCAACACT ACCAATATCA ACAATCTGAG CGATTCCATC ACCGGCCTTA CCGATGATGC CCTGCTTTGG GATGCTGACA CTGGCGCATT CAGCGCTAAG CACAACGGAA GCGACAGTAA AATCACCAAT CTGGCGGCGG GTACCCTGGC CGCTGACAGC ACCGACGCCG TTAACGGCTC TCAGTTGTTT GCCACCAATG AAAATGTGTC TCAGAACACA ACCGATATCG CCGCCAATAC CGACAGCATT AACCAAAACA CCACTGATAT TGCCACCAAC ACTACCAATA TCAACAATCT GAGCGATTCC ATCACCGGCC TTACCGATGA TGCCCTGCTT TGGGATGCTG ACACTGGCGC ATTCAGCGCT AAGCACAACG GAAGCGACAG TAAAATCACC AATCTGGCGG CGGGTACCCT GGCCGCTGAC AGCACCGACG CCGTTAACGG CTCACAATTG TTTGCCACCA ATGAAAATGT GTCTCAGAAC ACCACTGATA TCGCTGCCAA TACAACCAGC ATCAATCAGA ACACCACTGA TATCGCCACC AACACCACCA GTATCAACAA CCTGAGCAAC TCAGTTACCA CGCTCACCGA CGATGCACTG CTATGGGATG CAGTCTCTGG CGCATTCAGC GCTAACCGCA ACGGAAGCGC CAGTAAAATC ATCAATGTCG CCGCCGGGGA CCTTTCTGAG GACAGTACAG ACGCGGTGAA CGGCTCCCAG TTGTATGAAA CCAACCAGAG GGTGGATCAA AACACCTCTG CTATCGCAGA TATTAATACG TCCATCACCA ATCTTAGCTC TGACAATCTG AGCTGGAATG AAACAACAAG TTCGTTCTCT GCCAGCCACG GAAGTAGTAC GACAAACAAA ATCACCAACG TTGCTGCCGG AGAGCTGTCT GAAGAAAGTA CCGACGCGGT TAACGGCTCG CAGCTGTTCG AAACCAATGA AAAAGTGGAT CAGAACACGA CCGATATCGC CGCCAATACC ACTAACATCA CTCAGAACAG CACCGCGATT GAGAACCTGA ATACTTCTGT CTCCGACATT AATACGTCCA TTACCGGCCT CACTGATAAC GCTCTGTTAT GGGACGAAGA CATCGGCGCT TTCAGCGCAA ACCACGGTGG AAGCACCAGT AAAATAACCA ATGTCGCCGC CGGGGCGCTT TCTGAGGACA GTACAGACGC GGTAAACGGC TCACAGTTGT ATGAAACCAA CCAGAAGGTG GATCAAAACA CCTCTGCTAT CGCGGATATC AATACGTCCA TCACCAATCT TGGTACCGAT GCACTGAGCT GGGATGACGA AGAGGGTGCG TTCAGCGCCA GCCACGGTAC CAGCGGTACC AATAAGATAA CCAATGTAGC AGCAGGTGAA ATCGCCAGTG ACAGTACTGA CGCTGTTAAT GGCTCTCAGC TCTATGAAAC TAACATGCTG ATTTCTCAGT ATAATGAATC TATTAGCCAA CTAGCCGGCG ATACCAGCGA AACCTATATC ACGGAAAATG GTACCGGCGT GAAATACATC CGTACGAATG ATAACGGGCT TGAAGGCCAG GATGCATACG CAACGGGTAA CGGCGCAACG GCAGTAGGTT ACGACGCCGT CGCCTCTGGC GCTGGCAGCC TGGCTCTTGG CCAAAACAGC AGCAGCACCA TTGATGGCAG TATCGCTTTG GGCAGTGGCT CCACCTCTAA CCGCGCTATT ACAACCGGTA TACGAGAAAC GAGCGTAACA AGCGATGGCG TCGTCATTGG CTACAATACA ACAGACAGAG AGTTGCTGGG CGCGTTGTCA TTGGGGACGG ATGGAGAAAG CTATCGTCAA ATTACCAACG TTGCTGACGG CTCTGAAGCG CAAGATGCGG TAACAGTTCG TCAGTTACAA AATGCCATTG GTGCGGTCAC TACTACACCG ACCAAGTACT ACCACGCAAA CTCAACGGAA GAAGATTCAC TGGCTGTCGG AACTGACTCA CTGGCAATGG GTGCGAAGAC CATCGTCAAT GCTGATGCAG GTATTGGTAT TGGTCTGAAT ACACTGGTGA TGGCTGATGC CATCAACGGT ATTGCTATCG GTTCTAACGC ACGCGCCAAT CATGCAAACA GTATTGCAAT GGGTAATGGT TCTCAGACCA CTCGCGGCGC ACAGACTGAC TACACCGCCT ACAACATGGA CACACCGCAG AACTCTGTCG GTGAGTTCTC TGTCGGCAGT GAAGACGGCC AACGTCAGAT CACTAACGTC GCGGCGGGTT CGGCAGATAC CGATGCCGTT AACGTTGGTC AATTGAAAGT CACCGACAGC CGTGTTGCCG CGAATACCGA AAGCATCAAT AACCTGAACA CGCAGGTGAG TAGTCTTGAT ACTCGCGTCA CCAATATTGA AAACGGTATT GGCGATATCG TCACTACCGG TAGCACCAAG TACTTCAAGA CCAACACCGA TGGCGTAGAT GCAAATGCCC AGGGTGCGGA CAGCGTCGCG ATTGGCTCTG GCTCTATTGC TGCCGCTGAA AACAGCGTGG CGTTAGGCAC AAATTCCGTC GCAGATGAAG CTAATACTGT GTCTGTCGGC TCTTCTACTC AACAACGCCG TATTACCAAC GTTGCCGCAG GGGTGAACAA CACTGACGCG GTTAATGTGG CGCAACTGAA AGCCTCAGAA GCAGGCTCCG TGCGTTATGA AACCAATGCA GATGGCTCGG TTAACTATAG CGTGCTCAAC CTGGGAGACG GGAGTGGCGG TACCACTCGG ATCGGCAATG TTTCAGCGGC GGTGAATGAT ACGGATGCGG TTAACTATGC GCAATTGAAA CGCAGCGTCG AAGAAGCAAA CACCTATACC GACCAGAAAA TGGGTGAAAT GAACAGCAAA ATCAAAGGTG TAGAAAACAA GATGAGCGGC GGTATCGCTT CAGCGATGGC GATGGCCGGT CTGCCACAAG CCTACGCCCC GGGCGCCAAC ATGACCTCGA TTGCTGGCGG TACGTTTAAT GGTGAAAGTG CCGTCGCCAT TGGGGTCTCC ATGGTAAGTG AAAGCGGGGG CTGGGTGTAT AAATTGCAAG GAACCAGCAA CAGCCAGGGC GATTACTCTG CGGCCATTGG CGCGGGCTTC CAGTGGTAA
|
Protein sequence | MNRIFRVLWN AATGTFVVTS ETAKSRGKKS GRRKLAVSAL ISLSSIMVSA DALANAGNDT GKGIGTENGW IAIGEGAEAD STITTKEKVA EDPSKGYVAG AGIAIGYYSK ASGASSTALG GYSLAESIGS VALGAGAHTT STAKYSMAVG TNAVASNLYA IALGTNANAS GKNSIALGRD TVSSAERSLT VGLGANAASL DSMAFGYNTS VTSDAANGIA FGSGAVTSAK NSVAIGSNST ATEENVVSVG SDALKRKIVN VGNGAISESS TDAVNGSQLF ATNANVTQNT TDIAANTDSI NQNTTDIATN TTNINSLSDS VTTLTDDALL WDAASGAFSA KHNGSDSKLT NLAAGTLAAD STDAVNGSQL FDTNEKVDKN TADIATNTDS INQNTADITA NTDSINQNTT DIAANTTSIN QNTTDIATNT TNINNLSDSI TGLTDDALLW DADTGAFSAK HNGSDSKITN LAAGTLAADS TDAVNGSQLF ATNENVSQNT TDIAANTDSI NQNTTDIATN TTNINNLSDS ITGLTDDALL WDADTGAFSA KHNGSDSKIT NLAAGTLAAD STDAVNGSQL FATNENVSQN TTDIAANTTS INQNTTDIAT NTTSINNLSN SVTTLTDDAL LWDAVSGAFS ANRNGSASKI INVAAGDLSE DSTDAVNGSQ LYETNQRVDQ NTSAIADINT SITNLSSDNL SWNETTSSFS ASHGSSTTNK ITNVAAGELS EESTDAVNGS QLFETNEKVD QNTTDIAANT TNITQNSTAI ENLNTSVSDI NTSITGLTDN ALLWDEDIGA FSANHGGSTS KITNVAAGAL SEDSTDAVNG SQLYETNQKV DQNTSAIADI NTSITNLGTD ALSWDDEEGA FSASHGTSGT NKITNVAAGE IASDSTDAVN GSQLYETNML ISQYNESISQ LAGDTSETYI TENGTGVKYI RTNDNGLEGQ DAYATGNGAT AVGYDAVASG AGSLALGQNS SSTIDGSIAL GSGSTSNRAI TTGIRETSVT SDGVVIGYNT TDRELLGALS LGTDGESYRQ ITNVADGSEA QDAVTVRQLQ NAIGAVTTTP TKYYHANSTE EDSLAVGTDS LAMGAKTIVN ADAGIGIGLN TLVMADAING IAIGSNARAN HANSIAMGNG SQTTRGAQTD YTAYNMDTPQ NSVGEFSVGS EDGQRQITNV AAGSADTDAV NVGQLKVTDS RVAANTESIN NLNTQVSSLD TRVTNIENGI GDIVTTGSTK YFKTNTDGVD ANAQGADSVA IGSGSIAAAE NSVALGTNSV ADEANTVSVG SSTQQRRITN VAAGVNNTDA VNVAQLKASE AGSVRYETNA DGSVNYSVLN LGDGSGGTTR IGNVSAAVND TDAVNYAQLK RSVEEANTYT DQKMGEMNSK IKGVENKMSG GIASAMAMAG LPQAYAPGAN MTSIAGGTFN GESAVAIGVS MVSESGGWVY KLQGTSNSQG DYSAAIGAGF QW
|
| |