Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeD_A4077 |
Symbol | |
ID | 6872754 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | + |
Start bp | 3921484 |
End bp | 3925512 |
Gene Length | 4029 bp |
Protein Length | 1342 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 642787026 |
Product | putative autotransporter |
Protein accession | YP_002217653 |
Protein GI | 198242645 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport [W] Extracellular structures |
COG ID | [COG5295] Autotransporter adhesin |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 78 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATAGAA TATTTAAAGT CCTCTGGAAT GCCGCTACGG GAACATTTGT TGTCACCAGC GAAACCGCAA AAAGCCGCGG CAAAAAAAAC GGCCGCAGAA AGCTGGCAGT TTCCGCACTC ATCGGTCTTA GCAGCATTAT GGTTTCTGCG GATGCACTGG CCAACGCAGG AAACGATACA GGCGACGGCG TTACTCCAAC GGGTACCCAG ACTGGAGGAA AAGGGTGGAT TGCAATTGGT ACCGATGCCA CAGCCAATAC TTACACCAAC GTTGATGGCG CAAGCGCCGC AATGGGTTAT AAAGCCTCCG CGATGGGGAA ATGGAGTACC GCCATTGGTT CCTACAGCCA GTCCACCGGC GACTCTTCGT TAGCGCTTGG CGTAAAATCG GTTTCAGCCG GTGACCGGGC CATTGCAATG GGCGCCTCAT CCTCAGCCAG TGGAAGTTAT TCAATGGCAA TGGGCGTGTA TGCCAATTCG AGCGGCGCAA AATCGGTTGC GTTAGGTTAT AAATCTGTCG CGAGCGGAGC AACTTCCTCT GCATTAGGTT ATCAAGCTAC TGCGAGCGGC GACGACAGCG CTGCATTTGG TAATGGCGCA AAAGCGATAG GCACCAACTC AGTTGCCCTT GGCTCGGGCT CTGTCGCCCA GGAAGACAAT TCCGTCGCCG TGGGTAACAG CACCACTCAG CGGCAGATCA CCTACGTTGC TAAAGGCGAC ATCAATTCCA CCAGTACCGA TGCCGTTACA GGTGCGCAAA TTTATTCTTT AAGTCAATCC GTCGCCGACC GACTCGGCGG AGGGGCTTCC GTTAATAGTG ATGGTACAGT GAATGCCCCC CTCTACGAGG TAGGCACAGG CATCTACAAT AACGTAGGCA GTGCATTAAG CGCACTTAAC ACTTCTATCA CTAACACAGA GGCCTCTGTC GCAGGATTAG CCGAAGACGC GCTGTTGTGG GATGAAAGCA TCAGCGCCTT TAGCGCTAGC CACACGGGAA ACGCCAGCAA AATCACCAAT CTGGCGGCGG GTACCCTGGC CGCTGACAGC ACCGACGCCG TTAACGGCTC CCAGTTGTTT GATACAAATG AGAAAGTGGA TCAGAACACC GCTGATATCA CCACCAATAC CAACAGCATC AATCAGAACA CCACTGATAT TGCCACCAAC ACCACCAATA TCAACAACCT GAGCGATTCC ATCACCACGC TCACCGATGA TGCCCTGCTT TGGGATGCAG CCTCTGGCGC ATTCAGCGCT AAGCACAACG GAAGCGACAG TAAAATCACC AACCTGGCGG CGGGTACCCT GGCTGCGGAC AGCACCGATG CCGTTAACGG CTCTCAGCTG TTTGCCACCA ACGAAAATGT GTCTCAGAAC ACCACGGATA TCGCTGCCAA TACAACCAGC ATCAATCAGA ACACCACTGA TATCGCCACC AACACCACCA GCATCAACAA CCTGAGCAAT TCCGTCACCA CGCTCACCGA TGATGCCCTG TTATGGGATG CAGCCTCTGG CACATTCAGC GCCAGCCGTA GCGGAAGCAC CAGCAAGATC ACCAATCTGG CGGCGGGCAC CCTGGCCGCG GACAGTACAG ACGCGGTGAA CGGCTCACAA TTGTATGAAA CCAACCAGAA GGTGGATCAA AACACCTCTG CTATCGCAGA TATTAATACG TCCATCACCA ATCTTAGCTC TGACAACCTG AGCTGGAATG AAACAACGAG TTCGTTCTCT GCCAGCCACG GAAGTAGCAC GACAAACAAA ATCACCAACG TTGCTGCCGG AGAGCTGTCT GAAGAAAGTA CCGACGCGGT TAACGGATCG CAGCTGTTCG AAACCAATGA AAAAGTGGAT CAGAACACGA CCGATATCGC CGCCAATACC ACTAACATCA CTCAGAACAG CAGCGCGATT GAGAACCTGA ATACTTCTGT CTCCGACATT AATACATCCA TTACCGGCCT CACTGATAAC GCTCTGTTAT GGGACGAAGA CATCGGCGCT TTCAGCGCAA ACCACGGTGG AAGCACCAGT AAAATAACCA ATGTCGCCGC CGGGGCGCTT TCTGAGGACA GTACGGACGC GGTAAACGGC TCACAGTTGT ATGAAACTAA CCAGAAGGTG GATCAAAACA CCTCTGCTAT CGCGGATATC AATACGTCCA TCACCAATCT TGGTACCGAT GCACTGAGCT GGGATGACGA AGAAGGCGCG TTCAGCGCCA GCCACGGTAC CAGCGGTACC AATAAGATAA CCAATGTAGC AGCAGGTGAA ATCGCCAGTG ACAGTACTGA CGCTGTTAAT GGCTCTCAGC TCTATGAAAC TAACATGCTG ATTTCTCAGT ATAGTGAATC TATTAGCCAA CTAGCCGGCG ATACCAGCGA AACCTATATC ACGGAAAATG GTACCGGCGT GAAATACATC CGTACGAATG ATAATGGGCT TGAAGGCCAG GATGCATACG CAACGGGTAA CGGCGCAACG GCAGTAGGTT ACGACGCCGT CGCCTCTGGC GCTGGCAGTC TGGCTCTTGG CCAAAACAGC AGCAGCAGCA TTGAGGGCAG TATCGCTTTG GGCAGTGGCT CCACCTCTAA CCGCGCTATT ACAACCGGTA TACGAGAAAC GAGCGTAACA AGCGATGGCG TCGTCATTGG CTACAATACA ACAGACAGAA AGTTGCTGGG CGCGTTGTCA TTGGGGACGG ATGGAGAAAG CTATCGTCAA ATTACCAACG TTGCTGACGG CTCTGAAGCG CAAGATGCGG TAACAGTTCG TCAGTTACAA AATGCCATTG GTGCGGTCAC TACTACACCG ACCAAGTACT ACCACGCAAA CTCAACGGAA GAAGATTCAC TGGCTGTCGG AACTGACTCA CTGGCAATGG GTGCGAAGAC CATCGTCAAT GCTGATGCAG GTATTGGTAT TGGTCTGAAT ACACTGGTGA TGGCTGATGC CATCAACGGT ATTGCTATCG GTTCTAACGC ACGCGCCAAT CATGCAAACA GTATTGCAAT GGGTAATGGT TCTCAGACCA CTCGCGGCGC ACAGACTGAC TACACCGCCT ACAACATGGA CACACCGCAG AACTCTGTCG GTGAGTTCTC TGTCGGCAGT GAAGACGGTC AACGTCAGAT CACTAACGTC GCGGCGGGTT CGGCAGATAC CGATGCCGTT AACGTGAGTC AATTGAAAGT AACGGACGCG CAGGTTTCCA GGAATACCCA GAGCATTACT AACCTGAATA CTCAGGTATC GAATCTGGAT ACCCGCGTCA CCAATATCGA AAACGGCATT GGCGATATCG TCACTACCGG TAGCACCAAG TACTTCAAGA CCAACACCGA TGGCGCAGAT GCAAATGCCC AGGGTGCGGA CAGCGTCGCG ATTGGCTCTG GCTCTATTGC TGCCGCTGAA AACAGCGTGG CGTTAGGCAC AAATTCCGTC GCAGATGAAG CTAATACTGT GTCTGTCGGC TCTTCTACTC AACAACGCCG TATTACCAAC GTTGCCGCAG GGGTGAACAA CACTGACGCG GTTAATGTGG CGCAACTGAA AGCCTCAGAA GCAGGCTCCG TGCGTTATGA AACCAATGCA GATGGCTCGG TTAACTATAG CGTGCTCAAC CTGGGAGACG GGAGTGGCGG CACCACTCGG ATCGGCAATG TTTCTGCGAC GGTGAATGAT ACGGATGCGG TGAACTATGC GCAATTGAAA CGCAGCGTCG AAGAAGCAAA CACCTATACC GACCAAAAAA TGGGTGAAAT GAACAGCAAA ATCAAAGGCG TAGAAAACAA GATGAGCGGC GGTATCGCCT CAGCGATGGC GATGGCCGGT CTGCCACAAG CCTACGCCCC GGGCGCCAAC ATGACCTCGA TTGCTGGCGG TACGTTTAAT GGTGAAAGTG CCGTCGCCAT TGGTGTCTCC ATGGTAAGTG AAAGCGGGGG CTGGGTGTAT AAATTGCAAG GAACCAGCAA CAGCCAGGGC GATTACTCTG CGGCCATTGG CGCGGGCTTC CAGTGGTAA
|
Protein sequence | MNRIFKVLWN AATGTFVVTS ETAKSRGKKN GRRKLAVSAL IGLSSIMVSA DALANAGNDT GDGVTPTGTQ TGGKGWIAIG TDATANTYTN VDGASAAMGY KASAMGKWST AIGSYSQSTG DSSLALGVKS VSAGDRAIAM GASSSASGSY SMAMGVYANS SGAKSVALGY KSVASGATSS ALGYQATASG DDSAAFGNGA KAIGTNSVAL GSGSVAQEDN SVAVGNSTTQ RQITYVAKGD INSTSTDAVT GAQIYSLSQS VADRLGGGAS VNSDGTVNAP LYEVGTGIYN NVGSALSALN TSITNTEASV AGLAEDALLW DESISAFSAS HTGNASKITN LAAGTLAADS TDAVNGSQLF DTNEKVDQNT ADITTNTNSI NQNTTDIATN TTNINNLSDS ITTLTDDALL WDAASGAFSA KHNGSDSKIT NLAAGTLAAD STDAVNGSQL FATNENVSQN TTDIAANTTS INQNTTDIAT NTTSINNLSN SVTTLTDDAL LWDAASGTFS ASRSGSTSKI TNLAAGTLAA DSTDAVNGSQ LYETNQKVDQ NTSAIADINT SITNLSSDNL SWNETTSSFS ASHGSSTTNK ITNVAAGELS EESTDAVNGS QLFETNEKVD QNTTDIAANT TNITQNSSAI ENLNTSVSDI NTSITGLTDN ALLWDEDIGA FSANHGGSTS KITNVAAGAL SEDSTDAVNG SQLYETNQKV DQNTSAIADI NTSITNLGTD ALSWDDEEGA FSASHGTSGT NKITNVAAGE IASDSTDAVN GSQLYETNML ISQYSESISQ LAGDTSETYI TENGTGVKYI RTNDNGLEGQ DAYATGNGAT AVGYDAVASG AGSLALGQNS SSSIEGSIAL GSGSTSNRAI TTGIRETSVT SDGVVIGYNT TDRKLLGALS LGTDGESYRQ ITNVADGSEA QDAVTVRQLQ NAIGAVTTTP TKYYHANSTE EDSLAVGTDS LAMGAKTIVN ADAGIGIGLN TLVMADAING IAIGSNARAN HANSIAMGNG SQTTRGAQTD YTAYNMDTPQ NSVGEFSVGS EDGQRQITNV AAGSADTDAV NVSQLKVTDA QVSRNTQSIT NLNTQVSNLD TRVTNIENGI GDIVTTGSTK YFKTNTDGAD ANAQGADSVA IGSGSIAAAE NSVALGTNSV ADEANTVSVG SSTQQRRITN VAAGVNNTDA VNVAQLKASE AGSVRYETNA DGSVNYSVLN LGDGSGGTTR IGNVSATVND TDAVNYAQLK RSVEEANTYT DQKMGEMNSK IKGVENKMSG GIASAMAMAG LPQAYAPGAN MTSIAGGTFN GESAVAIGVS MVSESGGWVY KLQGTSNSQG DYSAAIGAGF QW
|
| |