Gene Information |
Locus tag | SeSA_A0657 |
Symbol | |
ID | 6515986 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633 |
Kingdom | Bacteria |
Replicon accession | NC_011094 |
Strand | + |
Start bp | 638552 |
End bp | 641722 |
Gene Length | 3171 bp |
Protein Length | 1056 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 642745800 |
Product | gp21 |
Protein accession | YP_002113623 |
Protein GI | 194735627 |
COG category | [S] Function unknown |
COG ID | [COG5281] Phage-related minor tail protein |
TIGRFAM ID | [TIGR01541] phage tail tape measure protein, lambda family; [TIGR02675] tape measure domain |
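The Start bp, End bp, Gene Length, and Protein Length fields in this record are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the final codon of the unspliced CDS is a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp 638552, End bp 641722
length = gene_length(638552, 641722)
print(length, protein_length(length))  # prints: 3171 1056
```

Under these assumptions the computed values agree with the Gene Length (3171 bp) and Protein Length (1056 aa) fields listed in the table.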
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.000011896 |
Fosmid hitchhiking | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACCCAGA ACGTCGGCGA TATTGAATAT GTGATAAAGG CGGATACTGC TCAGTTGCTG AGAGCTGACA AACAGGTTCG TGACGTAACA GACGGCATGG AAGTCGGTTT CAAGCGGGCT GATAAGGCTG CATCATCTCT CACTTCCTCA TTCGGATCTT TGAGCCGGGT TGCCACTTCT CTGATGGCTA TCCTGTCGGT TCAACAGGTA TCTCAATACG CTGACGCCTG GACAACGCTC AACAACAAAC TGGCTAACGC CATCAGGCCA AGCGAGCAAC TGGTCGATGT GACGGAGCGC GTATTTAATA TCACGCAGCA AACTCGCGGG AGTCTGGATG CTACGGCTTC TTTGTACGCC AGACTTGAAC GAGCCACCCG GGAATATGGA ACCAGTGCTG ACGATCTGGC TAAGCTGACT ACCATCATCA ACCAGGGATT TGTTGTCTCT GGTGCGACTG CACAAGAGGC CGAAAACGCC ATTATCCAGT TGTCTCAGGG GCTGGCATCT GGCGCGCTGC GCGGTGAAGA ATTTAACTCA GTGAATGAGC AGGGCAACCG CCTGATCGTT GCACTTGCCG ACTCAATGGG TGTTGGTATT GGGCAGATGC GTCAGATGGC CGCCGCCGGG AAGTTGACTA CTGATGTTGT GGTTAACGGA TTACTTTCAC AAGGGGTGAC GATCGGCAAT GAGTTCGCCA ATACCACGAC AACTATCAGT CAGGCATTGC AGGTTGCCGG GAATAACATC ACCAAGTTCT TTGGTGAAAA CTCCACGGTA AAAACCGGCA CAGCCATTTT TAACGATGCC GTGATCAGCG TCAGTGAGAA CATCGGCGCT CTTAGCGCCA TCCTGACCGC CGCTGCTGCT GTTATGGGTA GCCGCTACGT TGGCGCACTG ACAATGGCTA CTGCTGCGAA GGTAAAGGCC GCTGTAGCTG CAAGAAATCA GTCTGCTGCT GAAATGCAGG CGGCGCAGGC CGTTGCAAAT AAAGCTGCCG CCGACCTCCG CGCAGCCGCT ATCGCAAAAG AACGGGCGCT TGACGAGATC CGCCTTGCGG AGATGATGAA GCAGACAGCG GTTAGTGCGA CGAATGCCGC CGCTGCCGAG CAACGCTTAT CTTCCGCTCG CGTAGCCGCT GCTGGTGCTG TTGATAATTA CAACCGCGCT CTGGCAGCAA ATAAAGCGGC ACAGGCTGGG TTAGCTACTG GAGCAGGGTT GGTTAGCCGA GGATTGTCTC TCATAGGTGG CCCAGCTGGT GCTGCCATGC TCGCGGCCAG TGCGATTCTA TATTTCTCTC AGCGAGCTAA AGAGGCCAGA GACGATGCCA ATAACCTGGC GGATAGCGTC AATGAACTGA GCGCTAAGTT CCAGACTATG TCGCATACCG AGTTGGCAGC CACCATTGGC AAGTTGAGCA AGAATCTGCC AGAACTAAGC GATGCGGTAG CCGACGCACA GAAAGAATTT AACGACGCTG AATATGCAGT AAAAAACTAT AACCGCGAAA TAGGACGATA TGGCAACACC ACAAGAGGGA GAGAGGCAGC AGAAGCATTG TCTGGTGCTC AAAATAGACT TGCAATAGCT ACTTTCGAAC TTGAAAAAGC ACAAAACAGA TTAAGCCAGA CCCAGAACGC CATTAATATT GGACAGGCAA CACTCAATGG CACCATGCGA CAAGGCCTAC CGCTTCTCCA GAGAGAAGGC GAGGAAGCTG GTATCACTGC CGGTATGATG GGCAAGCTTG GCGATATGAT CAATTTCGCC GCCAAAGCGA AGGAGAAATA TAACTCTTCC 
AGTCTGATGG TTATGCGCAG CGAGGATGGA GATAAACTCC TGTCCAGCCT TGAAAAGCAA AACAATCTGC TGTCCATAAC AGACAAAAAA GAAAGGGCTG TAGCCGAGGC CAGACAAGCG GCCCTGGATG CGGGGGTGGA TGCGCATTCA AATCAGATGA GGCAGATTGA AGAGGCCGCG GCAAAAAGAT ATGACCTTCA GGAGGCTGAT TCAGCAGTAA CAAAGTCTAC AAAAGAGGGA ACTAAAGCTG TTGATGAGGC TGCGCAGTCA CTTTCAAGGC AACAGGCTGC TCTCGATCGC CTGAACACTG GTTACGCCGA TGGCTCGCTC GAATTAGCGA AATACGATGC TGTTGTTGCG CTTGGTAACA AAGCATCAGC AGAGCAGATC GCCAAAGCGG AACAGCAAGC TGAGTCCATC TGGAAAGTAC AGCAGGCAAC CAAAGCCGCA GCAGAAGAGG AAAGGAAGCG CACACAGGCC GGACAAAACT TTACCGGATT GCAAGGGCAG GTATCACCAG TCGCCGCAGT AGATAACACC TACGCGCAGC AAATGGCACA GCTTGATGAG TATGTGCAAC TTTACCCACA GAAGATTGCA GAGGCTGAGG CTGTACGCGC AGGGATTGAA GATCAGTATC ACCAGAAACG CATGGCCGCA ATGTGGGAGG AATGGCAGCA GCAGAGCGAG ATCAACAACA TGCTTGGCTC TGCAATCGAT TCCTTACAGG GGGGCGCAAC CAACGCGATT ACCGGCCTTA TCAACGGCAC TCAAAGCCTG CAGGAGTCAT TCGCAAATAT TGGTTCGACA ATACTCAACA GCGTTGTAAG CGCCATTGTG GATATGGGAG TTCAGTATGT TAAGAGCCTG ATTATAGGTA AGGCCATGTC ATCTGCTGCA ACTGCCGCAC AGATTGCTGA GGCTGGCGCT CTTGCAACAG CTTGGGCTCC TGCGGCTATG GCAGCATCTA TTGCGACCCA GGGCAAAGCA TCTGCTATCG GTTTGGCTGC CTATAGTTCT TCCATGGCGG CAGGGCAGGC GCTTTCTATT GCTGGCGCTC GCCGTTACGG CGGCACAGTA TCAGCTGGCA ACGCCTACCG CATCAACGAA GATGGACGCT CTGAAATCTT CCAGACTGCA GGTGGGCAGC AGGCATTCAT CCCGAACCAG TCAGGGAAGA TTATTCCGGC GGACAAGGCC GGAGGTGGCG GGTCGTTTAG CCCTGTAATG AACCTCACGA TAAATACTAC GGGAGGAATT GGTAATGAGG AGATCGCAAG GCTGCGTAAA GTGTGGAACA ACGACATGCT GAAAATGATG GTAGACCAGA GCACGCGGCC GAACGGTTTA CTGCAAGGGC GGAGAAAATA A
|
Protein sequence | MTQNVGDIEY VIKADTAQLL RADKQVRDVT DGMEVGFKRA DKAASSLTSS FGSLSRVATS LMAILSVQQV SQYADAWTTL NNKLANAIRP SEQLVDVTER VFNITQQTRG SLDATASLYA RLERATREYG TSADDLAKLT TIINQGFVVS GATAQEAENA IIQLSQGLAS GALRGEEFNS VNEQGNRLIV ALADSMGVGI GQMRQMAAAG KLTTDVVVNG LLSQGVTIGN EFANTTTTIS QALQVAGNNI TKFFGENSTV KTGTAIFNDA VISVSENIGA LSAILTAAAA VMGSRYVGAL TMATAAKVKA AVAARNQSAA EMQAAQAVAN KAAADLRAAA IAKERALDEI RLAEMMKQTA VSATNAAAAE QRLSSARVAA AGAVDNYNRA LAANKAAQAG LATGAGLVSR GLSLIGGPAG AAMLAASAIL YFSQRAKEAR DDANNLADSV NELSAKFQTM SHTELAATIG KLSKNLPELS DAVADAQKEF NDAEYAVKNY NREIGRYGNT TRGREAAEAL SGAQNRLAIA TFELEKAQNR LSQTQNAINI GQATLNGTMR QGLPLLQREG EEAGITAGMM GKLGDMINFA AKAKEKYNSS SLMVMRSEDG DKLLSSLEKQ NNLLSITDKK ERAVAEARQA ALDAGVDAHS NQMRQIEEAA AKRYDLQEAD SAVTKSTKEG TKAVDEAAQS LSRQQAALDR LNTGYADGSL ELAKYDAVVA LGNKASAEQI AKAEQQAESI WKVQQATKAA AEEERKRTQA GQNFTGLQGQ VSPVAAVDNT YAQQMAQLDE YVQLYPQKIA EAEAVRAGIE DQYHQKRMAA MWEEWQQQSE INNMLGSAID SLQGGATNAI TGLINGTQSL QESFANIGST ILNSVVSAIV DMGVQYVKSL IIGKAMSSAA TAAQIAEAGA LATAWAPAAM AASIATQGKA SAIGLAAYSS SMAAGQALSI AGARRYGGTV SAGNAYRINE DGRSEIFQTA GGQQAFIPNQ SGKIIPADKA GGGGSFSPVM NLTINTTGGI GNEEIARLRK VWNNDMLKMM VDQSTRPNGL LQGRRK
|
| |
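The sequences in this record are printed in space-separated blocks of ten characters, so they need to be cleaned before any computation such as a GC content estimate. The following is a minimal sketch, assuming the only non-sequence characters are whitespace. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the example input is just the first three blocks of the gene sequence, not the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First three blocks of the gene sequence from this record (illustrative subset)
first_blocks = "ATGACCCAGA ACGTCGGCGA TATTGAATAT"
seq = clean_sequence(first_blocks)
print(len(seq), round(gc_percent(seq), 1))
```

Applying the same two functions to the complete gene sequence would give a value to compare against the GC content field of the record.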