Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SNSL254_A2794 |
Symbol | |
ID | 6484119 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | - |
Start bp | 2734758 |
End bp | 2737130 |
Gene Length | 2373 bp |
Protein Length | 790 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 642738118 |
Product | side tail fiber protein |
Protein accession | YP_002041852 |
Protein GI | 194443605 |
COG category | [M] Cell wall/membrane/envelope biogenesis [R] General function prediction only |
COG ID | [COG3064] Membrane protein involved in colicin uptake [COG5301] Phage-related tail fibre protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 68 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCAGTAC TTATTTCCGG CGTACTGAAA GATGGTACGG GAACGCCGGT ACAGAACTGC ACCATTCAGC TGAAGGCCTG CCGGACCAGT ACGACGGTGG TCGTGAATAC GGTGGCATCG GAAAATCCGG ATGACGCCGG GCGCTACAGC ATGGATGTGG AGCAGGGGCA GTACACAGTC ACGCTCCTGG TGGAAGGGTA TCCCCCGTCA CATGCCGGAG TTATTACGGT CTACGATGAT TCAAAACCGG GCACCCTGAA TGATTTTCTG GGGGCCATGA CGGAAGACGA CGTCCGCCCG GAGGCGCTGC GACGTTTTGA GGCGATGGTG GAAGAAGTTG CCCGCCAGGC ATCGGAGGCA TCGCGGAATG CCACCGCCGC AGGGCAGGCA TCTGAACAGG CGCAGACATC AGCAGGTCAG GCAGCGGAAA GCGCCACGGC AGCAGTGAAT GCAGCCGGAG CGGCAGAAGC ATCAGCCACA CAGGCAGCCT CATCCGCAGC ATCTGCGGAG AGCAGCGCAG GTACGGCGAC CACAAAAGCC GGGGAGGCAT CAGCCAGCGC GGCGTCGGCT GACACAGCCA GAACGGCGGC AGCCGCATCG GCAGCCGCAG CGAAAACATC TGAAGCGAAT GCAGATGTCT CCCGTACTGC CGCCGGCGAT TCAGCTGCTG CCGCAGCCGC CAGCGCGACG GCGGCGCAGA CATCAGCAGC GCGCGCCGGA GCATCCGAAA CCGCCGCGAA GACGTCAGAA ACGCAGGCGG CTTCCAGTGC CGGTGATGCA GGTGCGTCAG CCACTGCGGC GGCAGCGTCG GAAAAGGCGG CAGCCGCATC GGCAGCCGCA GCAAAAATAT CTGAGACAAA CGCTGCAACG TCAGCAAGTA CAGCAGCGGC CAGCGCAACA GCCGCCTCGT CATCAGCATC GGAGGCATCC AATCACGCCG CCGCATCTGA TACCAGCGCA TCACTGGCGG CGCAAAGCAG TACTGCTGCC GGAGCAGCAG CCACCAGAGC AGAAGATGCC GCAAAACGGG CAGAAGACAT CGCGGACGTG ATTTCCCTGG AAGATGCCAG CCTGACGAAA AAAGGTATCG TTAAGTTAAG CAGCGCCACG GACAGTGACA GCGAAGCGCT GGCAGCCACG CCAAAGGCGG TCCATGCTGT CATGGACGAG GTACAGACCA AAGCGCCGCT GGACAGTCCG GTATTCACTG GAACGCCGAC CACACCGACG CCGCCAGATG ACGCTAAGGG ACTTCAGACT GCAAACGCTG AGTTTGTTCG TAAACTGATT GCTGCACTGG TCGGTTCCGT ACCTGAGTCG CTGGATACGC TGCAGGAACT GGCTGACGCG CTGGGTAACG ATCCTAATTT TGCTACCACT ATCACTAACA TGATTGCGGG CAAGCAGCCG CTGGACGATA CACTGACGGC GCTGTCAGGA AAAAGCATTG AAGGTCTTAT CGAATACGTT GGTTTACGGA GCACAATTGA TAAGGCTGCT GGTGCGTTGC CTGCTGGTGG TACGGCTGTC GCAGCGAACA GGCTTGCATC ACGCGGCGCG CTTCCGGCAC TGACTGGCAC GACAAGAGGC AGCGATGGCG GCCTGATAAT GGGCGAGGTC TACAACAATG GCTATCCGAC GCAATACGGA AATATTTTAC GTCTGACCGG AACCGGTGAT GGGGAAATCC TCATTGGCTG GAGCGGGACA AATGGTGCGC CAGCGCCCGC ATATATTCGC AGCCATCGAG ATACCGCCGA TGCTGAGTGG TCCGAATGGG CAATGCTTTA CACCACACTA AACCCACCTC CGGATTCGCA TCCAGTAGGG GCGGCGATTG CATGGCCATC TGATGCTACT CCGGCAGGTT ACGCTCTGAT GCAGGGGCAG TCCTTCGATA AATCTGCTTA CCCGTTACTG GCTATAGCGT ATCCGTCCGG CGTTATCCCT GACATGAGAG GCTGGACAAT AAAGGGTAAG CCCATCAGTG GACGTGCCGT ATTGTCGCAA GAAATGGACG GCAATAAATC GCACTCGCAC ACCGCGCGGG CGCAGGATAC TGACTTAGGG ACAAAATCTA CCTCATCTTT TGATTACGGC ACGAAATCGA CCAATACCAC GGGCAACCAT ACTCACCAGT TCGGCGGTTA TATCAATTCA TACTGGGGAG ACTCCAATCA CACCTCATTT CAGCCTGGAG GTGGTGCATG GACACAGGCC GCTGGCGACC ATGCGCATAC AGTTTATATC GGAGGACACG AGCACACCAT GTATATCGGT CCACACGGTC ACGTCGTTAT TGTGGACGCA GACGGTAATG CGGAAACCAC GGTTAAAAAT ATTGCATTTA ACTACATAGT GAGGCTGGCA TGA
|
Protein sequence | MPVLISGVLK DGTGTPVQNC TIQLKACRTS TTVVVNTVAS ENPDDAGRYS MDVEQGQYTV TLLVEGYPPS HAGVITVYDD SKPGTLNDFL GAMTEDDVRP EALRRFEAMV EEVARQASEA SRNATAAGQA SEQAQTSAGQ AAESATAAVN AAGAAEASAT QAASSAASAE SSAGTATTKA GEASASAASA DTARTAAAAS AAAAKTSEAN ADVSRTAAGD SAAAAAASAT AAQTSAARAG ASETAAKTSE TQAASSAGDA GASATAAAAS EKAAAASAAA AKISETNAAT SASTAAASAT AASSSASEAS NHAAASDTSA SLAAQSSTAA GAAATRAEDA AKRAEDIADV ISLEDASLTK KGIVKLSSAT DSDSEALAAT PKAVHAVMDE VQTKAPLDSP VFTGTPTTPT PPDDAKGLQT ANAEFVRKLI AALVGSVPES LDTLQELADA LGNDPNFATT ITNMIAGKQP LDDTLTALSG KSIEGLIEYV GLRSTIDKAA GALPAGGTAV AANRLASRGA LPALTGTTRG SDGGLIMGEV YNNGYPTQYG NILRLTGTGD GEILIGWSGT NGAPAPAYIR SHRDTADAEW SEWAMLYTTL NPPPDSHPVG AAIAWPSDAT PAGYALMQGQ SFDKSAYPLL AIAYPSGVIP DMRGWTIKGK PISGRAVLSQ EMDGNKSHSH TARAQDTDLG TKSTSSFDYG TKSTNTTGNH THQFGGYINS YWGDSNHTSF QPGGGAWTQA AGDHAHTVYI GGHEHTMYIG PHGHVVIVDA DGNAETTVKN IAFNYIVRLA
|
| |