Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SNSL254_A2974 |
Symbol | |
ID | 6486179 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | - |
Start bp | 2903442 |
End bp | 2904542 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 642738290 |
Product | effector protein pipB2 |
Protein accession | YP_002042019 |
Protein GI | 194444294 |
COG category | [S] Function unknown |
COG ID | [COG1357] Uncharacterized low-complexity proteins |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 57 |
Fosmid unclonability p-value | 0.16662 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATAAATT TTATCATGCA CTGTGTTGCT GTCTCTGGGA GAAAATATAT GGAGCGTTCA CTCGATAGTC TGGCTGGTAT GGCTAAATCT GCTTTTGGCG CGGGGACTTC TGCTGCTATG CGGCAAGCTA CCTCGCCCAA AACCATTCTG GAATATATCA TTAACTTTTT TACCTGTGGT GGGATACGTC GGAGAAATGA AACACAATAT CAGGAATTGA TAGAGACTAT GGCTGAGACA TTGAAAAGTA CAATGCCTGA CAGAGGTGCT CCGTTGCCAG AAAACATCAT CCTGGATGAT ATGGATGGGT GTCGTGTCGA ATTTAATCTT CCTGGTGAGA ATAACGAAGC TGGACAAGTT ATTGTACGAG TCAGTAAAGG CGACCATTCT GAGACAAGAG AAATTCCGCT TGCCTCTTTT GAAAAAATAT GCCGAGCTTT ACTATTCAGA TGCGAATTTT CTCTCCCTCA GGATTCTGTA ATATTAACTG CCCAGGGAGG CATGAATCTT AAAGGCGCTG TCCTTACCGG AGCAAATCTG ACGTCAGAAA ATTTATGTGA CGCAGACTTA AGCGGCGCAA ATTTAGAGGG GGCAGTGCTG TTTATGGCGG ATTGTGAAGG TGCAAATTTT AAGGGCGCAA ATCTATCGGG AACATCACTA GGCGACAGTA ATTTCAAGAA CGCCTGTCTG GAAGATAGCA TTATGTGTGG CGCTACCCTC GATCACGCTA ATCTTACTGG CGCCAATTTA CAACACGCGA GTCTGTTAGG CTGTAGCATG ATAGAATGTA ATTGCTCCGG TGCAAATATG GATCACGCTA ATGTTTCAGG CGCAACCCTT ATACGTGCCG ATATGAGCGG TGCGACATTA CAGGGTGCTA CTATAATGGC TGCTGATATG GAAGGCGCTA TCTTAACCCG GGCAAACCTG CAAAAGGCGA GTTTCATTTC TACGAACCTG GGCGAGGCTG ATTTGTCCGA AGCTAATTTA AAAAATACCA GTTTTAAAGA TTGTACACTA ACCGATTTGC GTACTGAAGA CGCCACAATG TCTACGAGTA CACAAACACT CTTTAACGTA TTTTATAGTG AAAATATTTA G
|
Protein sequence | MINFIMHCVA VSGRKYMERS LDSLAGMAKS AFGAGTSAAM RQATSPKTIL EYIINFFTCG GIRRRNETQY QELIETMAET LKSTMPDRGA PLPENIILDD MDGCRVEFNL PGENNEAGQV IVRVSKGDHS ETREIPLASF EKICRALLFR CEFSLPQDSV ILTAQGGMNL KGAVLTGANL TSENLCDADL SGANLEGAVL FMADCEGANF KGANLSGTSL GDSNFKNACL EDSIMCGATL DHANLTGANL QHASLLGCSM IECNCSGANM DHANVSGATL IRADMSGATL QGATIMAADM EGAILTRANL QKASFISTNL GEADLSEANL KNTSFKDCTL TDLRTEDATM STSTQTLFNV FYSENI
|
| |