Gene SNSL254_A2805 details

Gene Information

Locus tag: SNSL254_A2805
Symbol:
ID: 6482390
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Newport str. SL254
Kingdom: Bacteria
Replicon accession: NC_011080
Strand:
Start bp: 2748210
End bp: 2749460
Gene Length: 1251 bp
Protein Length: 416 aa
Translation table: 11
GC content: 54%
IMG OID: 642738129
Product: Peyer's patch-specific virulence factor GipA
Protein accession: YP_002041863
Protein GI: 194442869
COG category: [L] Replication, recombination and repair
COG ID: [COG0675] Transposase and inactivated derivatives
TIGRFAM ID: [TIGR01766] transposase, IS605 OrfB family, central region
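
The coordinates and lengths listed above agree with each other if Start bp and End bp are read as 1-based inclusive positions and the gene length counts the stop codon. The sketch below checks that arithmetic. The function names are illustrative, and the inclusive-coordinate convention is an assumption rather than something this record states.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


if __name__ == "__main__":
    length = gene_length(2748210, 2749460)
    print(length)                  # 1251
    print(protein_length(length))  # 416
```

Under these assumptions, 2749460 - 2748210 + 1 = 1251 bp, and 1251 / 3 - 1 = 416 aa, matching the Gene Length and Protein Length fields.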


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 63
Fosmid unclonability p-value: 0.537832
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTTTGCTC ATCAGAATGC CTCCTTTCCT CCCCGGCCTG AAGGCCGGGG AGGAAAGGAG 
GCGGTTTTCC GGCTAACTGT CTTTTGCATA ATCACATTTT CCTCTTTAAC ATGTGAAGCC
ATGAAACGCG CATATAAATA CCGGTTTTAC CCCACGACTG AGCAGGCTGA GCTTTTAGCT
CAGACGTTCG GTTGTGTGCG TTTCGTCTAC AACTCCATCC TCCGCTGGCG TACCGATGCG
TACTACGAGC GAAAGGAAAA GATCGGTTAC CTACAGGCCA ACGCTCGCCT TACGGCGCTG
AAAAAGGAGC CAGAATTTGC CTGGCTTAAC GACGTTTCCT GCGTTCCCCT CCAGCAGTCT
TTGCGCCACC AACAAACCGC CTTTGCTAAC TTCTTCGCCG GACGGGCTGC ATATCCGGCT
TTCAAAAGCA AACGGCACAA GCAGGCGGCT GAGTTCACTG CGAGCGCGTT TAAATACCGC
GACGGCAAGC TGTACATGGC AAAGAACAAA ATCCCCTTAG ACGTGCGCTG GAGTCGTCCG
CTGCCGTCCG TGCCGTCTAC CGTCACCATT TCCAAAGATG CCGCAGGGCG GTACTTTGTT
TCGTGCCTTT GCGAATTTGA ACCCGCATCA CTGCCGATCA CCTCTTCAAT GGTCGGCATT
GATGTTGGTT TAAAAGATTT GTTCGTCACC GATACCGGAT TCAGGTCCGG CAATCCCCGC
CATACCGCTA AATACGCGGC TCGCCTGGCA CTACTCCAGC GCCGGTTAAG CAAAAAGGCC
AAAGGCTCAA AGAACCGCGC CAAAGCCCAC TTAAAGGTAG CCCGACTCCA CGCGAAAATT
GCTGATTGCC GACTGGATGC CCTGCACAAG GCCACCCGCA AACTGATTAA CGATAACCAA
GTTGTATGCG TCGAATCCCT GAAAGTGAGG AACATGATCC GCAACCCGTC GCTATCCAAA
GCAATAGCAG ACGCGAGCTG GGGCGAACTT GTGCGCCAGC TCCGGTACAA AGGCGAATGG
GCGGGGCGGT CAGTGGTAGC CATTGACCAG TTTTTCCCGT CCTCAAAACG CTGTAGCTGT
TGCGGTTTCA TCATGAAAAA AATGCCTCTT GATGTTCGTA AATGGCAGTG CCCTGAGTGC
GGAACTGACC ACGACCGGGA CGTTAACGCG GCACGTAATA TCAAAGCTGC CGGGCTGGCA
GTGTTAGCCC ACGGAGAGCC TGTAAATCCT GAATCGCTCA AAGCGGCTTA G
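
The gene sequence is printed in blocks of 10 bases, so the whitespace has to be removed before any analysis. The following is a minimal sketch of that cleanup and of a GC-fraction calculation that could be compared with the reported GC content of 54%. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example uses only the first two blocks of the sequence.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(blocked.split()).upper()


def gc_fraction(sequence: str) -> float:
    """Fraction of bases that are G or C, computed over the cleaned sequence."""
    seq = clean_sequence(sequence)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


if __name__ == "__main__":
    first_blocks = "GTGTTTGCTC ATCAGAATGC"  # first two blocks of the gene sequence
    print(len(clean_sequence(first_blocks)))  # 20
    print(gc_fraction(first_blocks))          # 0.45
```

Passing the full sequence to `gc_fraction` gives a value to compare with the GC content field.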
 
Protein sequence
MFAHQNASFP PRPEGRGGKE AVFRLTVFCI ITFSSLTCEA MKRAYKYRFY PTTEQAELLA 
QTFGCVRFVY NSILRWRTDA YYERKEKIGY LQANARLTAL KKEPEFAWLN DVSCVPLQQS
LRHQQTAFAN FFAGRAAYPA FKSKRHKQAA EFTASAFKYR DGKLYMAKNK IPLDVRWSRP
LPSVPSTVTI SKDAAGRYFV SCLCEFEPAS LPITSSMVGI DVGLKDLFVT DTGFRSGNPR
HTAKYAARLA LLQRRLSKKA KGSKNRAKAH LKVARLHAKI ADCRLDALHK ATRKLINDNQ
VVCVESLKVR NMIRNPSLSK AIADASWGEL VRQLRYKGEW AGRSVVAIDQ FFPSSKRCSC
CGFIMKKMPL DVRKWQCPEC GTDHDRDVNA ARNIKAAGLA VLAHGEPVNP ESLKAA
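
The protein sequence begins with M although the gene sequence begins with GTG. This is consistent with translation table 11 (bacterial, archaeal and plant plastid code), in which GTG can act as an initiation codon and is then read as methionine. The sketch below illustrates this with a small translator; it is not the pipeline that produced this record, and the name `translate_cds` is hypothetical. The amino-acid assignments are those of the standard code, which table 11 shares.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G) paired with the standard amino-acid string.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiation codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS; an alternative start codon in first position is read as M."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    print(translate_cds("GTGTTTGCTC ATCAGAATGC C"))  # MFAHQNA
    print(translate_cds("GAATCGCTCA AAGCGGCTTA G"))  # ESLKAA
```

The two calls use the first 21 bases and the final line of the gene sequence; the outputs match the first seven and last six residues of the protein sequence above.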