Gene SNSL254_A3101 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A3101 
Symbol 
ID6483908 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp3014904 
End bp3016961 
Gene Length2058 bp 
Protein Length685 aa 
Translation table11 
GC content46% 
IMG OID642738413 
Productinvasion protein InvA 
Protein accessionYP_002042137 
Protein GI194444273 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG4789] Type III secretory pathway, component EscV 
TIGRFAM ID[TIGR01399] type III secretion protein, HrcV family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.758436 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value0.000000202364 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGCTGCTTT CTCTACTTAA CAGTGCTCGT TTACGACCTG AATTACTGAT TCTGGTACTA 
ATGGTGATGA TCATTTCTAT GTTCGTCATT CCATTACCTA CCTATCTGGT TGATTTCCTG
ATCGCACTGA ATATCGTACT GGCGATATTG GTGTTTATGG GGTCGTTCTA CATTGACAGA
ATCCTCAGTT TTTCAACGTT TCCTGCGGTA CTGTTAATTA CCACGCTCTT TCGTCTGGCA
TTATCGATCA GTACCAGTCG TCTTATCTTG ATTGAAGCCG ATGCCGGTGA AATTATCGCC
ACGTTCGGGC AATTCGTTAT TGGCGATAGC CTGGCGGTGG GTTTTGTTGT CTTCTCTATT
GTCACCGTGG TCCAGTTTAT CGTTATTACC AAAGGTTCAG AACGTGTCGC GGAAGTCGCG
GCCCGATTTT CTCTGGATGG TATGCCCGGT AAACAGATGA GTATTGATGC CGATTTGAAG
GCCGGTATTA TTGATGCGGA TGCTGCGCGC GAACGGCGAA GCGTACTGGA AAGGGAAAGC
CAGCTTTACG GTTCCTTTGA CGGTGCGATG AAGTTTATCA AAGGTGACGC TATTGCCGGC
ATCATTATTA TCTTTGTGAA CTTTATTGGC GGTATTTCGG TGGGGATGAC CCGCCATGGT
ATGGATTTGT CCTCCGCCCT GTCTACTTAT ACCATGCTGA CCATTGGTGA TGGTCTTGTC
GCCCAGATCC CCGCATTGTT GATTGCGATT AGTGCCGGTT TTATCGTGAC TCGCGTAAAT
GGCGATAGCG ATAATATGGG GCGGAATATC ATGACGCAGC TGTTGAACAA CCCATTTGTA
TTGGTTGTTA CGGCTATTTT GACCATTTCA ATGGGAACTC TGCCGGGATT CCCGCTGCCG
GTTTTTGTTA TTTTATCGGT GGTTTTAAGC GTACTCTTCT ATTTTAAATT CCGTGAAGCA
AAACGTAGTG CCGCCAAACC TAAAACCAGC AAAGGCGAGC AGCCGCTCAG TATTGAGGAA
AAAGAAGGGT CGTCGTTAGG ACTGATTGGC GATCTCGATA AAGTCTCTAC AGAGACCGTA
CCGTTGATAT TACTTGTGCC GAAGAGCCGG CGTGAAGATC TGGAAAAAGC TCAACTTGCG
GAGCGTCTAC GTAGTCAGTT CTTTATTGAT TATGGCGTGC GCCTGCCGGA AGTATTGTTA
CGCGATGGCG AGGGCCTGGA CGATAACAGC ATCGTATTGT TGATTAATGA GATCCGTGTT
GAACAATTTA CGGTCTATTT TGATTTGATG CGAGTGGTAA ATTATTCCGA TGAAGTTGTT
TCCTTTGGCA TTAATCCAAC AATCTATCAG CAAGGTAGCA GCCAGTATTT CTGGGTAACG
CATGAAGAGG GGGAGAAACT CCGGGAGCTT GGCTATGTGT TGCGGAACGC GCTTGATGAG
CTTTACCACT GTCTGGCGGT GACGCTGGCG CGCAACGTCA ATGAATATTT CGGTATTCAG
GAAACAAAAC ATATGCTGGA CCAACTGGAA GCGAAATTTC CTGATTTACT TAAAGAAGTG
CTCAGACATG CCACGGTACA ACGTATATCT GAAGTTTTGC AGCGTTTGTT AAGCGAACGT
GTTTCCGTGC GTAATATGAA ATTAATTATG GAAGCGCTCG CATTGTGGGC GCCAAGAGAA
AAAGATGTCA TTAACCTTGT GGAGCATATT CGTGGGGCAA TGGCGCGTTA TATTTGCCAT
AAATTCGCCA ATGGCGGCGA ATTACGAGCA GTAATGGTAT CTGCTGAAGT TGAGGATGTT
ATTCGCAAAG GGATCCGTCA GACCTCTGGC AGTACCTTCC TCAGCCTTGA CCCGGAAGCC
TCCGCTAATT TGATGGATCT CATTACACTT AAGTTGGATG ATTTATTGAT TGCACATAAA
GATCTTGTCC TCCTTACGTC TGTCGATGTC CGTCGATTTA TTAAGAAAAT GATTGAAGGT
CGTTTTCCGG ATCTGGAGGT TTTATCTTTC GGTGAGATAG CAGATAGCAA GTCAGTGAAT
GTTATAAAAA CAATATAA
 
Protein sequence
MLLSLLNSAR LRPELLILVL MVMIISMFVI PLPTYLVDFL IALNIVLAIL VFMGSFYIDR 
ILSFSTFPAV LLITTLFRLA LSISTSRLIL IEADAGEIIA TFGQFVIGDS LAVGFVVFSI
VTVVQFIVIT KGSERVAEVA ARFSLDGMPG KQMSIDADLK AGIIDADAAR ERRSVLERES
QLYGSFDGAM KFIKGDAIAG IIIIFVNFIG GISVGMTRHG MDLSSALSTY TMLTIGDGLV
AQIPALLIAI SAGFIVTRVN GDSDNMGRNI MTQLLNNPFV LVVTAILTIS MGTLPGFPLP
VFVILSVVLS VLFYFKFREA KRSAAKPKTS KGEQPLSIEE KEGSSLGLIG DLDKVSTETV
PLILLVPKSR REDLEKAQLA ERLRSQFFID YGVRLPEVLL RDGEGLDDNS IVLLINEIRV
EQFTVYFDLM RVVNYSDEVV SFGINPTIYQ QGSSQYFWVT HEEGEKLREL GYVLRNALDE
LYHCLAVTLA RNVNEYFGIQ ETKHMLDQLE AKFPDLLKEV LRHATVQRIS EVLQRLLSER
VSVRNMKLIM EALALWAPRE KDVINLVEHI RGAMARYICH KFANGGELRA VMVSAEVEDV
IRKGIRQTSG STFLSLDPEA SANLMDLITL KLDDLLIAHK DLVLLTSVDV RRFIKKMIEG
RFPDLEVLSF GEIADSKSVN VIKTI