Gene SNSL254_A3102 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A3102 
Symbol 
ID6484277 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp3016986 
End bp3018104 
Gene Length1119 bp 
Protein Length372 aa 
Translation table11 
GC content45% 
IMG OID642738414 
Productinvasion protein InvE 
Protein accessionYP_002042138 
Protein GI194444612 
COG category 
COG ID 
TIGRFAM ID[TIGR02511] type III secretion effector delivery regulator, TyeA family
[TIGR02568] type III secretion regulator YopN/LcrE/InvE/MxiC 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value0.00000387053 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGATTCCTG GCTCAACCTC CGGTATTTCA TTTTCCAGAA TATTGTCCCG GCAGACATCT 
CATCAGGATG CGACCCAGCA TACTGATGCG CAACAGGCGG AAATACAACA GGCCGCAGAG
GATTCGTCTC CAGGGGCGGA AGTACAAAAA TTTGTCCAGT CGACGGACGA AATGTCAGCG
GCGCTGGCGC AATTTCGTAA CCGTCGCGAT TATGAAAAAA AATCCAGTAA TTTATCTAAC
AGTTTTGAAC GCGTGCTGGA GGATGAGGCT TTACCGAAGG CGAAGCAAAT CTTAAAGCTA
ATTAGCGTAC ATGGCGGCGC GTTAGAAGAT TTTTTACGTC AGGCGCGTAG CTTATTTCCT
GACCCCAGTG ATTTAGTCCT TGTGTTACGC GAATTGCTTC GTCGTAAAGA CCTGGAAGAG
ATCGTGCGGA AAAAGCTGGA GTCGTTACTT AAGCACGTTG AAGAGCAAAC CGATCCGAAG
ACCCTCAAGG CAGGGATTAA TTGTGCGTTG AAGGCCCGGC TTTTTGGGAA AACATTATCG
TTAAAACCAG GCTTATTGCG CGCCAGCTAT CGGCAATTTA TCCAGAGTGA ATCACATGAA
GTGGAGATTT ACTCTGACTG GATAGCCAGT TATGGCTATC AACGTCGACT GGTGGTACTG
GATTTTATTG AGGGTTCGCT ATTAACCGAT ATTGACGCGA ATGACGCCAG TTGTTCGCGC
CTGGAGTTTG GCCAGCTTTT ACGACGCCTG ACGCAACTTA AAATGTTGCG CTCCGCTGAC
CTACTGTTTG TGAGTACATT GTTGTCGTAT TCGTTTACCA AAGCGTTTAA TGCGGAGGAG
TCGTCGTGGT TACTACTGAT GCTTTCGCTA TTGCAACAGC CACATGAAGT GGATTCGCTG
TTAGCCGATA TTATAGGTTT GAATGCGTTA TTGCTTAGTC ATAAAGAACA TGCATCCTTT
TTGCAGATAT TTTATCAAGT ATGTAAAGCC ATACCCTCTT CACTCTTTTA TGAAGAATAT
TGGCAGGAAG AATTGTTAAT GGCGTTACGT AGTATGACCG ATATTGCCTA CAAGCATGAA
ATGGCAGAAC AGCGTCGTAC TATTGAAAAG CTGTCTTAA
 
Protein sequence
MIPGSTSGIS FSRILSRQTS HQDATQHTDA QQAEIQQAAE DSSPGAEVQK FVQSTDEMSA 
ALAQFRNRRD YEKKSSNLSN SFERVLEDEA LPKAKQILKL ISVHGGALED FLRQARSLFP
DPSDLVLVLR ELLRRKDLEE IVRKKLESLL KHVEEQTDPK TLKAGINCAL KARLFGKTLS
LKPGLLRASY RQFIQSESHE VEIYSDWIAS YGYQRRLVVL DFIEGSLLTD IDANDASCSR
LEFGQLLRRL TQLKMLRSAD LLFVSTLLSY SFTKAFNAEE SSWLLLMLSL LQQPHEVDSL
LADIIGLNAL LLSHKEHASF LQIFYQVCKA IPSSLFYEEY WQEELLMALR SMTDIAYKHE
MAEQRRTIEK LS