Gene SNSL254_A4383 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A4383 
Symbol 
ID6484064 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp4255458 
End bp4256645 
Gene Length1188 bp 
Protein Length395 aa 
Translation table11 
GC content54% 
IMG OID642739625 
Productphage tail sheath protein 
Protein accessionYP_002043319 
Protein GI194444297 
COG category[R] General function prediction only 
COG ID[COG3497] Phage tail sheath protein FI 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.301789 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value0.000355682 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGTGATT TTCACCACGG CACGCAGGTC ATCGAAATTA ATGACGGTAC GCGTGTTATT 
TCCACAGTAG CGACTGCGGT CGTCGGCATG GTTTGTACAG CCAGCGATGC AGATGCCACG
CTATTTCCCC TCAATGAACC GGTACTGATT ACCAATGTGC AAAGCGCCAT TGCGAAAGCC
GGTAAAAAAG GCACGCTGGC TGCATCACTG CAGGCCATTG CAGACCAGTC AAAACCCGTC
ACTGTTGTTG TACGTGTTGA GGATGGAACC GGCGATGACG AGGAAGCTGC GCTCGCACAG
ACTGTTTCCA ACATTATCGG AGGTACGGAT GAGAACGGTA AATACACCGG TATCAAGGCT
CTCCTGACCG CTCAGGCCGT CACCGGCGTC AAGCCGCGTA TTCTTGGGGT GCCGGGGCTG
GATACTAAAG AGGTCGCGGT CGCGCTTGCG TCGGCTGCCA TTAAGTTACG TGCATTTGCT
TACGTCAGCG CGTGGGGATG TAAGACTATT TCCGAAGCGA TGGAATATCG TAAAAATTTC
AGCCAGCGCG AGTTGATGGT TATCTGGCCT GATTTCCTCG CATGGGACAC CGTCAAAAAT
ACCACCGCAA CGGCTTACGC CACTGCGCGT GCACTCGGCC TGCGTGCTTA CATCGACCAG
ACTGTCGGCT GGCACAAAAC CCTGTCTAAC GTTGGTGTAC AGGGAGTTAC CGGCATCAGC
GCCTCAGTGT TCTGGGATTT GCAGGCATCC GGCACCGATG CTGACCTGCT CAACGAGGCC
GGGGTTACAA CGCTGGTACG CAAGGACGGT TTCCGTTTCT GGGGTAACCG CACCTGCTCA
GATGACCCGC TTTTTCTGTT TGAGAACTAC ACCCGCACCG CGCAGGTACT GGCCGACACG
ATGGCTGAGG CGCACATGTG GGCGGTCGAT AAGCCCATTA CCGCTACGCT CATTCGTGAC
ATTGTTGACG GAATCAATGC CAAATTCCGC GAGCTGAAAT CAAACGGCTA CATCGTGGAG
GGTAAATGCT GGTTCGATGA GGAATCGAAC GACAAGGAAA CCCTCAAGGC CGGGAAACTG
TATATCGACT ACGACTATAC ACCGGTTCCG CCACTGGAAA GCCTGACCCT GCGCCAGCGT
ATCACCGATA AATATCTGGT GAATCTGGCC GAATCGGTCA ACAGCTAA
 
Protein sequence
MSDFHHGTQV IEINDGTRVI STVATAVVGM VCTASDADAT LFPLNEPVLI TNVQSAIAKA 
GKKGTLAASL QAIADQSKPV TVVVRVEDGT GDDEEAALAQ TVSNIIGGTD ENGKYTGIKA
LLTAQAVTGV KPRILGVPGL DTKEVAVALA SAAIKLRAFA YVSAWGCKTI SEAMEYRKNF
SQRELMVIWP DFLAWDTVKN TTATAYATAR ALGLRAYIDQ TVGWHKTLSN VGVQGVTGIS
ASVFWDLQAS GTDADLLNEA GVTTLVRKDG FRFWGNRTCS DDPLFLFENY TRTAQVLADT
MAEAHMWAVD KPITATLIRD IVDGINAKFR ELKSNGYIVE GKCWFDEESN DKETLKAGKL
YIDYDYTPVP PLESLTLRQR ITDKYLVNLA ESVNS