Gene SNSL254_A4537 details


Gene Information

Locus tag: SNSL254_A4537
Symbol:
ID: 6485695
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Newport str. SL254
Kingdom: Bacteria
Replicon accession: NC_011080
Strand:
Start bp: 4411877
End bp: 4413358
Gene Length: 1482 bp
Protein Length: 493 aa
Translation table: 11
GC content: 56%
IMG OID: 642739763
Product: gp19
Protein accession: YP_002043445
Protein GI: 194444988
COG category: [R] General function prediction only
COG ID: [COG5301] Phage-related tail fibre protein
TIGRFAM ID:
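
The coordinate and length fields above are mutually consistent, which a short sketch can check. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4411877, 4413358)
print(length)                  # 1482
print(protein_length(length))  # 493
```

Both results match the Gene Length and Protein Length fields of this record.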


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 73
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATAATG AGTTTTATAC CCTCCTGACC GACTGGGGAA TGGCGAAAAT CGCCAGTGCC 
CTTGCGGATA AGAAACAGAT ACATCTGCAA AAGATGGCGG TTGGCGACGG CGGCGGACAA
TATTATGAAC CGACCGCCAG CCAGACCAAT TTACGCCACG AAGTCTGGCG CGGCGAGATG
AATACGCTGA CCGTTGCGCC GAATAACCCT AACTGGCTGA TTGCCGAGTT GGTGCTGCCG
GAGGATGTTG GCGGCTGGTA CGTACGTGAA GTGGGCGTAT TCGACGACGA GGGCGAGCTG
ATCGCCATCG GCAAATTCCC GGAATCCTAC AAACCGCTGC TGCCGGGCGG CTGCGGCAAG
CAGGTCTGTA TCCGCCTGAT TATGGAGGTC TCCAACACCA CGGCGGTGAC GCTGACGGTC
GATCCGAGCA TTGTGCTGGC GACGCGCGAC TATGTGGATG CCCGGCTGGA CGAGCATGAA
CATTCGACAA ATCACCCGGA TGCGACATTA ACGCAGAAAG GGTTTACGCA GCTCAGTAAT
GCCACCGACA GCGATGACGA AACCAAAGCG GCTACGCCAA AGGCGGTAAA AGCGGCGATG
GCGGAAGCGC GTAATCACAC GCATACCTGG AACCAGATTA CCGGCGTTCC GGACGGTACG
CTGACGCAAA AGGGGATTGT TAAGCTTAGT AGTGCTACTG ACAGCACCAG CACAACGGAA
GCGGCAACGC CGAGTGCAGT CAAGGCGGCG ATGGATAAGG CGAATGCGGC AGCTCCGGCC
AGCCATACTC ACGCCTGGAA CCAGATTACC GGCGTCCCGG ACGGCACGCT GACGCAAAAA
GGGATCGTGA AACTTAACAG CGCGACGGAC AGCACCAGTA CCACAGAAGC GGCAACGCCC
AGTGCGGTAA AGGCGGCGAT GGATAAGGCG AGTGCGGCGG CCCCGGCCAG CCATACTCAC
GCCTGGGGGC AGATCACCGG CACCCCGGAC GGTACGCTGA CGCAAAAAGG GATCGTGAAG
CTTAATAACG CCACCGACAG CACCAGTACG ACGGAGGCGG CGACGCCGAG CGCGGTGAAA
GCGGCGTATG ACCTGGCGAA TGGGAAGGCG GCGGGGAGTC ACAAACATGC GTGGGGGGAT
ATTACCGACG TGCCGGATGG GACTACGGCG CAGAAAGGGA TCGTAAAGCT CAACAGTGCA
ACGAACAGCA CCAGTACGAC GGAGGCAGCG ACGCCGAGCG CGGTAAAGGC GGCGTATGAT
TTGGCAAAAA GCAAAACCTC TGCAACGAAT ATATATACCA GGACACAATC TGATGCACGA
TACGTGCAAA ATGTTATGTT AGGTGCAGAG GTACAAGCAC CAACAATGGC ACCTGCTGGA
TGTGTAATAA CATTTGTTGA TGGTGGTGAT AAAATGGAAT GTGTGAGATA TAAACCACTT
CAGATTAACA TCAACGGTTT TTGGCGAACT ATTTCAGGAT AA
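
The gene sequence is listed in space-separated blocks of 10 bases, so it has to be joined before any analysis. The sketch below shows one way to do that and to compute a GC percentage comparable to the GC content field. The helper names are hypothetical, and only the first listed line is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence as listed in this record.
first_line = "ATGGATAATG AGTTTTATAC CCTCCTGACC GACTGGGGAA TGGCGAAAAT CGCCAGTGCC"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Passing all listed lines to `clean_sequence` should give a sequence whose length equals the Gene Length field.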
 
Protein sequence
MDNEFYTLLT DWGMAKIASA LADKKQIHLQ KMAVGDGGGQ YYEPTASQTN LRHEVWRGEM 
NTLTVAPNNP NWLIAELVLP EDVGGWYVRE VGVFDDEGEL IAIGKFPESY KPLLPGGCGK
QVCIRLIMEV SNTTAVTLTV DPSIVLATRD YVDARLDEHE HSTNHPDATL TQKGFTQLSN
ATDSDDETKA ATPKAVKAAM AEARNHTHTW NQITGVPDGT LTQKGIVKLS SATDSTSTTE
AATPSAVKAA MDKANAAAPA SHTHAWNQIT GVPDGTLTQK GIVKLNSATD STSTTEAATP
SAVKAAMDKA SAAAPASHTH AWGQITGTPD GTLTQKGIVK LNNATDSTST TEAATPSAVK
AAYDLANGKA AGSHKHAWGD ITDVPDGTTA QKGIVKLNSA TNSTSTTEAA TPSAVKAAYD
LAKSKTSATN IYTRTQSDAR YVQNVMLGAE VQAPTMAPAG CVITFVDGGD KMECVRYKPL
QININGFWRT ISG
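
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is a minimal illustration, not the database's own method. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares, and it does not handle the alternative start codons that table 11 permits. That limitation does not matter here because the gene starts with ATG.

```python
from itertools import product

# Standard genetic code in NCBI base order (T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(seq: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and the final listed line of the gene sequence in this record.
print(translate("ATGGATAATG AGTTTTATAC CCTCCTGACC"))                  # MDNEFYTLLT
print(translate("CAGATTAACA TCAACGGTTT TTGGCGAACT ATTTCAGGAT AA"))    # QININGFWRTISG
```

Both fragments agree with the start and end of the listed protein sequence. The final codon TAA is the stop and is not translated.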