Gene SNSL254_A4554 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A4554 
Symbol 
ID6486621 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp4425131 
End bp4426558 
Gene Length1428 bp 
Protein Length475 aa 
Translation table11 
GC content55% 
IMG OID642739780 
Productphage tail sheath protein 
Protein accessionYP_002043462 
Protein GI194442415 
COG category[R] General function prediction only 
COG ID[COG3497] Phage tail sheath protein FI 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00163736 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones61 
Fosmid unclonability p-value0.433622 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAGCTA ATTACCTGCA CGGTGTAGAG ACCATTGAGA TCGAAACCGG CCCACGTCCG 
GTTAAGGCGG TTAAATCTGC GGTTATTGGT CTGATCGGCA CCGCGCCATG CGGCCCGGTT
AACCAGCCAA CGCTGTGCCT TTCTGAAAGC GACGCGGCAC AGTTTGGCCC AGGTCTGGCA
AATTTCACCA TCCCGCAGGC GCTGAAGGCG ATCTACGATC ACGGCGCAGG GACGGTCGTG
GTGATTAACG TGCTGAATCC GGCGGTACAC AAAAGTACCA TTCCCAGTGA AACCGTGAAG
GTTGATGACA ACGGTCAGAT TCAACTCAAG CACGGGGCCG TGCAAACGAT GAACATTGGC
CGCAGCACGA ACGCCGGAAA CGCTTATATC AAAGGCACCG ATTACACCAT TGATATGCTG
ACCGGTAAAA TCACCTGCAT GGGGACCAAC CTGAAACCCG GCGTTCAGGC CTATGTAAAT
TATACTTACG CGGACCCCAC CAAAGTGACT GCCGCCGATA TTGTTGGCGC GGTAAACACG
GCGGGTGACC GTACCGGCAT GAAGCTGTTA CAGGACACCT GGAACCAGTT TGGTTTTTAC
GCAAAGATCC TGATTGCGCC GGTCTTTTGT ACGCAAAACT CGGTCGCCGT TAAGCTTATC
GCTCAGGCAG AAGCGCTGGG AGCCATTACC TACATTGATG CGCCCATCGG CACGACTTTC
CAGCAGGTTT TGGCAGGGCG CGGCCCGCAG GGGGCGATTA ACTTCAATAC CAGTTCCGAT
CGCGCGCGTC TGTGCTATCC GCACGTTAAA GTGTACGACA GCGCGACGGA CAGCGAAGTC
CTGGAGCCGC TCTCCTCCCG CGCCGCTGGC CTGCGTGCCA AAGTGGATCT GGAAAAAGGC
TTCTGGTGGA GCAACTCCAA TCAGGAAATT CAGGGCATTA CCGGCGTAGA GCGCTCGCTG
TCAGCGATGA TCGACGATCC GCAAAGCGAA GTGAATCAAC TGAATGAAAA CGGCATCACC
ACCATCTTCA ACAGCTATGG CTCCGGTTTG CGCCTGTGGG GCAACCGTAC CGCCGCCTGG
CCGACGGTTA CTCATATGCG TAACTTTGAG AACGTGCGCC GTACCGGCGA TGTAATCAAC
GAATCGATTC GTTATTTCAG CCAGCAGTAT ATGGATATGC CGATAAACCA GGCGCTGATC
GACGCGCTGA CCGAATCGGT GAACACCTGG GGCCGCAAGC TGATTGCCGA CGGCGCGTTG
TTGGGCTTTG AATGCTGGTA CGACCCGGCG CGTAACGAAC AGACTGAACT GGCAGCCGGG
CATCTGTTGC TGAGCTACAA ATTCACCCCG CCGCCGCCAC TGGAACGTCT GACGTTTGAA
ACCGAAATTA CCTCTGAATA TTTAGTTTCT CTGGAGAGCA ATCGCTAA
 
Protein sequence
MAANYLHGVE TIEIETGPRP VKAVKSAVIG LIGTAPCGPV NQPTLCLSES DAAQFGPGLA 
NFTIPQALKA IYDHGAGTVV VINVLNPAVH KSTIPSETVK VDDNGQIQLK HGAVQTMNIG
RSTNAGNAYI KGTDYTIDML TGKITCMGTN LKPGVQAYVN YTYADPTKVT AADIVGAVNT
AGDRTGMKLL QDTWNQFGFY AKILIAPVFC TQNSVAVKLI AQAEALGAIT YIDAPIGTTF
QQVLAGRGPQ GAINFNTSSD RARLCYPHVK VYDSATDSEV LEPLSSRAAG LRAKVDLEKG
FWWSNSNQEI QGITGVERSL SAMIDDPQSE VNQLNENGIT TIFNSYGSGL RLWGNRTAAW
PTVTHMRNFE NVRRTGDVIN ESIRYFSQQY MDMPINQALI DALTESVNTW GRKLIADGAL
LGFECWYDPA RNEQTELAAG HLLLSYKFTP PPPLERLTFE TEITSEYLVS LESNR