Gene SNSL254_A4381 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A4381 
Symbol 
ID6485511 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp4253012 
End bp4254754 
Gene Length1743 bp 
Protein Length580 aa 
Translation table11 
GC content53% 
IMG OID642739623 
Productgp19 
Protein accessionYP_002043317 
Protein GI194445827 
COG category[R] General function prediction only 
COG ID[COG5301] Phage-related tail fibre protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0427744 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value0.000225609 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGCACGA AATTTAAAAC CGTTATCACC ACTGCCGGAG CCGCTAAGCT TGCTGCAGCC 
ACCATGCCGG GCGGTAAGAA AATAAATCTT AACGTTATGG CTGTTGGTGA CGGCGGCGGA
AAGCTGCCGG ATCCTGATGC CGGTCAGACG CAGCTTGTTA ATGAGGTCTG GCGTCATACT
CTGAATAAAA TCAGCCAGGA CAACCGGTAC AGTAATTACA TTGTGGCCGA GCTGGTTATT
CCGCCGGAGG TGGGCGGCTT CTGGATGCGT GAGCTTGGCC TTTACGACGA TGAAGGGACG
CTGATTGCTG TTGCCAATAT GGCCGAAAGC TACAAGCCAG AACTGGCTGA GGGCTCAGGG
CGTGCGCAGA CATGCCGCAT GGTCATTATT GTCAGCAGTG TCGAGTCTGT GGCGCTGTCC
ATTGACTCAA CGATGGTGAT GGCGACGCAG GATTATGTCG ACGACAGGCT CGCCGAACAT
GAAAAATCCC GTCGTCATCC TGATGCCACT CTTAAAGAAA AAGGGTTTAC TCAGCTCAGT
AACGCGACTG ACAGCGAGTC TGAAACGCTC GCAGCGACGC CGAAAGCTGT TAAGGCAGCA
TATGATCTTG CTGACGCGAA ATATACGGCT CAGGACGCCA CCACAACGCG TAAAGGGATT
GTACAACTAA GTAGTGTAAC CGACAGTAAT GATGAAAATC AGGCTGCCAC GCCAAAAGCG
GTCAAAATTG CGATGGACAA CGCCAACAAG CGCCTTGCCA AAGAACGCAA CCTCGCTGAC
CTGACAAACA TCCAGCAGGC CCGTCAGTCC CTCCAGCTTG GCAACAGCGC TACACTCAAT
GTCGGCACCA CACCAGACAC TGTAGCTGCA GGTGACGACG CCCGCATTAT CACCACCAAA
AAAGCCATTG ACGACACCCA GATCGGTCTT GGTGCTCAGC CCGTTATGTG GGTAAGCTCC
GCCGATGATT TGAGCAGCCT GCCGTCTGGC GCACGCCGGT TTGCCAGCAA TAAAGTTCCG
GCAACAATAT TGCCGGTAAA CGATTATGTT TTCCTGGAAG TGATTGCCAA ACGCGATTGC
GTAGACGGCT GCGCCGTTCT GATAACAGAC TCAATTGGTA ACACCTGGAT TGGCGCGCGC
TGGGACGCAA CCAATGGTTC CGGTTTTACC TGGCGTCCCA TGATGTCGTG TCCGCCCGGT
GTTCCCCTTC CGTGGCCATC TGACACTATC CCTGCCGGTT ATGCCCTGAT GCAGGGGCAG
GCATTTGATA AGAACGTTTA TCCCTTACTG GCAATAGCAT ATCCATCCGG CACTATTCCG
GACATGCGAG GCTGGACAAT CAAAGGTAAG CCTGTCAGTG GACGTGCAGT GCTGTCGCAA
GAACTGGACG GTAACAAATC GCACAGCCAC GGCGCGCGGG CGCTGGATAC CGATCTGGGA
ACGAAAGGTA CGTCGTCATT TGATTACGGA ACGAAGAGCT CTAATACAAC AGGCGGTCAT
AACCATTCAG CGGGCGGCAC ATACGGTGGT GATTCAATCG GTGGAAAAGC TCGCGTTCAG
CGTGATGGCA ATGACCAGTT AACAAGCTGG AATGGCGATC ACGCACATAC CACATGGATT
GGCCCGCATG ACCACACTGT ATATATCGGC CCACACGGAC ACGTCGTTAT TGTGGACGCA
GACGGTAATG CGGAAACCAC GGTTAAAAAC ATTGCATTTA ACTACATAGT GAGGCTGGCA
TAA
 
Protein sequence
MSTKFKTVIT TAGAAKLAAA TMPGGKKINL NVMAVGDGGG KLPDPDAGQT QLVNEVWRHT 
LNKISQDNRY SNYIVAELVI PPEVGGFWMR ELGLYDDEGT LIAVANMAES YKPELAEGSG
RAQTCRMVII VSSVESVALS IDSTMVMATQ DYVDDRLAEH EKSRRHPDAT LKEKGFTQLS
NATDSESETL AATPKAVKAA YDLADAKYTA QDATTTRKGI VQLSSVTDSN DENQAATPKA
VKIAMDNANK RLAKERNLAD LTNIQQARQS LQLGNSATLN VGTTPDTVAA GDDARIITTK
KAIDDTQIGL GAQPVMWVSS ADDLSSLPSG ARRFASNKVP ATILPVNDYV FLEVIAKRDC
VDGCAVLITD SIGNTWIGAR WDATNGSGFT WRPMMSCPPG VPLPWPSDTI PAGYALMQGQ
AFDKNVYPLL AIAYPSGTIP DMRGWTIKGK PVSGRAVLSQ ELDGNKSHSH GARALDTDLG
TKGTSSFDYG TKSSNTTGGH NHSAGGTYGG DSIGGKARVQ RDGNDQLTSW NGDHAHTTWI
GPHDHTVYIG PHGHVVIVDA DGNAETTVKN IAFNYIVRLA