Gene SNSL254_A2794 details


Gene Information

Locus tag: SNSL254_A2794
Symbol:
ID: 6484119
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Newport str. SL254
Kingdom: Bacteria
Replicon accession: NC_011080
Strand:
Start bp: 2734758
End bp: 2737130
Gene Length: 2373 bp
Protein Length: 790 aa
Translation table: 11
GC content: 58%
IMG OID: 642738118
Product: side tail fiber protein
Protein accession: YP_002041852
Protein GI: 194443605
COG category: [M] Cell wall/membrane/envelope biogenesis
              [R] General function prediction only
COG ID: [COG3064] Membrane protein involved in colicin uptake
        [COG5301] Phage-related tail fibre protein
TIGRFAM ID:
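
The start, end, gene length and protein length fields in this record are consistent with one another, which can be checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a final stop codon that is not counted in the protein length; the function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2734758, 2737130)
print(length)                  # 2373
print(protein_length(length))  # 790
```

Both results match the Gene Length and Protein Length values listed in the record.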


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 68
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCAGTAC TTATTTCCGG CGTACTGAAA GATGGTACGG GAACGCCGGT ACAGAACTGC 
ACCATTCAGC TGAAGGCCTG CCGGACCAGT ACGACGGTGG TCGTGAATAC GGTGGCATCG
GAAAATCCGG ATGACGCCGG GCGCTACAGC ATGGATGTGG AGCAGGGGCA GTACACAGTC
ACGCTCCTGG TGGAAGGGTA TCCCCCGTCA CATGCCGGAG TTATTACGGT CTACGATGAT
TCAAAACCGG GCACCCTGAA TGATTTTCTG GGGGCCATGA CGGAAGACGA CGTCCGCCCG
GAGGCGCTGC GACGTTTTGA GGCGATGGTG GAAGAAGTTG CCCGCCAGGC ATCGGAGGCA
TCGCGGAATG CCACCGCCGC AGGGCAGGCA TCTGAACAGG CGCAGACATC AGCAGGTCAG
GCAGCGGAAA GCGCCACGGC AGCAGTGAAT GCAGCCGGAG CGGCAGAAGC ATCAGCCACA
CAGGCAGCCT CATCCGCAGC ATCTGCGGAG AGCAGCGCAG GTACGGCGAC CACAAAAGCC
GGGGAGGCAT CAGCCAGCGC GGCGTCGGCT GACACAGCCA GAACGGCGGC AGCCGCATCG
GCAGCCGCAG CGAAAACATC TGAAGCGAAT GCAGATGTCT CCCGTACTGC CGCCGGCGAT
TCAGCTGCTG CCGCAGCCGC CAGCGCGACG GCGGCGCAGA CATCAGCAGC GCGCGCCGGA
GCATCCGAAA CCGCCGCGAA GACGTCAGAA ACGCAGGCGG CTTCCAGTGC CGGTGATGCA
GGTGCGTCAG CCACTGCGGC GGCAGCGTCG GAAAAGGCGG CAGCCGCATC GGCAGCCGCA
GCAAAAATAT CTGAGACAAA CGCTGCAACG TCAGCAAGTA CAGCAGCGGC CAGCGCAACA
GCCGCCTCGT CATCAGCATC GGAGGCATCC AATCACGCCG CCGCATCTGA TACCAGCGCA
TCACTGGCGG CGCAAAGCAG TACTGCTGCC GGAGCAGCAG CCACCAGAGC AGAAGATGCC
GCAAAACGGG CAGAAGACAT CGCGGACGTG ATTTCCCTGG AAGATGCCAG CCTGACGAAA
AAAGGTATCG TTAAGTTAAG CAGCGCCACG GACAGTGACA GCGAAGCGCT GGCAGCCACG
CCAAAGGCGG TCCATGCTGT CATGGACGAG GTACAGACCA AAGCGCCGCT GGACAGTCCG
GTATTCACTG GAACGCCGAC CACACCGACG CCGCCAGATG ACGCTAAGGG ACTTCAGACT
GCAAACGCTG AGTTTGTTCG TAAACTGATT GCTGCACTGG TCGGTTCCGT ACCTGAGTCG
CTGGATACGC TGCAGGAACT GGCTGACGCG CTGGGTAACG ATCCTAATTT TGCTACCACT
ATCACTAACA TGATTGCGGG CAAGCAGCCG CTGGACGATA CACTGACGGC GCTGTCAGGA
AAAAGCATTG AAGGTCTTAT CGAATACGTT GGTTTACGGA GCACAATTGA TAAGGCTGCT
GGTGCGTTGC CTGCTGGTGG TACGGCTGTC GCAGCGAACA GGCTTGCATC ACGCGGCGCG
CTTCCGGCAC TGACTGGCAC GACAAGAGGC AGCGATGGCG GCCTGATAAT GGGCGAGGTC
TACAACAATG GCTATCCGAC GCAATACGGA AATATTTTAC GTCTGACCGG AACCGGTGAT
GGGGAAATCC TCATTGGCTG GAGCGGGACA AATGGTGCGC CAGCGCCCGC ATATATTCGC
AGCCATCGAG ATACCGCCGA TGCTGAGTGG TCCGAATGGG CAATGCTTTA CACCACACTA
AACCCACCTC CGGATTCGCA TCCAGTAGGG GCGGCGATTG CATGGCCATC TGATGCTACT
CCGGCAGGTT ACGCTCTGAT GCAGGGGCAG TCCTTCGATA AATCTGCTTA CCCGTTACTG
GCTATAGCGT ATCCGTCCGG CGTTATCCCT GACATGAGAG GCTGGACAAT AAAGGGTAAG
CCCATCAGTG GACGTGCCGT ATTGTCGCAA GAAATGGACG GCAATAAATC GCACTCGCAC
ACCGCGCGGG CGCAGGATAC TGACTTAGGG ACAAAATCTA CCTCATCTTT TGATTACGGC
ACGAAATCGA CCAATACCAC GGGCAACCAT ACTCACCAGT TCGGCGGTTA TATCAATTCA
TACTGGGGAG ACTCCAATCA CACCTCATTT CAGCCTGGAG GTGGTGCATG GACACAGGCC
GCTGGCGACC ATGCGCATAC AGTTTATATC GGAGGACACG AGCACACCAT GTATATCGGT
CCACACGGTC ACGTCGTTAT TGTGGACGCA GACGGTAATG CGGAAACCAC GGTTAAAAAT
ATTGCATTTA ACTACATAGT GAGGCTGGCA TGA
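
The GC content field can be recomputed from a sequence printed in this layout. The sketch below assumes the sequence is pasted in as a string with the same space-separated 10-base groups; `gc_content` is a hypothetical helper name, and how the database rounds the reported percentage is not stated in the record.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases, ignoring whitespace used for grouping."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First 10-base group of the gene sequence above: 5 of 10 bases are G or C.
print(gc_content("ATGCCAGTAC"))  # 50.0
```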
 
Protein sequence
MPVLISGVLK DGTGTPVQNC TIQLKACRTS TTVVVNTVAS ENPDDAGRYS MDVEQGQYTV 
TLLVEGYPPS HAGVITVYDD SKPGTLNDFL GAMTEDDVRP EALRRFEAMV EEVARQASEA
SRNATAAGQA SEQAQTSAGQ AAESATAAVN AAGAAEASAT QAASSAASAE SSAGTATTKA
GEASASAASA DTARTAAAAS AAAAKTSEAN ADVSRTAAGD SAAAAAASAT AAQTSAARAG
ASETAAKTSE TQAASSAGDA GASATAAAAS EKAAAASAAA AKISETNAAT SASTAAASAT
AASSSASEAS NHAAASDTSA SLAAQSSTAA GAAATRAEDA AKRAEDIADV ISLEDASLTK
KGIVKLSSAT DSDSEALAAT PKAVHAVMDE VQTKAPLDSP VFTGTPTTPT PPDDAKGLQT
ANAEFVRKLI AALVGSVPES LDTLQELADA LGNDPNFATT ITNMIAGKQP LDDTLTALSG
KSIEGLIEYV GLRSTIDKAA GALPAGGTAV AANRLASRGA LPALTGTTRG SDGGLIMGEV
YNNGYPTQYG NILRLTGTGD GEILIGWSGT NGAPAPAYIR SHRDTADAEW SEWAMLYTTL
NPPPDSHPVG AAIAWPSDAT PAGYALMQGQ SFDKSAYPLL AIAYPSGVIP DMRGWTIKGK
PISGRAVLSQ EMDGNKSHSH TARAQDTDLG TKSTSSFDYG TKSTNTTGNH THQFGGYINS
YWGDSNHTSF QPGGGAWTQA AGDHAHTVYI GGHEHTMYIG PHGHVVIVDA DGNAETTVKN
IAFNYIVRLA
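
The protein sequence can be checked against the gene sequence by translating codon by codon. The sketch below is a simplified illustration: it uses the standard codon assignments, which translation table 11 shares for every non-start codon, and it does not handle alternative start codons or ambiguous bases. This gene begins with ATG, so that simplification does not affect it. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...); '*' is stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate DNA in frame 1 up to the first stop codon.

    Whitespace used for grouping is ignored; ambiguous bases raise KeyError.
    """
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence above (both start in frame).
first = "ATGCCAGTAC TTATTTCCGG CGTACTGAAA GATGGTACGG GAACGCCGGT ACAGAACTGC"
last = "ATTGCATTTA ACTACATAGT GAGGCTGGCA TGA"
print(translate(first))  # MPVLISGVLKDGTGTPVQNC
print(translate(last))   # IAFNYIVRLA
```

The two outputs match the first 20 and the last 10 residues of the protein sequence listed above.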