Gene SeSA_A2015 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeSA_A2015 
Symbol 
ID6518487 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633 
KingdomBacteria 
Replicon accessionNC_011094 
Strand
Start bp1940354 
End bp1941682 
Gene Length1329 bp 
Protein Length442 aa 
Translation table11 
GC content53% 
IMG OID642747101 
Productside tail fiber protein 
Protein accessionYP_002114902 
Protein GI194736320 
COG category[R] General function prediction only 
COG ID[COG5301] Phage-related tail fibre protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.478107 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTGAGG TACAGACCAA AGCCCCGCTG GACAGTCCAG CACTGACCGG TACGCCAACG 
GCACCAATGC CGGAAACCAC AGCTGCAGGT ATTGAAATTG CCACGGCAGC GTTTGTGGCT
GCGAAAGTGG CGCAGTTGGT TGGTTCTGCG CCGGAAGCGC TGGACACCCT GCAGGAACTG
GCTGACGCGT TGGGAAACGA TCCGAACTTT GCCATCACGG TACTGAATAA ACTGGCGGGC
AAGCAGCCGC TGGACGAAAC CCTGACGGCG CTGTCAGGAA AAAGCGCTGA TGGTCTTATC
GAATATGTTG GTTTACGGGA AACGATAAAT CACGCCGCCG ATGCGTTACA AAAATCACAG
AATGGTGGCG ATATTCCGGA AAAGCCGCTG TTTGTACAAA ATATCGGAGC GCTCCCTGCA
TCAGGTACGG CTGTTGCAGC GAACAGACTG GCATCACGCG GCGCGCTTCC GGCACTGACT
GGTACGACAA GAGGCAGTGA TAGCGGCCTG ATAATGGGCG AGGTTTACAA TAACGGTTAT
CCAACGCAAT ACGGGAATAT TTTGTGTCTG ACCGGAATCG GTGATGGAGA AATATTAATC
GGATGGCGTG GGGTTAATGG TGCTCCTGCG TCTGCATATA TTCGCAGCCA TCGAGATACC
GCCGACGCTG AGTGGTCAGA ATGGGCGATG TTCTACACCT CACTAAATCC GCCACCGGAT
TCGTATCCAG TAGGTGTGGC GATAGCATGG ACGTCTGATG CTACTCCGGC AGGTTACGCT
CTGATGCAGG GGCAATTGTT TGATAAATCT GCTTACCCGT TACTGGCTAT AGCGTATCCG
TCCGGCATTA TCCCTGACAT GCGAGGCTGG ACAATCAAAG GTAAACCCAC CAGTGGGCGA
GCTGTACTTT CTCAGGAGAT GGACGGCAAC AAATCGCACT CGCACACCGC GCGGGCGCAG
GATACCGACT TAGGGACAAA AACAACCGGC AATCAGGTTT ATATCTCCGA TCTTGGTCCG
CTACCTGAAA ACGTCACATC AGTTTCACCA GGTGGTGGAT ACAAAAAATG GGATAGTAAG
GCTCAGGTCT GGGTGAATGA TGAAGCTGCG GAGGCCGCAG CCAGACTTCG TGAAGCTGAA
GGAACGAAAA ACAGACGCCT GCAAATAGCG TCTGAAAAAA TCGCGCCGTT ACAGGATGCA
GTGGATCTGG ACGGAGCAAC CGATAAAGAA AAAGCTTCTC TTCTGGCATG GAGAAAGTAC
CGGGTACAGG TAAACCGTGT TGATACTTTA AAGCCTGTCT GGCCGGAGAA ACCAGCCAGT
AGTTTATAA
 
Protein sequence
MGEVQTKAPL DSPALTGTPT APMPETTAAG IEIATAAFVA AKVAQLVGSA PEALDTLQEL 
ADALGNDPNF AITVLNKLAG KQPLDETLTA LSGKSADGLI EYVGLRETIN HAADALQKSQ
NGGDIPEKPL FVQNIGALPA SGTAVAANRL ASRGALPALT GTTRGSDSGL IMGEVYNNGY
PTQYGNILCL TGIGDGEILI GWRGVNGAPA SAYIRSHRDT ADAEWSEWAM FYTSLNPPPD
SYPVGVAIAW TSDATPAGYA LMQGQLFDKS AYPLLAIAYP SGIIPDMRGW TIKGKPTSGR
AVLSQEMDGN KSHSHTARAQ DTDLGTKTTG NQVYISDLGP LPENVTSVSP GGGYKKWDSK
AQVWVNDEAA EAAARLREAE GTKNRRLQIA SEKIAPLQDA VDLDGATDKE KASLLAWRKY
RVQVNRVDTL KPVWPEKPAS SL