Gene SeD_A3031 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A3031 
Symbol 
ID6875365 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp2928740 
End bp2929912 
Gene Length1173 bp 
Protein Length390 aa 
Translation table11 
GC content58% 
IMG OID642786061 
Productmajor tail sheath protein 
Protein accessionYP_002216707 
Protein GI198243423 
COG category[R] General function prediction only 
COG ID[COG3497] Phage tail sheath protein FI 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value0.0023583 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCTCAGG ATTACCACCA CGGGGTGCGC GTTGTTGAAA TCAACGAAGG CACCCGACCT 
ATTACCACGG TGAGCACTGC CATCGTGGGC ATGGTCTGCA CCGGCGATGA TGCTGATGCG
TCCGTGTTCC CCCTCAATAA GCCGGTCCTG CTGACTGATG TGCTCACCGC CAGCGGTAAA
GCGGGGGAGT CCGGCACGCT GGCCCGCTCG CTGGACGCGA TTGCAGATCA GGCAAAACCC
GTAACTGTCG TTGTGCGTGT GGCGCAGGGC GAAACCGAAG CGGAAACCAC CTCCAATATT
ATCGGCGGCG TAACTCCCGA CGGTAAGAAA ACGGGCATGA AAGCGCTACT GTCGGCGCAG
TCGCAGCTCG GTGTTAAGCC GCGCATTCTT GGGGTGCCGG GACATGACAC TCAGGCCGTT
GCTACTGAAC TGCTGGGCGT GGCGCAAAGC TTGCGCGGGT TTGCCTACCT TGCTGCTAAT
GGCTGCAAAA CGGTGGAGGA AGCTATTGCC TATCGCGAGA ATTTCAGTCA GCGCGAGGGA
ATGCTGATCT GGCCTGACTT CATCAACTTT GACACCGTGC TGAAAGCAGA CGCCACGGCT
TACGCCTCCG CCCGTGCGCT CGGCCTGCGT GCCAAAATCG ACGAGCAGAT TGGCTGGCAT
AAAACCCTGT CCAATGTGGG TGTGAACGGT GTCACCGGCA TTTCCGCTGA TGTGTTCTGG
GATCTGCAGG ACCCGGCAAC CGATGCGGGA CTGCTGAACA AAAATGACGT CACCACATTG
ATCCGCAAAG ACGGCTTCCG CTTCTGGGGT TCCCGTTGTC TCAGTGACGA TCCGCTGTTT
GCCTTTGAGA ACTACACCCG CACGGCGCAG GTGCTGGCTG ACACTATGGC GGAGGCGCAC
ATGTGGGCGG TGGATGGCGT GCTTAATCCG TCGCTGGCCC GCGACATTAT TGAAGGACTA
CGCGCCAAGA TGCGCAGTCT GGTCAACCAG GGATACCTGA TTGGTGGTGA CTGCTGGCTG
GATGAGTCCG TTAACGATAA AGACACCCTT AAAGCCGGGA AACTGACCAT TGATTATGAC
TACACTCCGG TGCCTCCGCT TGAAAACCTG ATGCTGCGCC AGCGCATCAC CGATCGTTAC
CTGGTCGATT TTGCCAGCCG TGTCGCTGCA TAA
 
Protein sequence
MAQDYHHGVR VVEINEGTRP ITTVSTAIVG MVCTGDDADA SVFPLNKPVL LTDVLTASGK 
AGESGTLARS LDAIADQAKP VTVVVRVAQG ETEAETTSNI IGGVTPDGKK TGMKALLSAQ
SQLGVKPRIL GVPGHDTQAV ATELLGVAQS LRGFAYLAAN GCKTVEEAIA YRENFSQREG
MLIWPDFINF DTVLKADATA YASARALGLR AKIDEQIGWH KTLSNVGVNG VTGISADVFW
DLQDPATDAG LLNKNDVTTL IRKDGFRFWG SRCLSDDPLF AFENYTRTAQ VLADTMAEAH
MWAVDGVLNP SLARDIIEGL RAKMRSLVNQ GYLIGGDCWL DESVNDKDTL KAGKLTIDYD
YTPVPPLENL MLRQRITDRY LVDFASRVAA