Gene SeD_A1123 details


Gene Information

Locus tag: SeD_A1123
Symbol:
ID: 6874481
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853
Kingdom: Bacteria
Replicon accession: NC_011205
Strand:
Start bp: 1115675
End bp: 1117384
Gene Length: 1710 bp
Protein Length: 569 aa
Translation table: 11
GC content: 49%
IMG OID: 642784308
Product: side tail fiber protein
Protein accession: YP_002214982
Protein GI: 198244544
COG category:
COG ID:
TIGRFAM ID:
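
The gene length and protein length listed above follow from the start and end coordinates. The sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive, which is consistent with the listed 1710 bp, and that the final codon is a stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return length_bp // 3 - 1


length = gene_length_bp(1115675, 1117384)
print(length, protein_length_aa(length))  # prints: 1710 569
```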


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 74
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATTATTG GATTCGGAAA TAATGTCGTC TCCTCACTGG CGGCTGATAT TACCGCCAGC 
CAGACGACCA TTCAGGTGAT GCCTGGTGTG GGAGCGATGT TTGCTAATTT GCTGACCAGC
GATTATGCAA ACAGCTCAAA CCCTCTTAAA ACTTACGCCA AAATTACACT GACAGACGCA
AAAGAAACAG TTTTTGAGGT ATGCCATCTG ACAGCAGTTA ATAATGACAT GCTGACGGTT
ATTCGCGGTC AGGAAGGTAC AACAGCGAAG GGATGGTCAC TGAATGACGT TATAGCGAAT
TTTGCGACGC GAGGATCTGA AAATCAGTTT GTACAAATTG AAGAGCTCCA GAGTGGGCAT
TATGTCGCTG GTGTGGCCGG AGGTACAGAA AATAATCTGA CGCTGGAGTT ACCAGCAACT
TATTTCGTCA ATGGTGGAGT TGACTGGACA TTGCGCACTC CACTTGTGGT TATTCCGGCG
CTAAACAATA CCGGAGCCAG CACTCTGCAA CTGACGATGG GAGGACGTGT GCTTGGCATA
TTCCCACTAT ACAAGGGGAA TAAAGCAGAG TTATCGGCCA ATGATATTAT TAAAGATATT
CCTGTCTTAT GCGTTCTGGA TAATACAAAA ACCTATTTTT CTGTGCTTAA TCCCCTGGAG
ATTTATTTGG GATCACGGTA TTTGCAGAAG GACCAGAACC TGTCCGACGT ACCGGATAAG
GCCAAAGGTC GCTCCAGTCT TGAGGTCTAC AGCAAAACCG AAAGTGATGA AAACTACATG
GCTAAAAGCC AGTGTGGTGC GGATATCCCG AATAAGCCGC TGTTTGTACA AAATATCGGA
GCGCTCCCTG CATCAGGTAC GGCTGTTGCA GCGAACAGAC TGGCATCACG CGGCGCGCTT
CCGGCACTGA CTGGTGCGAC AAGAGGCAGC GATAGCGGCC TGATAATGGG CGAGGTCTAC
AACAATGGCT ATCCGACGCA ATACGGAAAT ATTTTACGTC TGACCGGAAC CGGTGATGGG
GAAATCCTCA TTGGCTGGAG CGGGACAAAC GGTGCGCCAG CGCCCGCATA TATTCGCAGT
CATCGAGATA CCGCCGATGC TGAGTGGTCC GAATGGGCGA TGCTCTACAC CTCACTAAAT
CCGCCACCGA ATTCGTATCC AGTAGGTGCG GCGATAGCAT GGCCGTCTGA TGCTACCCCA
GCCGGTTACG CCCTGATGCA GGGGCAATCG TTTGATAAAT CTGCTTACCC GTTACTGGCT
ATAGCGTATC CGTCCGGCAT TATCCCTGAC ATGCGGGGCT GGACAATAAA GGGTAAGCCC
ATCAGTGGAC GTGCTGTACT GTCGCAAGAA ATGGACGGCA ACAAATCGCA CAGTCACAGC
GCCAGAGCGC AGGATACTGA CTTAGGGACA AAATCTACCT CATCCTTTGA TTACGGCACG
AAATCGACCA ATACCACGGG CAATCATACT CACCAGTTCG GCGGTTATAT CAATTCATAC
TGGGGAGATT CCAATCACAC CTCATTTCAG CCAGGAGGTG GTGCATGGAC ACAGGCCGCT
GGCGACCATG CACATACAGT TTATATCGGA GGACATGAGC ACACCATGTA TATAGGTCCA
CACGGACACG TCGTTATTGT GGACGCAGAC GGTAATGCGG AAACCACGGT TAAAAACATT
GCATTTAACT ACATAGTGAG GCTGGCATAA
 
Protein sequence
MIIGFGNNVV SSLAADITAS QTTIQVMPGV GAMFANLLTS DYANSSNPLK TYAKITLTDA 
KETVFEVCHL TAVNNDMLTV IRGQEGTTAK GWSLNDVIAN FATRGSENQF VQIEELQSGH
YVAGVAGGTE NNLTLELPAT YFVNGGVDWT LRTPLVVIPA LNNTGASTLQ LTMGGRVLGI
FPLYKGNKAE LSANDIIKDI PVLCVLDNTK TYFSVLNPLE IYLGSRYLQK DQNLSDVPDK
AKGRSSLEVY SKTESDENYM AKSQCGADIP NKPLFVQNIG ALPASGTAVA ANRLASRGAL
PALTGATRGS DSGLIMGEVY NNGYPTQYGN ILRLTGTGDG EILIGWSGTN GAPAPAYIRS
HRDTADAEWS EWAMLYTSLN PPPNSYPVGA AIAWPSDATP AGYALMQGQS FDKSAYPLLA
IAYPSGIIPD MRGWTIKGKP ISGRAVLSQE MDGNKSHSHS ARAQDTDLGT KSTSSFDYGT
KSTNTTGNHT HQFGGYINSY WGDSNHTSFQ PGGGAWTQAA GDHAHTVYIG GHEHTMYIGP
HGHVVIVDAD GNAETTVKNI AFNYIVRLA
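
The sequences above are printed in blocks of 10 characters, 60 per line. Before computing anything from them (for example a GC percentage to compare with the listed GC content), the spacing has to be removed. The following is a minimal sketch, with hypothetical helper names and using only the first line of the gene sequence as sample input.

```python
def clean_sequence(block_text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(block_text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, copied as displayed (six blocks of 10).
first_line = "ATGATTATTG GATTCGGAAA TAATGTCGTC TCCTCACTGG CGGCTGATAT TACCGCCAGC"
seq = clean_sequence(first_line)
print(len(seq))  # prints: 60
print(round(gc_percent(seq), 1))
```

Passing the full gene sequence through the same two functions gives a value to compare with the GC content field in the record.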