Gene SeAg_B2850 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeAg_B2850 
Symbol 
ID6794868 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Agona str. SL483 
KingdomBacteria 
Replicon accessionNC_011149 
Strand
Start bp2794696 
End bp2796462 
Gene Length1767 bp 
Protein Length588 aa 
Translation table11 
GC content52% 
IMG OID642777022 
Producttail fiber domain protein 
Protein accessionYP_002147636 
Protein GI197248971 
COG category[R] General function prediction only 
COG ID[COG5301] Phage-related tail fibre protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGCAA AATTTTATAC CCTGCTGACG GAGATCGGCG CGGCGAAACT GGCAAGCGCC 
GCCGCGCTCG GTGTCCCGTT GAAAATTACC CATATGGCGG TGGGTAGCGG TGGTGGTGTG
CTGCCCACAC CCAACGCGCA ACAGACCGCA TTAGTTGCTG AGGAGCGCCG CGCAGCGCTG
AATATGCTGT ATATCGACCC GCAGAACAGC AGCCAGATTA TTGCTGAGCA GGTGATCCCG
GAAAATGAGG GCGGGTGGTG GATTCGTGAA GTCGGCCTGT TTGATGAAAC CGGTGCGCTG
ATCGCTGTGG GTAACTGCCC GGAGAGCTAC AAGCCGAAGC TGGTAGAAGG GAGCGGACGT
ACGCAGACCG TGCGCATGGT ACTGATTACC AGCAGCACCG ATAACATCAC CCTGAAAATT
GACCCTGCAG TAGTGCTGGC AACCCGTAAA TATGCGGATG ATAAGGCGCT GGAGCTGAAG
GTATATGTAG ACGACCTGAT GGCAAAACAT CTTGCTGCAG TAGATCCACA TACGCAGTAC
GCGCCTAAAG TTAGCCCGTC GTTCACAGGT ACACCAAGAG CGCCAACGCC AGCGGCAGGA
AATAACAGCA GCCAGATTGC TAATACGGCA TTTGTTCAGG CAGCGATAGC AGCACTGGTT
GCGTCATCGC CAGCGGCGCT GGATACATTA AATGAACTGG CTGCTGCGCT GGGGAATGAT
CCAAACTTTG CGACAACAGT GATGAATGCG CTGGCAGGAA AACAACCTCT CGATACCACG
CTGACGAATC TGAGTGGAAA AGACAATGCA GGGATTCTCC AATACCTCGG TTTAAAAGAC
GCACTTTTAA AAGGTGACGG GCGATTCCTT GCGGGAACGT TTGTCAGTGA CGCAATTGAC
CGAACATCAA TTGGTGCCAG AGCGGCTACA GGCTGTCAGT TTATGCGCGC ACATCAGGCA
CCTGATGCGC CAGACCAGGT AAGTTTCTGG CAAATTATTA CCCTTAGCGA GGTGATAAGT
CCGACCACTG TTGTGGATGT TCTTGCAGTC AGTGGCAATA ACGTATTGTT TGGTCACGGT
ACAGGAGCGG GTATTACCTC ATGGCGTCAA GTGGCGATGC TGGAGGGAGG CGCCTTTACG
GGGGGTATTT CTGCACCAAA TATGCGTGGT GATACCCTGG TTACAGTTGG GGATGGCACT
GGTGGGATGG CTAAAGGTGA CGTTGATGGT GCAGGTTTTA ATGGTAACAA TCTGAACATT
AAGTCATGGA ATGGTATCGG ATTTCAGAAC TCAGAAGACC TGGCTATCCG GGCATATATC
AGCACCCGAC TCGGTGTTAT CGCAGCTGCT GAAAATTTGC AGGCCGGAAA TGCGATATTC
AACAAAAACG GCGATGTTTA CGGCGATATA TGGGGCACAG GCAGCGGGCC TGGCTGGTTG
AGTGCGTATA TTGCAGGCAG ACCGTTACGA CAATACATCA CCATGGTCGG TGTGTACCAG
AACGACAAAA CAAAGCCATT TATGCTTCAT GATGATGGTT CTGGTGTATT CCTTGCTACA
ACTGACATGC TAAGTGGGTA TGTTCAGTCA ATTCGATTCG GTGCCGTTGA GCATGGAAAC
GTATATCGTT CGCCCGGATT TGCAGACCAA TTAGGTTACG TCATTACAGG TGTTGAGAAT
GGAGACTCGA ACGAAACACC AGACAGGATC CAACGACGCT TGTTACAGCT TAAAGTGAAT
GGTCAGTGGT ATACGGTGGG GGCATAA
 
Protein sequence
MSAKFYTLLT EIGAAKLASA AALGVPLKIT HMAVGSGGGV LPTPNAQQTA LVAEERRAAL 
NMLYIDPQNS SQIIAEQVIP ENEGGWWIRE VGLFDETGAL IAVGNCPESY KPKLVEGSGR
TQTVRMVLIT SSTDNITLKI DPAVVLATRK YADDKALELK VYVDDLMAKH LAAVDPHTQY
APKVSPSFTG TPRAPTPAAG NNSSQIANTA FVQAAIAALV ASSPAALDTL NELAAALGND
PNFATTVMNA LAGKQPLDTT LTNLSGKDNA GILQYLGLKD ALLKGDGRFL AGTFVSDAID
RTSIGARAAT GCQFMRAHQA PDAPDQVSFW QIITLSEVIS PTTVVDVLAV SGNNVLFGHG
TGAGITSWRQ VAMLEGGAFT GGISAPNMRG DTLVTVGDGT GGMAKGDVDG AGFNGNNLNI
KSWNGIGFQN SEDLAIRAYI STRLGVIAAA ENLQAGNAIF NKNGDVYGDI WGTGSGPGWL
SAYIAGRPLR QYITMVGVYQ NDKTKPFMLH DDGSGVFLAT TDMLSGYVQS IRFGAVEHGN
VYRSPGFADQ LGYVITGVEN GDSNETPDRI QRRLLQLKVN GQWYTVGA