Gene SeAg_B4452 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeAg_B4452 
Symbol 
ID6794129 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Agona str. SL483 
KingdomBacteria 
Replicon accessionNC_011149 
Strand
Start bp4340075 
End bp4341820 
Gene Length1746 bp 
Protein Length581 aa 
Translation table11 
GC content57% 
IMG OID642778543 
Producttail fiber domain protein 
Protein accessionYP_002149109 
Protein GI197251834 
COG category[R] General function prediction only 
COG ID[COG5301] Phage-related tail fibre protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATAATG AGTTTTATAC CCTCCTGACC GACAGGGGAA TGGCGAAAAT CGCCAGCGCC 
CTTGCGGACA AAAAACAGCT ACATCTACAA AAGATGGCGG TTGGCGACGG CGGCGGGCAA
TATTATGAGC CGACCGCCAG CCAGGCCAAA TTACGCCACG AAGTCTGGCG CGGTGAGATG
AATACGCTGA CCGTTGCGCC GAATAATCCC AACTGGCTGA TTGCCGAACT GGTGCTGCCG
GAGGACGTTG GCGGCTGGTA TGTGCGTGAA GTGGGCGTGT TCGACGACGA GGGCGAGCTA
ATCGCCATCG GCAAGTTCCC GGAATCCTAC AAACCGCTGT TGCCCGGCGG CTGCGGCAAG
CAGGTCTGTA TCCGCCTGAT TATGGAGGTC TCCAACACCA CGGCGGTGAC GCTGACGGTC
GATCCGAGCA TTGTGCTGGC GACGCGCGAC TATGTGGATG TCCGGCTGGA CGAGCATGAA
CATTCGACAA ATCACCCGGA TGCGACATTA ACGCAGAAAG GGTTTACACA GCTCAGTAAC
GCCACCGACA GCGATGACGA AACCAAAGCC GCTACGCCGA AGGCGGTAAA AGCGGCGATG
GCGGAAGCGC GTAATCACAC GCATACCTGG AACCAGATCA CTGGCGTTCC GGACGGCACG
CTGACGCAAA AGGGGATTGT TCAGCTTAAC AGTGCGACGG ACAGCACCAG CACAACGGAA
GCGGCAACGC CGAGCGCGGT CAAGGCGGCG ATGGATAAGG CGAATGCGGC AGCTCCGGCG
AACCATACTC ACGTCTGGAA CCAGATTATC GGCGTCCCGG ACGGCACGCT GGCGCAAAAA
GGGATCGTGA AACTTAATAA CGCCACCGAT AGCACCAGCA CCACCGAAGC GGCAACGCCC
AGTGCGGTAA AGGCGGCGAT GGATAAGGCA AACGCGGCGG CCCCGGCCAG CCATATACAC
GCCTGGGGGC AGATCACCGG CGTCCCGGAC GGTACGCTGA CGCAAAAAGG GATTGTGAAG
CTTAATAGCG CCACCGATAG CACCAGCACC ACCGAAGCGG CAACGCCTAG TGCGGTCAAG
GCGGCGTATG ACAAGGCGAG CGCAGCGGCT CCGGCTAATC ATTCCCATTA TCAGTTTTTT
ACGGCTAACG GTACGTTTAC GGTACCTGAT GGGGTGACTC AGGTCTTTGT GGAAATGTTA
GGAGGAGGAG GAGGAGGTGG TGGTGGTGCA GTCACCGATG GAGGATTCGC GGGTGCTAGT
GGTGGTTCGG GAGGAACCTG TGGAAGCACA AATATTTCCA TTGTTCCTGT AACTCCGGGA
GGAAAATACG CGGTTATAGT TGGCGCAGGA GGGGTTGGCG GAGTCGCAGC CAGTCAGTCG
TCTACGGCAC CATCAGGAAT TCACACCTTG GTAACTAGCA CGCCAGGATC CCCCGGTATT
GATGGCGGGG ATTCAATTTT TGTTAATGTT ACCGCCAAAG GTGGTTCAGG AGGAGCCGGC
GGCGTTATCT CAACTGTATC AGTAATAAAC CCAGCCCCAT CTGGTAATGG CGCAGCAGGG
GAAAACTCAT CGTACGGTAC GGGAGGGAGC GGTGGCAGTA ATACTGACGG GGGCAATGCT
GGTGGTTATG GTGCCGGGGG CGGCGGAGGC GCCAGAGGCA AAACCACTGG CAGTGATAAT
ACCTATAGTG GGTCTGGCTT TCCTGGAGGG AAAGGCTCAA ACGGTTTTGT AAAAATTTCA
TGGTGA
 
Protein sequence
MDNEFYTLLT DRGMAKIASA LADKKQLHLQ KMAVGDGGGQ YYEPTASQAK LRHEVWRGEM 
NTLTVAPNNP NWLIAELVLP EDVGGWYVRE VGVFDDEGEL IAIGKFPESY KPLLPGGCGK
QVCIRLIMEV SNTTAVTLTV DPSIVLATRD YVDVRLDEHE HSTNHPDATL TQKGFTQLSN
ATDSDDETKA ATPKAVKAAM AEARNHTHTW NQITGVPDGT LTQKGIVQLN SATDSTSTTE
AATPSAVKAA MDKANAAAPA NHTHVWNQII GVPDGTLAQK GIVKLNNATD STSTTEAATP
SAVKAAMDKA NAAAPASHIH AWGQITGVPD GTLTQKGIVK LNSATDSTST TEAATPSAVK
AAYDKASAAA PANHSHYQFF TANGTFTVPD GVTQVFVEML GGGGGGGGGA VTDGGFAGAS
GGSGGTCGST NISIVPVTPG GKYAVIVGAG GVGGVAASQS STAPSGIHTL VTSTPGSPGI
DGGDSIFVNV TAKGGSGGAG GVISTVSVIN PAPSGNGAAG ENSSYGTGGS GGSNTDGGNA
GGYGAGGGGG ARGKTTGSDN TYSGSGFPGG KGSNGFVKIS W