Gene SeAg_B2844 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeAg_B2844 
Symbol 
ID6797165 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Agona str. SL483 
KingdomBacteria 
Replicon accessionNC_011149 
Strand
Start bp2789047 
End bp2791854 
Gene Length2808 bp 
Protein Length935 aa 
Translation table11 
GC content58% 
IMG OID642777017 
Productphage tail tape measure protein 
Protein accessionYP_002147631 
Protein GI197249129 
COG category[S] Function unknown 
COG ID[COG5283] Phage-related tail protein 
TIGRFAM ID[TIGR01760] phage tail tape measure protein, TP901 family, core region 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.581537 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTGACA ATAACCTGCG TCTGCAGGTG ATTCTTAATG CGGTTGACAA GCTCACCCGC 
CCATTTCGAT CCGCGCAGGC CAGCTCAAAA GAGCTGGCTA CAGCGATTCA GCAAAGCCGC
GCCCGTTTAA AAGAATTAGA TGCTCAGGCG GGCCGCATTG ACGGTTTCCG CAAGGCCAGC
GCGCAGCTGG CAGTCACCGG TAACAGCCTG AAAGCCGCAC GCGAAGAAGC TGCAAAACTT
GCCACGCAAT TCTCTGCCAC CAACCGCCCG ACGGCGGCGC AGGCACGGTT GCTTGAGCAG
GCAAAAAACC GCGTTACGGA GTTACAGAGC AAATATAACG GTCTACGTCA GTCGGTGCAG
CGCCAGCGTC TTGCGCTCAA TGAAGCCGGG CTGGACACGA AAAAGCTCAG TAGTGCGCAG
CGGGAACTGC GGCAGAATGC CGACGAAACC CGGCAGGCCC TGGACCGGCA GCAGAAATCC
CTTAAACGCC TGGGCGAACA GCAGGCGCGA ATGAACGCCG TCCGCGATCA GTATTCACGC
CGTCTTGAGG TGCGGGATCG CATCGCCGGG GCAGGGGCTA CCACTACGGC TGCGGGGGTG
GCAATGGGCG CACCTGTTGT GGCGGCAGTT AAGAGCTACG CCAGCATGGA AGATGCCATG
AAAGGCGTGG CAAAGCAGGT AAACAGGCTG CGGGACGATA ATGGCAACCG CACAAAACAG
TTTTATGACA TGCAGGATGC CATCAAGGCC GCCAGCGAAC AGCTGCCGAT GGAGAACGGC
GCTATAGACT ATGCCGCGCT GGTTGAAGGT GGTGCTCGCA TGGGGGTGAC CAATCAGGAC
GATCCTTACG AAGAGCAGAA ACGTGACCTG CTGGCTTTTG CATCCACGGC GGCAAAAGCG
GCAACGGCCT TTGAGCTGCC CGCAGATGAA CTGGCAGAAG GGCTGGGGAA AATCGCGCAG
CTCTATAAAG TTCCGACGCG CAATATTGAA CAACTGGGCG ATGCGCTGAA CTACCTGGAC
GATAACGCCA TGTCAAAGGG TGGGGACATT ATCAACGTCC TGCAGCGTAT GGGGGGCGTG
GCTGACCGCC TTGACTTCCG AAAGGCCGCG GCGCTGGGTT CAACATTCCT TTCTCTTGGG
GCTGCCCCGG AAATCGCCGC CAGCGCCTCT AATGCCATGG TGCGTGAACT GTCCATTGCC
ACCATGCAAA GTAAACGATT TTTTGAAGGC ATGAATCTGT TGCAACTCAA TCCGGCGGAG
ATTGAAAAGC AGATGACCAC CGATGCCATG GGCACAATTC AGCGGGTTCT GGAGAAGGTC
AACAATCTGC CGCAGGATAA ACGCCTGTCA GCCATGACAA TGATTTTTGG CAAAGAGTTT
GGCGATGATG CGGCAAAGCT GGCTAACAAC CTGCCGGAGC TGCAGCGCCA GCTGAAACTC
ACATCAGGCA GTGGTGCTAA TGGCTCCATG CAGAAAGAAT CCGACATTAA CAAGGATTCA
TTGTCTGCGC AGTGGTTGCT GGTTAAGACG GGCGCGCAAA ACGCTTTCAG CAGCCTGGGG
GAAACGCTGC GCCAGCCGCT GATGGATATT ATGGGCATGG TTAAGCGCGT GACCGGGGCG
TTGCGTCGCT GGGTTGAGCA GAATCCCGTG CTGGCTGGCA CGCTGATGAA AGTGGCGGCA
GCTACGGCAG CCATTACTGT TGGGTTGGGG GCGCTGGCAG TGGCGGTGGC TGCTGTGCTG
GGACCGCTGG CGGTTATCCG GTTTGGCCTG TCCATGCTGT CAGTTAAAGC GTTACCTTCT
GCAGCCGCCG CTGCCACACG TACAGGTAGC GTGCTGCGTC TGTTGATCTC TGGTCCGCTG
GCTTTGCTGC GCGTGGCATT ATTTGCTGTT GGTAGCCTGC TGGGGGCGCT GCTCAGTCCT
GTAGGGCTGG TTGTGGCTGC ACTGGCAGGC GTGGCGCTGG TTATCTGGAA ATACTGGCAG
CCCATTAGTG CATTTCTGGG GGGCGTGGTG GAAGGGTTCA GAGCCGCTGC TGCGCCCATC
AGCGCCGCCT TTGAGCCGCT CAGACCCGTG TTTCAGTGGA TTGGTGACAG GGTGCAGGCC
TTGTGGGGCT GGTTCAATGA TTTACTTACC CCGGTTAAAT CCACTTCCGA AGAACTGAAC
AGCGCAGCTG CAATGGGGCG TCGGTTTGGT GAGGCGCTGG CGGAAGGTCT GAATATGGTG
ATGCACCCAC TTGAGTCACT TAAATCCGGT GTGTCATGGC TGCTGGAAAA GCTCGGTATT
GTCAGTAAGG AGGCGGCAAA GGCGAAACTA CCTGCGCAGG TTACGCAGCA GCAGTCCGCC
ACAGTGAACA GTGACGGCAA AGTGGTGCTG CCGCCAGGCG GGTTCCCGGC TTACGCGGGG
ATGTACGACA CGGGCGGGAT CATTCCACGC GGGCAGTTTG GCATTGTCGG AGAAAATGGC
CCTGAAATTG TGAACGGACC GGCAAATGTT ACCAGCAGGC GGCGTACTGC TGCGCTGGCC
TCTGTCGTTG CTGGCGTGAT GGGGGGAGCT GCGACACCTG CAGAAGCGGC TCCGCTTCAT
CCGTTCAGTT TGCCTGCGAG GGCATACCAG CCCCCGCTTG CTAAGGCAGA TAGCCCGCCG
CCGGTTATTC GTTATGAGAT AAATGCGCCC ATTCATATTG TCGCTCAGCC TGGGCAGAAC
GCGCAGGAAA TTGCCCGTGA AGTGGCACGC CAGCTTGACG AACGGGAGCG CCGGGCCAGG
GCAAAAGCAC GCAGCAATTT CAGCGATCAG GGGGGGTATG AATCATGA
 
Protein sequence
MSDNNLRLQV ILNAVDKLTR PFRSAQASSK ELATAIQQSR ARLKELDAQA GRIDGFRKAS 
AQLAVTGNSL KAAREEAAKL ATQFSATNRP TAAQARLLEQ AKNRVTELQS KYNGLRQSVQ
RQRLALNEAG LDTKKLSSAQ RELRQNADET RQALDRQQKS LKRLGEQQAR MNAVRDQYSR
RLEVRDRIAG AGATTTAAGV AMGAPVVAAV KSYASMEDAM KGVAKQVNRL RDDNGNRTKQ
FYDMQDAIKA ASEQLPMENG AIDYAALVEG GARMGVTNQD DPYEEQKRDL LAFASTAAKA
ATAFELPADE LAEGLGKIAQ LYKVPTRNIE QLGDALNYLD DNAMSKGGDI INVLQRMGGV
ADRLDFRKAA ALGSTFLSLG AAPEIAASAS NAMVRELSIA TMQSKRFFEG MNLLQLNPAE
IEKQMTTDAM GTIQRVLEKV NNLPQDKRLS AMTMIFGKEF GDDAAKLANN LPELQRQLKL
TSGSGANGSM QKESDINKDS LSAQWLLVKT GAQNAFSSLG ETLRQPLMDI MGMVKRVTGA
LRRWVEQNPV LAGTLMKVAA ATAAITVGLG ALAVAVAAVL GPLAVIRFGL SMLSVKALPS
AAAAATRTGS VLRLLISGPL ALLRVALFAV GSLLGALLSP VGLVVAALAG VALVIWKYWQ
PISAFLGGVV EGFRAAAAPI SAAFEPLRPV FQWIGDRVQA LWGWFNDLLT PVKSTSEELN
SAAAMGRRFG EALAEGLNMV MHPLESLKSG VSWLLEKLGI VSKEAAKAKL PAQVTQQQSA
TVNSDGKVVL PPGGFPAYAG MYDTGGIIPR GQFGIVGENG PEIVNGPANV TSRRRTAALA
SVVAGVMGGA ATPAEAAPLH PFSLPARAYQ PPLAKADSPP PVIRYEINAP IHIVAQPGQN
AQEIAREVAR QLDERERRAR AKARSNFSDQ GGYES