Gene SeAg_B2787 details

Gene Information

Locus tag: SeAg_B2787
Symbol:
ID: 6793594
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Agona str. SL483
Kingdom: Bacteria
Replicon accession: NC_011149
Strand:
Start bp: 2732288
End bp: 2734255
Gene Length: 1968 bp
Protein Length: 655 aa
Translation table: 11
GC content: 51%
IMG OID: 642776964
Product: phage tail tape measure protein, TP901 family, core region
Protein accession: YP_002147578
Protein GI: 197250251
COG category: [S] Function unknown
COG ID: [COG5283] Phage-related tail protein
TIGRFAM ID: [TIGR01760] phage tail tape measure protein, TP901 family, core region
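
The coordinates and lengths listed above are mutually consistent, which a short sketch can check. It assumes 1-based, inclusive coordinates and a CDS that ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2732288, 2734255)
print(length)                  # 1968
print(protein_length(length))  # 655
```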


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAACAGT TAGATTTTAC ATTAAGCCTG ATTGATAAAC TGACGCGCCC GTTAAAGCAG 
GTGCAGAGCA GTGTCACAGG CTTTGCGGAA AAATCGAAAG CGGCCTTTAC GCAGATTGGG
GGCGGTGCGC TGGCTTTAGC CGGCACAGGG ATGGCCATCA AAGGGGCGTT ATCGCCGGCT
ATTGAGATGT ATGACGCACT GAATGACGCT GCGGCAAAAG GGATTGATGA TCAGGCTTTA
AAGGCTGTCC AGCGTGATGC GCTGCGGTTC AGTATGACCT ACGGTGCCAG CGCGGTGGAG
TTTGTTAAGT CCACAGAAAA TATTAATGCC TCCATTGCCG GCCTCGCCGG TAATGAGCTG
CCGAAAGTGA CAAAAGTTGC TAATACCCTG GCATTTGCCC TGAGATCCAC ATCTGCCGAA
ACGGCGGAAT TTATGGGGCA GATGTTCGGT AACTTTTCCG CTGATGCGGA GCGTCTGGGC
AAGGTTCAGT TCGCTGAGCA GCTGGCCGGA AAAATGGTGT ATATGCGCAA GGTCTTCGGT
ACCGAAATGG GCACTATCAA AGACCTGATG GAAGGGGCGC GGGGCGTTGG TACCAACTAC
GGCGTCGGAC TGGATGAACA GCTGGCCGTA CTGGGGCAGC TTAACCGCAC GCTGGGAACG
GAAGCCAGCA GCGCTTACGA AGGCTTCATG ACCGGAGCCA TTGAGGGCGG TAAAAAGCTG
GGGCTGTCCT TTACGGATGC CACCGGCAAA ATGCTGTCCA TGCCTGAAAT GCTGATCAAG
TTACAGGGCA AGTATGGCAA AAGTCTGGAA GGGAACCTGA AAGCACAGGC GGAGCTGGAT
GCGGCATTTG GTGACAGTTC GGCGGTGGTG AAACACCTGT ACGGCAATGT GGCCTTACTG
CAACGTAACA TCACTGAGCT GGGCGGTTCT GACGGGCTGA AGCGTACACA GGAGATGGCC
GGCAAACTGG TGAAACCGTG GGATCGCTTT GTACAGATCC TTAAGTCTGT TCAGACCGTC
ATTGGACTGA CGTTGATCCC CGTTCTGTAT CCGGTGCTGA ACCGCCTGGC TGATATGGGA
CAGACCTTTG CCCGCTGGAT GCAGTTGTTT CCCAACATTG CGCGTGTTAT CGGTTATGCG
GCTATGGCGT TGTTGGGGTT TGCTGCTGCC GGCGCAATAG CTAACATCGT TCTGGGCGTC
TCAAAACTTA TTAAGCTGGG TGCGATTGCT CTCTGGAAGA CACTGACTTC AGTCACGAAG
ATATACACCG CCACCGTCTG GATTGCCTCA AAAGCTGTAG CGGCATGGAA TCTGACGCTT
AAATTTCTGC GTGGTACGCT TCTTGCGGTT CGTATGGCGG CAATTATGGC CGGAATTGGC
ATAAATCTGA TGAGCTGGCC GGTTCTGCTG GTTATTGGTG CGATTGGCCT GCTGGCAGCA
GGGTGTTATC TGCTGATTAA ACACTGGGAC GATGTACAGG CGGCGGTGAT GAATACGGCA
GCGTTTACCG CTGTGGCTGG CGTTGTCGAA TGGCTTGCCG GTGTGTTCTC GACGGCATGG
CAATGGATTA AGGACGGCTG GAACGGCTTT ATTAATCTGC TGACGGGATT TTCACCTTCA
CAGGCATTAA GCGGGATGGC CGGTGGTATT GTATCCATGT TTGATAATAT CTGGCAGTCC
GTTAAAGGTA GCTTCCTGAA ATCATGGAAC TGGATTGTAG AAAAGTTGAA TAAAATACCC
GGTGTCAATA TTTCGCTGGC TAACGAGTCA CCTCCGGCAC TGACAACAAA TACGCTTTCT
ACTGGTGGAG AATTAAAAGG AATTGATAAA GGTGGTATTA GTAAATCTGT TAGTAATAAC
TCAAGGGTTG TGACGGATAA CAGTCGGAAA ATTAATACTG TCAATATCTA TCCAAAAGAA
ATGATAACGC CGGGGCAGTT AATGGAGTTT CAGGAGCTGG GCGTATGA
 
Protein sequence
MKQLDFTLSL IDKLTRPLKQ VQSSVTGFAE KSKAAFTQIG GGALALAGTG MAIKGALSPA 
IEMYDALNDA AAKGIDDQAL KAVQRDALRF SMTYGASAVE FVKSTENINA SIAGLAGNEL
PKVTKVANTL AFALRSTSAE TAEFMGQMFG NFSADAERLG KVQFAEQLAG KMVYMRKVFG
TEMGTIKDLM EGARGVGTNY GVGLDEQLAV LGQLNRTLGT EASSAYEGFM TGAIEGGKKL
GLSFTDATGK MLSMPEMLIK LQGKYGKSLE GNLKAQAELD AAFGDSSAVV KHLYGNVALL
QRNITELGGS DGLKRTQEMA GKLVKPWDRF VQILKSVQTV IGLTLIPVLY PVLNRLADMG
QTFARWMQLF PNIARVIGYA AMALLGFAAA GAIANIVLGV SKLIKLGAIA LWKTLTSVTK
IYTATVWIAS KAVAAWNLTL KFLRGTLLAV RMAAIMAGIG INLMSWPVLL VIGAIGLLAA
GCYLLIKHWD DVQAAVMNTA AFTAVAGVVE WLAGVFSTAW QWIKDGWNGF INLLTGFSPS
QALSGMAGGI VSMFDNIWQS VKGSFLKSWN WIVEKLNKIP GVNISLANES PPALTTNTLS
TGGELKGIDK GGISKSVSNN SRVVTDNSRK INTVNIYPKE MITPGQLMEF QELGV
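
The protein sequence can be reproduced from the gene sequence by removing the spaces between the 10-base blocks and translating codon by codon. The sketch below is a minimal illustration, assuming the standard codon assignments, which translation table 11 shares with table 1 for elongation codons. The helper names `clean` and `translate` are hypothetical, and only the first row of the gene sequence is used as input.

```python
from itertools import product

# Standard genetic code in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_row = "ATGAAACAGT TAGATTTTAC ATTAAGCCTG ATTGATAAAC TGACGCGCCC GTTAAAGCAG"
print(translate(first_row))  # MKQLDFTLSLIDKLTRPLKQ
```

The output matches the first 20 residues of the protein sequence. This sketch does not handle the alternative start codons of table 11 (for example GTG or TTG, which are read as methionine at initiation); that does not matter here because the gene starts with ATG.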