Gene Information |
Locus tag | SeHA_C3478 |
Symbol | |
ID | 6488025 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Heidelberg str. SL476 |
Kingdom | Bacteria |
Replicon accession | NC_011083 |
Strand | - |
Start bp | 3379784 |
End bp | 3381754 |
Gene Length | 1971 bp |
Protein Length | 656 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 642743607 |
Product | phage tail tape measure protein, TP901 family, core region |
Protein accession | YP_002047221 |
Protein GI | 194449652 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01760] phage tail tape measure protein, TP901 family, core region |
|
|
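The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that contributes no residue. The function names `gene_length` and `protein_length` are hypothetical helpers introduced for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp 3379784, End bp 3381754.
length = gene_length(3379784, 3381754)
print(length, protein_length(length))  # prints "1971 656"
```

Under these assumptions, both results agree with the Gene Length (1971 bp) and Protein Length (656 aa) fields reported in the record.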
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.000000000000755136 |
Fosmid hitchhiking | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAACAGT TAGATTTTAC ATTAAGCCTG ATTGATAAGT TGTCCCGCCC GTTAAAACAG GCACAGAGCA GCGTCACCGG CTTTGCGGAA AAATCAAAAG CGGCCTTTAT GCAGATTGGC GGTGGTGTGC TGGCTTTAGC GGGTACAGGA ATGGCCATAC GGGGTGCGTT ATCACCGGCA ATTGAAATGT ATGATGCGCT GAATGATGCA GCATCAAAAG GGATTGATGA TCAGGCATTA AAGGCCGTAC AGCGGGATGC GCTGCGCTTC AGTACAACTT ATGGCGCCAG TGCGGTGGAA TTTGTTCAGT CCACTGAAAG TATTAATTCC GCCATTGCCG GGCTGACCGG TAATGAACTG CCGAAAGTGA CAAAAGTTGC TAATACCCTG GCGTTTGCCC TGAAATCCAC CGCCGCAGAA ACGGCGGAAT TTATGGGGCA GATGTTTGGT AATTTTTCCG CCGATGCGGA GCGTCTGGGC AAGGTTCAGT TCGCTGAGCA GCTGGCCGGA AAAATGGTGT ATATGCGCAA GGTCTTCGGT ACTGAAATGG GCACTATCAA AGACCTGATG GAAGGGGCGC GGGGCGTCGG TACCAACTAC GGCGTCGGAC TGGATGAACA GCTGGCCGTA CTGGGGCAGC TTAACCGCAC GCTGGGAACG GAAGCCAGCA GCGCTTACGA AGGCTTCATG ACCGGAGCCA TTGAGGGCGG TAAAAAGCTG GGGCTGTCCT TTACGGATGC CACCGGCAAA ATGCTGTCCA TGCCTGAGAT GCTGATTAAA TTGCAGGGCA AATACGGCAA GAGCCTGGAA GGGAACCTGA AAGCCCAGGA GGAACTGGAT GCGGCATTCG GTGACAGTTC GGCTGTGGTC AAACACCTTT ACGGTAATGT GGCGCTTCTC CAGAGGAACA TCACCGAACT GGGCGGATCT GACGGTCTGA AACGTACGCA GGAGATGGCC AGTAAACTGG TGAAACCGTG GGATCGGTTT GTACAAATCC TGAAAGCTAT TCAGACCGTA ATAGGGCTGA CACTAATCCC GGTATTGTAT CCGGTGCTGA ATCGTCTGGC GGATATGGGA CAGACATTTG CCAGATGGAT GCAGCTATTT CCCAACATTG CCCGTGTTAT CGGCTACGCA GCTATGGCGT TGCTGGGGTT TGCGGCAGTG GGCGCGGTTG CCAATATTGT GATGGGCGCT TCTAAGTTCA TCATGGCAGG TTTACGCGGG ATCTGGGTTG CCATGACCGC CGTCACGAAA GCATATACGG CAATGGTATG GTTGGCACAA ATTGCTGTTA TCGCCTGGAA TGCGACGCTT AAATTTTTGC GCGGAGCGTT GCTGGCCGTT CGTATGGCGG CAATCATGGC CGGAATCGGT ATTAATCTTA TGAGCTGGCC GGTCTTGCTT GTGATCGGGG CGATAGCGTT GCTTGCGGCG GGTTGCTGGT TGCTGATTAA ACACTGGGAT ACGGTGAAAG CGGCTGTTAT GGAAACATCC GCGTTTCAGG CATGTGCCAG GGGGGTGGCG TGGCTGGCCG GGGTGTTTTC CACAGCATGG CAATTTATCA GTGAAGGCTG GAACAGTTTT ATTGCGCTAT TAACAGAGTT TTCACCCTCA CAGGCATTAA GTGGACTAGC GTCGGGTATT GTATCCATGT TTGATAATGT CTGGCAGTCC GTTAAAGGTG GTTTTCTGAA ATCGTGGAAC TGGATTGTTG AGAAGCTGAA TAAAATACCC GGCGTTGATA TCTCAATGGC TAATGAAACC TCTTCGCCAC CATTAACAGT AAATAATTTA 
TCTACAGGTG GCGAGCTAAA AGGAATTGAT AAAGGTGGTA TCAGTAAATC TGTCAGTAAT AACTCAAGGT CTGTGACGGA TAACAGCCGG AAAATTAATA CTGTCAATAT CTATCCAAAA GAAATGATAA CGCCGGGGCA GTTAATGGAG TTTCAGGAGT TGGGCGTATG A
|
Protein sequence | MKQLDFTLSL IDKLSRPLKQ AQSSVTGFAE KSKAAFMQIG GGVLALAGTG MAIRGALSPA IEMYDALNDA ASKGIDDQAL KAVQRDALRF STTYGASAVE FVQSTESINS AIAGLTGNEL PKVTKVANTL AFALKSTAAE TAEFMGQMFG NFSADAERLG KVQFAEQLAG KMVYMRKVFG TEMGTIKDLM EGARGVGTNY GVGLDEQLAV LGQLNRTLGT EASSAYEGFM TGAIEGGKKL GLSFTDATGK MLSMPEMLIK LQGKYGKSLE GNLKAQEELD AAFGDSSAVV KHLYGNVALL QRNITELGGS DGLKRTQEMA SKLVKPWDRF VQILKAIQTV IGLTLIPVLY PVLNRLADMG QTFARWMQLF PNIARVIGYA AMALLGFAAV GAVANIVMGA SKFIMAGLRG IWVAMTAVTK AYTAMVWLAQ IAVIAWNATL KFLRGALLAV RMAAIMAGIG INLMSWPVLL VIGAIALLAA GCWLLIKHWD TVKAAVMETS AFQACARGVA WLAGVFSTAW QFISEGWNSF IALLTEFSPS QALSGLASGI VSMFDNVWQS VKGGFLKSWN WIVEKLNKIP GVDISMANET SSPPLTVNNL STGGELKGID KGGISKSVSN NSRSVTDNSR KINTVNIYPK EMITPGQLME FQELGV