Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeSA_A4463 |
Symbol | |
ID | 6518944 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633 |
Kingdom | Bacteria |
Replicon accession | NC_011094 |
Strand | + |
Start bp | 4333959 |
End bp | 4335263 |
Gene Length | 1305 bp |
Protein Length | 434 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 642749414 |
Product | phage portal protein, HK97 family |
Protein accession | YP_002117150 |
Protein GI | 194734728 |
COG category | [S] Function unknown |
COG ID | [COG4695] Phage-related protein |
TIGRFAM ID | [TIGR01537] phage portal protein, HK97 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0000679414 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCGAATA ATAAACACCC CGGGCGAATA AAAAGCGCCC TTTTAAACTG GCTGGGCGTC CCCGTCAGCC TGACTAACGG CGAGTTCTGG CGCGAGTGGT TCGGAACCAG CAGCAGTGGA AAAGTTGTGA CTGCTGATAA GATTATCCGC CTGTCTGCTG TATGGGCCTG TGTCAGGTTG TTGAGTGAAT CGGTTTCCAC GTTACCGCTG AAAATCTACG AACGTCAGGC TGATGGCTCC CGAAAACTGG CTTCTGATAA TCCTGCTTAC CAGGTGCTTT GTCGGCGCCC CAATCCTGAA ATGACGCCGT CACGTTTTAT GCTGATGGTG GTCGCCAGTA TCTGTCTGCG GGGAAATGCA TTTGTTGAAA AACTGTTTAT TGGCAGAAAA CTGGTATCGC TGGTTCCGCT GTTACCACAG AACATGGTAG TAAAACGACT GGATAGTGGG CAATTGCAGT ACTCATATAC TGAGAACGGA AAACAGCGAA TTATACCTGT AAACCGGATT ATGCATATCC GTGGATTCGG TCTGGATGGT GTATGTGGCA TGATGCCTGC GATGACGGGC ATCGATGTCT TTGGTGCGGC AATGTCGGTG GATGAAGCCG CGGCAAAAAT CTTTGAAAAT GGCCTTCAGA GTACAGGGTT TCTTTCTTCA AAAAATGCGC TGACCAAAGA GCAGCGTGAT CGTCTGAGGC AAAACCTTCA GTCTTTTATC GGTTCAAAAA ATGCCGGGAA ACTGATGGTG CTGGAAAATG AACTCACATA CCAGAATGTC ACCATGAATC CGGAAGCGGC ACAATTGCTG GAAAGCCGTT CCTTCAGTAT CGAGGAAATT TGCCGCTGGT TTCGCGTTCC TCCTTTCATG GTCGGTCATA CCACTAAACA AAGCAGCTGG GCATCCAGTC TTGAAGGGAT GAACCTTCAG TTCCTGACGC ACACTCTTCG ACCGCTGCTG GTGAATATTG AACAGGAAAT TGGCCGGTGC CTGCTCGATA GCGATGATGA CGTGTTCGCG GAGTTCTCCG TTGAAGGACT GCTGCGCGCC GACAGCGCTG GCCGTGCGGC TTACTATACC AGCGCGCTTC AGAATGGCTG GATGTCGCGA AACGATGTTC GCCGTCTGGA AAATATGCCG CCGATTGAAG GGGGGGACAT TTACACCGTT CAGCTCAACC TGACGCAACT GAAAAATCTC GAAAGCAGCA ATCCTGCTGT TCAGGCTCTG GCCCTGAGAG AACTGCATAA CCACGTATTC CCCGATATTT CCTTTGAACA ATCTCCGCTG AAACAGGCCG CTTAG
|
Protein sequence | MANNKHPGRI KSALLNWLGV PVSLTNGEFW REWFGTSSSG KVVTADKIIR LSAVWACVRL LSESVSTLPL KIYERQADGS RKLASDNPAY QVLCRRPNPE MTPSRFMLMV VASICLRGNA FVEKLFIGRK LVSLVPLLPQ NMVVKRLDSG QLQYSYTENG KQRIIPVNRI MHIRGFGLDG VCGMMPAMTG IDVFGAAMSV DEAAAKIFEN GLQSTGFLSS KNALTKEQRD RLRQNLQSFI GSKNAGKLMV LENELTYQNV TMNPEAAQLL ESRSFSIEEI CRWFRVPPFM VGHTTKQSSW ASSLEGMNLQ FLTHTLRPLL VNIEQEIGRC LLDSDDDVFA EFSVEGLLRA DSAGRAAYYT SALQNGWMSR NDVRRLENMP PIEGGDIYTV QLNLTQLKNL ESSNPAVQAL ALRELHNHVF PDISFEQSPL KQAA
|
| |