Gene Information |
Locus tag | Sbal223_2592 |
Symbol | |
ID | 7086158 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella baltica OS223 |
Kingdom | Bacteria |
Replicon accession | NC_011663 |
Strand | - |
Start bp | 3079966 |
End bp | 3081156 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 643461482 |
Product | phage major capsid protein, HK97 family |
Protein accession | YP_002358506 |
Protein GI | 217973755 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01554] phage major capsid protein, HK97 family |
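The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers introduced for illustration and are not part of the database.

```python
# Values copied from the Gene Information table.
START_BP = 3079966
END_BP = 3081156


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1191
print(protein_length(length_bp))  # 396
```

Under these assumptions the results agree with the listed Gene Length (1191 bp) and Protein Length (396 aa). The strand field does not affect the length calculation.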
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.339104 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCCAAACC CAAATTTTGA AAAACAAGTA GAAGAATTAG GTGCCAATCT AACTAAGATT GGTGATCAAA TTAAATCGGC GGCAGAAGAA ACCAATAAGC AGATCAAGGC TTCTGGTGAA ATGCATGCCG AAACCCGCGA TAAGGTCGAT AAGCTGCTGT TAGAGCAGGG TGCCATGCAA GCTCGCTTGC AAGAAGCTGA GCAAAAGCTG CTTAAGGGGC CGCAAAGTCA GCAAGAAGAA CGTGAACTGT CGATCGGTGA GCGAGTAGCG AAAGACAAAG AAATGGAAGG GGTAAACAGC TCTTTCCGTG GTAGTCGTCG TGTGCAAATG CCACGTTCAG CGATTACATC CGCCACAGGT TCAGGTGGTG CATTAGTCCG CCCTGATCGT ATGGCTGGTA TCGTTGCACC TCCACAGCGC ACCTTCACGA TTCGCGATTT GATTGCACCA GGTCGAACTG GCAGCAACAG CGTTGAATAT GTGAAGGAAA CTGGCTTTAC CAATAATGCG GCCCCAACTG CTGAAAATAC TCAAAAACCG TATTCTGATA TCACTTTTGG TTTAGTGAAT AATGCAGTAC GTACCATCCC GCATCTTTTT AAAGCCAGCC GCCAAATCTT GGATGATGCC GAGCAATTAG CTAGCTACAT CGATGCTCGT GCTCGTTACG GTTTAATGCT TGCTGAAGAA ACGCAACTGC TGTACGGCAA CAATACTGGC GCTAACCTGC ACGGTATTAT CCCGCAAGCC AGTGCTTACG TTAAACCAGC AGGTGCAACA GTCTCTGCAG AGCAGCATAT TGACCGTATT CGTCTTGCAA TGCTACAAGC TGCTTTAGCT GAATACTCAT CCGACGGTAT CGTACTCAAT CCGATCGACT GGGCTGTGAT TGAAATGCTT AAAGATAGCA ATGGGAATTA CTTAATTGGT AAACCTCAAG GCCAAACATT CGCAACCTTG TGGAATCGTC CTGTTGTAGA AACTTCTGCG ATTGTGCAAG ACGAGTTCTT AGTCGGTGCC TTCCAAATGG GCGCGCAGAT CTATGACCGT ATGGATATTG AAGTCTTAAT CTCTACCGAG AACGACAAAG ACTTTGAGTT AAACATGGTG ACTATCCGTG CTGAAGAACG TTTAGCACTT GCGGTTTATC GTCCTGAAGC GTTTGTCACT GGCGATTTCA CTTTCGCCTA A
Protein sequence | MPNPNFEKQV EELGANLTKI GDQIKSAAEE TNKQIKASGE MHAETRDKVD KLLLEQGAMQ ARLQEAEQKL LKGPQSQQEE RELSIGERVA KDKEMEGVNS SFRGSRRVQM PRSAITSATG SGGALVRPDR MAGIVAPPQR TFTIRDLIAP GRTGSNSVEY VKETGFTNNA APTAENTQKP YSDITFGLVN NAVRTIPHLF KASRQILDDA EQLASYIDAR ARYGLMLAEE TQLLYGNNTG ANLHGIIPQA SAYVKPAGAT VSAEQHIDRI RLAMLQAALA EYSSDGIVLN PIDWAVIEML KDSNGNYLIG KPQGQTFATL WNRPVVETSA IVQDEFLVGA FQMGAQIYDR MDIEVLISTE NDKDFELNMV TIRAEERLAL AVYRPEAFVT GDFTFA
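The relationship between the gene sequence, the GC content field, and the protein sequence can be illustrated with a short script. This is a sketch under stated assumptions, not the method the database itself uses: the helper names `clean`, `gc_content`, and `translate` are hypothetical, the codon table is built from the NCBI amino-acid string for translation table 11 (whose amino-acid assignments match the standard code), and only the first 30 bases of the gene sequence are used to keep the example short.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G). Translation table 11 uses
# the same amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces that separate 10-character groups and uppercase."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a coding sequence from its first base, stopping at a stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence, as printed in the record.
prefix = "ATGCCAAACC CAAATTTTGA AAAACAAGTA"
print(translate(prefix))   # MPNPNFEKQV
print(gc_content(prefix))  # 0.3
```

The translated prefix matches the first ten residues of the listed protein sequence. Passing the full gene sequence to the same functions allows the result to be compared with the GC content and protein sequence fields. Because the listed gene sequence is already given in the coding orientation (it begins with ATG), no reverse complement is applied here despite the minus-strand annotation.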