Gene Information |
Locus tag | Spro_4918 |
Symbol | |
ID | 5602453 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Serratia proteamaculans 568 |
Kingdom | Bacteria |
Replicon accession | NC_009829 |
Strand | - |
Start bp | 7813 |
End bp | 8859 |
Gene Length | 1047 bp |
Protein Length | 348 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640930781 |
Product | major capsid protein E |
Protein accession | YP_001471686 |
Protein GI | 157362827 |
COG category | |
COG ID | |
TIGRFAM ID | |
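The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and a CDS whose final codon is a stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(7813, 8859)
print(length)                  # 1047
print(protein_length(length))  # 348
```

Both values match the Gene Length (1047 bp) and Protein Length (348 aa) rows of the record.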
|
|
Plasmid Coverage information |
Num covering plasmid clones | 74 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 0.532464 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTGATA TTGATATGTT TGAAACACGC ACGATGCTTG AACCTGTCGT GCAAAACTTT GCGCCCCGTC GTTTTTTGCT GAAAACTTTT TTCCCTGGCG TTAAGACGTT TGGAACGAAG CATGTGGATC TTGATTTTGT TCGCGGTAGC CGCACTATGG CCCCGTTCGT AGGCAATGGC TACGGTTCAA AAACGGTCGA AAAGCGGGGG TTCACAACGA AAACCTTTGA ACCGCCACTG GTGGCACCCG ATTTAGTGAC CACCGCTGAT CAGATGCTTG AACGCCGCCC GGGGGAAACA GTGTATGCAG CCAAAAGCCC CGAAGAGCGT GCCGCTGAAC AGCTCGGCCA AGACCTTAGC GATCTGGATG ACATGGTATC GCGTACAGAA GAATGGATGG CGGCTCAGAC ATTGTTTACC GGTCAGGTTC GCGTGCTGGG TGCCGGTGTC GATGAGACGA TTTATTACTG GCCGGAAAAC CCTGCAGATC AGCCGGTCGT CACTTTGACC GGCGATGATT TATGGACAGC GGATAAATCC GATCCGTTAG CAAATATTCG TGGCTGGAAA CGTAAAGTTT CGCTGCAATC GGGCTTTACA CCTCGCATGG TTGTCATGGG GGCAAAGGTT GTTGATGCCT TCATGAAAAA CAAGGCGATC AACGAGGCGC TTGATAATCG ACGAAAAGAG CAGGGCAAGA TTGAACCGAA GGATCTCGAT GAAGGGGTTA CGTTTTACGG TACCCTAGAA GGCGTCGATT TTTACGGTTA TGACGAACAG AGCTACAACG ACGTCAGCGG CAAGCTGGAG GTACTTGTTC CAGAAGATAA GCTGTTACTC GGTGCGCCCG GGCGTGGACG CATGCTGTAT GGTGCAGTTG CGGTCGCTGA CCAGGCGGAA AATACCTTTA CGTATTACGA ATCTCCCCGT GTGCCTGACT CTTGGGTTTC AAAGAAAAGT CCAGAAGGTC GGATCGTAGC GTTGAAATCT AAGCCAGTCC CTAACCCTGG CGTAGCCGAT GCCTATTTGG TTGCTAAGGT GGTCTAA
|
Protein sequence | MSDIDMFETR TMLEPVVQNF APRRFLLKTF FPGVKTFGTK HVDLDFVRGS RTMAPFVGNG YGSKTVEKRG FTTKTFEPPL VAPDLVTTAD QMLERRPGET VYAAKSPEER AAEQLGQDLS DLDDMVSRTE EWMAAQTLFT GQVRVLGAGV DETIYYWPEN PADQPVVTLT GDDLWTADKS DPLANIRGWK RKVSLQSGFT PRMVVMGAKV VDAFMKNKAI NEALDNRRKE QGKIEPKDLD EGVTFYGTLE GVDFYGYDEQ SYNDVSGKLE VLVPEDKLLL GAPGRGRMLY GAVAVADQAE NTFTYYESPR VPDSWVSKKS PEGRIVALKS KPVPNPGVAD AYLVAKVV
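The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any further processing. A minimal sketch of that cleanup, together with a GC-fraction calculation of the kind behind the GC content row, is shown below. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example runs only on the first two blocks of the gene sequence rather than the full record.

```python
def clean_sequence(raw: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two blocks of the gene sequence, used here as a short example.
example = clean_sequence("ATGTCTGATA TTGATATGTT")
print(len(example))          # 20
print(gc_fraction(example))  # 0.25
```

Applying `gc_fraction` to the full cleaned gene sequence would give the value to compare against the GC content field.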