Gene Information |
Locus tag | Spro_4729 |
Symbol | |
ID | 5602836 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Serratia proteamaculans 568 |
Kingdom | Bacteria |
Replicon accession | NC_009832 |
Strand | + |
Start bp | 5225349 |
End bp | 5226635 |
Gene Length | 1287 bp |
Protein Length | 428 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640940295 |
Product | filamentous haemagglutinin outer membrane protein |
Protein accession | YP_001480950 |
Protein GI | 157372961 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01731] adhesin HecA family 20-residue repeat (two copies); [TIGR01901] filamentous haemagglutinin family N-terminal domain |
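The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length; the function names `gene_length` and `protein_length` are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of gene_len bp."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len // 3
    # Assumption: the final codon is a stop codon and encodes no residue.
    return codons - 1 if has_stop_codon else codons


length = gene_length(5225349, 5226635)
print(length)                  # 1287
print(protein_length(length))  # 428
```

Under these assumptions both values agree with the Gene Length (1287 bp) and Protein Length (428 aa) fields of this record.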
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.257698 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAAGG TATTACCAAG GAAAATATCC TGCCTTTCGG CCATGATAAT GTTTTGCCTC AGTGGCGGAG CCGCCTTTGC TGACGCCTCC ATTGTGCCGG TTGGCGGCAA CGTTAACCTC GCTGCCAATA ACGTGCCGGT AGTGAATATC AATCAGCCGG GCCAGGATGG CGTATCACAT AACCAATACA GCCAGTTTGA TGTAGGTAGC CAGGGCGCGG TATTGAACAA TGCGCAAGGC AGCGCGCAAA GCCAACTGGC CGGTGCCATC ACCGCCAACC CCAACCTGAG CGCAGGTGCC GCGAAGGTGA TTCTTAATGA AATCAATTCC AGCGACAAAA CGCTGCTGAA TGGGATGATC GAAGTCGCCG GCCAGAAGGC GGACGTTATC GTCGCCAACC GTTCCGGCAT TACCTGTAAC GGTTGCGGAT TTATCAATAC CGGTACCGGG GTACTGACCA CCGGTGAACT GCAGTTCAAA GACAAACAGT TCACCGGCTA TAGCGTTACC GGCGGCACGG TAGCATTCGA AGGCAAGGGG CTGCAGAAAG GTGATGTGGA CTACACCGCC GTGATTGCCC GCGCTGAGAA TATCAATGCC GCGGTGCAAG CCAAGAGCCT GCTGGTGTTG GCGGGCAAGA AAACCGTCAC CGCAGATCTG ACTCGTTACA TAAACCTGGA AAACAAAGAA AAATCACCAG AAGTGTTGGT AGATGTCTCC CAGCTCGGTG GTATGTATGC AGGAAAAATC ATCTTAATCG CCAATGAAAA TGGTGTGGGG GTTAATGACA GTACACCCGT CAGCCTTAGA AATAACGGCG AATTAGTGAC TGATGGCGGT GATATGTTGT TGTCTGCTAC GTCTCTTTCA AACGCGGGTA AAATCGCTGC TGGCGGCAAT GCAAACCTGG CAACCACCGT GCTGTCGAAC CAAGGGGATA TCACGGCCAA TAACGCACTT AATATCAGCG CCACCTCTTT CAGCAACGCT AAAGACATCA GCGCAAAAAA TAATATTGAT ATTAAGTCAA CCACTACGTC TAACAATGGG ACAATAGTCT CTCAGCAAGG AGGGGTGACC GTATCGGATA CGACCTTCTC CAACAACGGC ACTATTCGGG CAAAAAATAA CATTTCTCAT AAAGGAACGT TATTTAGCAA TAACGGTACC CTGACCTCAA CCCAAGGTTC AGTGTTCAAC AACGGCAGAG ACATTAATTA CAAGCCTCCT GTCGTCACCC CACCGACAAA CTCATGGTGG TCATCCTGGT GGTCCCGTAG CTGGTAA
|
Protein sequence | MKKVLPRKIS CLSAMIMFCL SGGAAFADAS IVPVGGNVNL AANNVPVVNI NQPGQDGVSH NQYSQFDVGS QGAVLNNAQG SAQSQLAGAI TANPNLSAGA AKVILNEINS SDKTLLNGMI EVAGQKADVI VANRSGITCN GCGFINTGTG VLTTGELQFK DKQFTGYSVT GGTVAFEGKG LQKGDVDYTA VIARAENINA AVQAKSLLVL AGKKTVTADL TRYINLENKE KSPEVLVDVS QLGGMYAGKI ILIANENGVG VNDSTPVSLR NNGELVTDGG DMLLSATSLS NAGKIAAGGN ANLATTVLSN QGDITANNAL NISATSFSNA KDISAKNNID IKSTTTSNNG TIVSQQGGVT VSDTTFSNNG TIRAKNNISH KGTLFSNNGT LTSTQGSVFN NGRDINYKPP VVTPPTNSWW SSWWSRSW