Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Spro_0865 |
Symbol | |
ID | 5602769 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Serratia proteamaculans 568 |
Kingdom | Bacteria |
Replicon accession | NC_009832 |
Strand | - |
Start bp | 959216 |
End bp | 960880 |
Gene Length | 1665 bp |
Protein Length | 554 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640936380 |
Product | tail fiber repeat 2-containing protein |
Protein accession | YP_001477099 |
Protein GI | 157369110 |
COG category | [R] General function prediction only |
COG ID | [COG5301] Phage-related tail fibre protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0127303 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAGCAA AATATCGCGC CCTGCTCACC GATCAGGGTA AAGCGCTGCT GGCTAACGCC GCGGCAACCG GCCAGAAGCT GGAGATCACC CAGATGGCGG TCGGCGATGG CGGCGGCTCG GCCACTCTGC CCAGCGAAAG CCAAACCAGG CTGGTGAATG AAAAACGACG GGCGGTGCTC AATTCCCTGC AGGTTAACGC CGACAGCGGC AACCAGGTGA TCGCCGAGCA GGTGATCCCG GAAGATACCG GTGGCTGGTG GATCCGTGAA TTGGGGTTGT ACGATAAAAA TGGTGTACTG GTGGCCGTGG CTAATACACC AGACACTTAC AAGCCATTGC TGGCCGAAGG TGCCGGTCGT ACTCAGGTAG TGCGCATGGT CCTGCTGGTC AAAGGTGACG CCAGCGCGGT GATTGTGGCG GACAAAACCG CCGTGCTGGT TTCCCGCGAT ACGCTGGATG CGGCTATCGC CGAACATGCC CGTTCACGTA ACCACCCGGA TGCCACCCTG CTGGCCAAAG GCTTCACCCA ACTGAGCAGC GACAGCAATA GCAACAGCGA AGTTCTGGCA GCGACGCCAA AAGCGGTGAA AGCGGTTAAT GATGCAAGCC TCAAAAAAAC CGATAACCTT TCGGACCTGA CCAACAAAGC CACCGCACGT GGCAATCTCG CACTGGGTAA TGCGGCAACG AGAAATGTGG GGACTGAAGC TACAAATCTG ATGGAGGTCG GCGCGTTCGG TTTCGGCGCA GGTATAAAGC ATCATGCCGA TGCCTATAGT AACCTTGGTG AAATTTATCG GGTGAATAAC TTATCAAAAA ATGCTCCTGG ATCAGGCACC TATGGTGTTC TCAATCTCCC TTGCGATGGA GGACCCTCCA GCGGTTATTT GGCAATACAA AACAGCGCGG CTGCCTATAT CGGTATTTCC ACCGTTCCGG AGAAACCACT GTCGTGGTAC CGAATTTATA CCACGGCCTA TAAACCCACG GCGGCAGATG TCGGGGCTTA CAGCAAGGCC GAGACCGATG GCAAGTTTGT CAAACAGATC GGCGATACTA TAACCGGTGG CCTAACGGTT AATGGTTCTA TTGAAACCAA GTCAGGGCTG ACCACTCCGT CATTATCGGT GAATGGTAAC TCGGTTATTT CAGGTCAACT GACAGCTAAA GCCGGTGTCG AACTGTTTGG GACATCTCCT TATATCGATT TTCATTACGG TAATACCAAT ACGGATTATG ATGTCCGCAT TATCAATGAA AAACAAGGTC AACTCACCCT CGGAGCTAAA ACCGTTCGAG TTAATGAAAA CTTTTCGGTT GGCGGTGATG CTTATATTGA TGGTAGATTG TATGCAACCA TAGAATGCCG CGCAGGCGAA GCTTTATTTG CTGGTAACTC TGCCTATCAA TCAGACGGAA ATATAAACGG CAGTATCTGG GGTGGTTATT TATCTAACTA CCTTAATCAA AATTTTGTAC GAGACATTCG CCTTGGCTCT GTAGAGAGTG CGCCGTCTTG GAATGGACCA GGTTACAATG ATAATGCAGG CTACGTACTG ACAGGGGCAT CAAATTATAA TATGGATGAG TACCTAGATC ACATTTACCG TCGCCCGCTG CAAAAGCATA TTAATGGTAA CTGGGTTACC GTATGGAGCG TTTAA
|
Protein sequence | MTAKYRALLT DQGKALLANA AATGQKLEIT QMAVGDGGGS ATLPSESQTR LVNEKRRAVL NSLQVNADSG NQVIAEQVIP EDTGGWWIRE LGLYDKNGVL VAVANTPDTY KPLLAEGAGR TQVVRMVLLV KGDASAVIVA DKTAVLVSRD TLDAAIAEHA RSRNHPDATL LAKGFTQLSS DSNSNSEVLA ATPKAVKAVN DASLKKTDNL SDLTNKATAR GNLALGNAAT RNVGTEATNL MEVGAFGFGA GIKHHADAYS NLGEIYRVNN LSKNAPGSGT YGVLNLPCDG GPSSGYLAIQ NSAAAYIGIS TVPEKPLSWY RIYTTAYKPT AADVGAYSKA ETDGKFVKQI GDTITGGLTV NGSIETKSGL TTPSLSVNGN SVISGQLTAK AGVELFGTSP YIDFHYGNTN TDYDVRIINE KQGQLTLGAK TVRVNENFSV GGDAYIDGRL YATIECRAGE ALFAGNSAYQ SDGNINGSIW GGYLSNYLNQ NFVRDIRLGS VESAPSWNGP GYNDNAGYVL TGASNYNMDE YLDHIYRRPL QKHINGNWVT VWSV
|
| |