Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ssed_3289 |
Symbol | |
ID | 5610401 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella sediminis HAW-EB3 |
Kingdom | Bacteria |
Replicon accession | NC_009831 |
Strand | + |
Start bp | 3994612 |
End bp | 3997599 |
Gene Length | 2988 bp |
Protein Length | 995 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 640934227 |
Product | PKD repeat-containing protein |
Protein accession | YP_001475021 |
Protein GI | 157376421 |
COG category | [R] General function prediction only |
COG ID | [COG3291] FOG: PKD repeat |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.745526 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.000000829414 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGATTACA GAGCTAACAT CATAAGGGTC GCGCTACTAA GCCTATCCTT ATCTGCAGCA GGTTTATCAG GATCAGATGC CAATGCTGCT AACCCAACGA CGAACACCAA TACCATTTCT GATGCTACTT CTAATCCCGG CGCTCAGCAC CGAGCCTTCC CTGACGTCAA CCTCCCCGAA CCCGCCAATG GCGAACACGC CATCGGGCTT TTAGGCGATA AGCTGCCGGA TGTCGCGGCG GCATATGAAA TGACAACTTC AGAGTTTGCC AAACTGATCA GAACCGATAA GACGGTATGG TTAGATCGCA GAGCTCATAT CTTCTATGTA GAAGTTGAAG CACCAACAGA GCTCGCCGAA TCCGATCCTG GCGGCGAAAT CCAAACTGCA CTCAATGAAG TAGAGACCTT TAGTCTCAAT AGCCGCCCCG GGGCACCGCG AACCATATTT CTCGATTTCG ACGGTCACAC AACCACAGGT ACCGCCTGGA ATAGCTCCAA TAATGTCACC ACGATAAACT CCCCCGCCTA TAACACAGAC GGTACCTCGG CCTCCTTCAC TCAACTCGAA CTCGATAGAA TTTATCTGAT GTGGCAACAG GTAGCCGAAG ACTTTGCCCC ATTTAATGTA AATGTGACAA CCCAGGAACC CTCACCAGAC AAGATAACGA GAACCACCTC TTCCGATCAG ACATTCGGAA CTCGGGTTAT TATCACGCAA GATAACTTCG CCAATTGCGG CTGCGGCGGT TTCGCCTATT TAAGAATATT CGATGATTAC GGCAGTAATG GCGATTATTA CAAACCCGCC TTCGTATTTA ACTCTAGCGT TGTCGGCGCA GGGGAAGCGA TCACTCATGA GGCCGGCCAT AATCTTGGGC TTAGCCACGA CGGACAGAGT GATGGCACCA GCTACTACCA GGGACATGGA TCCGGCGCCA CTGGCTGGGC CCCAATCATG GGAGTCGGCT ATTACCGCGA ACTCGTCCAA TGGAGCAAGG GGGAATACCC CCAGGCGAAC CAGCAACAAG ATGATATTCA GGTGATACAG AATTACGGCG CGCCTTTGAT GCTCGATGAT CACGGCGATA GTATCGCTAA TGCCTCGGCG TTAAGCGAAA CTCCCAATGG CGACACGACC ACACTCGATG GCAACGGCCT GATACGCCGC CGTGATGATA TGGACATGTT TAGCTTCACT GCCGGCGCGG GAAGTTTCAC TCTGAATATC TCGCCATCCG CTGTAAGCCC GAACCTGGAT ATTCTGGCTC AGCTCTATAA CAGTTCAGGC ACTCTGATAG CCAGCAATAA CCCCAGCTCT TCACTCCCCG CGGCGATCAG CGGCAGCTTT GCCAGCGGCG GAGAATACTA CCTCAAGGTG GATGGCACTG GTAAAGGCGA GCTCTCTACC GGTTACAGTG ATTACGGCAG CTTAGGCTGG TATACCATCA ATGGTAGCGT GCAAAATGAC GGCAATTTAC AATCCCCGAC CGCCGCCGTC TCAGTTGGTT ATGTTCCGGG ATATGCACCT ATCACCGCGT TCTTCAATGG TTCAGCCTCC ACCGATGATA TCGGGATCAC CGACTATAGC TGGAATTTCG GCGACGGTGG AGTGGGTAGT GGAGTGAGTC CGTCTCATGA ATATCTTGCC CCGGGCGTAT ATAATGTCAC TCTGACCGTT ACCGATACCG ACAACCTCAC GGACTCAGAC ACGCTGGCCA TCTCTGTGGT TAACCGCTCG CCGATTGCGA TCGCCAACGC CGACAGCTAC TCAGGCACCG CCCCCCATAG CGTTCAATTC TATAGTACGG GCTCAAAAGA TCAGGATGAT TTGGGCACCA TTACCTATGC CTGGCTATTC GGCGACGGCA ACGGTTCAAC TTCCGCCAAT CCGAGTCATC TCTATGCGGC TCAAGGTACA TATACTCCGA GTCTGACCGT CACCGATAAC TTAGGCGTTC AGGATAGCGT CACACTGTCG GATATCAGTA TCTCACCACC GGCTTATCTG GACCAATATG CACAAGGCGA GATATTAGGT TCGGGTACGG TAGGAGGGAA TTACACAGAC ACCTTCGATA ACAGCTCGGC TCAGACAATT AGGGAGCGGG AGTCCGGCGG CAGAAAAAGC AGTCGCTATA GCTATTTGCT ACATACCTGG TTATTTAATG TCTCATCGGG TAACACAGTG ACCATACATC TGAACGCCTG GATGACAGGC TCTTCAGACA GCGATCAGAT GCGCTTCTCC TACTCAGTGA ATGGCGCAGA CTATGTCGAA TTCACAAGAG TACAGAACAC TGACAATGTG GGACTTAAGT CTTTCTTTAT GCCTCAGGGC AGCAGCGGGG ATGTACGCAT TCGTGTCGAG GACACAAATC ACACCCCGGG ACGCCGAGGC TTAGATACCG TCTATATCCA GCAACTCTAT ATCAGTAGCG AGACCCTGCT CGACGGTGAT GTGCCAGCAA CGCCGTCCAT GCAGACTGCG ACGCCAATAT CATCGAGCCA GATAGATATC GCCTGGACTG AAACATCGGA GAATGAATCT GGCTTCAATA TCGAGCGCTC AACCGACCAA AGCAGCTGGA GCTCAGCAGG CTCGGTAGGG GCTAATGTCA CCACTTTCAG CGATAGCGGA TTATTGGCAG GCACGACCTA TTACTACCGC ATCAGTGCCT TTAACGGTTA TGGCAGCTCG CTTGAGTCCA ATACCGTTTC GGCCACCACC GATGCTGCAA GCCCCATCAG CCTCACCGCC ACTGGCAGGA AGGTTAAGGG AATTAAACAT ATCGACCTGC AGTGGTATCA ATACCCAGAT GTCGATATCT ACTTCGATAA TGCTGACGCC GTTACTTTTA GCGCAGAAAG TAGTGAACCC TATGGTTACG ATCTCAACAC CGGCCTGAAA GGCGGTGGCA ATCATTCTAT TCAGGTGTGT AACGCCGGCG GCGGAGACTG CTCGGAGGTG GTGACTGTGA TCTTTTAA
|
Protein sequence | MDYRANIIRV ALLSLSLSAA GLSGSDANAA NPTTNTNTIS DATSNPGAQH RAFPDVNLPE PANGEHAIGL LGDKLPDVAA AYEMTTSEFA KLIRTDKTVW LDRRAHIFYV EVEAPTELAE SDPGGEIQTA LNEVETFSLN SRPGAPRTIF LDFDGHTTTG TAWNSSNNVT TINSPAYNTD GTSASFTQLE LDRIYLMWQQ VAEDFAPFNV NVTTQEPSPD KITRTTSSDQ TFGTRVIITQ DNFANCGCGG FAYLRIFDDY GSNGDYYKPA FVFNSSVVGA GEAITHEAGH NLGLSHDGQS DGTSYYQGHG SGATGWAPIM GVGYYRELVQ WSKGEYPQAN QQQDDIQVIQ NYGAPLMLDD HGDSIANASA LSETPNGDTT TLDGNGLIRR RDDMDMFSFT AGAGSFTLNI SPSAVSPNLD ILAQLYNSSG TLIASNNPSS SLPAAISGSF ASGGEYYLKV DGTGKGELST GYSDYGSLGW YTINGSVQND GNLQSPTAAV SVGYVPGYAP ITAFFNGSAS TDDIGITDYS WNFGDGGVGS GVSPSHEYLA PGVYNVTLTV TDTDNLTDSD TLAISVVNRS PIAIANADSY SGTAPHSVQF YSTGSKDQDD LGTITYAWLF GDGNGSTSAN PSHLYAAQGT YTPSLTVTDN LGVQDSVTLS DISISPPAYL DQYAQGEILG SGTVGGNYTD TFDNSSAQTI RERESGGRKS SRYSYLLHTW LFNVSSGNTV TIHLNAWMTG SSDSDQMRFS YSVNGADYVE FTRVQNTDNV GLKSFFMPQG SSGDVRIRVE DTNHTPGRRG LDTVYIQQLY ISSETLLDGD VPATPSMQTA TPISSSQIDI AWTETSENES GFNIERSTDQ SSWSSAGSVG ANVTTFSDSG LLAGTTYYYR ISAFNGYGSS LESNTVSATT DAASPISLTA TGRKVKGIKH IDLQWYQYPD VDIYFDNADA VTFSAESSEP YGYDLNTGLK GGGNHSIQVC NAGGGDCSEV VTVIF
|
| |