Gene Information |
Locus tag | Shewmr4_0640 |
Symbol | |
ID | 4251658 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella sp. MR-4 |
Kingdom | Bacteria |
Replicon accession | NC_008321 |
Strand | + |
Start bp | 745025 |
End bp | 746245 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 638117203 |
Product | hypothetical protein |
Protein accession | YP_732777 |
Protein GI | 113968984 |
COG category | |
COG ID | |
TIGRFAM ID | |
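The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon (the record lists the gene as unspliced). The function names `gene_length` and `protein_length` are illustrative and are not part of the database.

```python
# Values copied from the record above.
START_BP = 745025
END_BP = 746245


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon (assumed)."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # subtract the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1221
print(protein_length(length_bp))  # 406
```

Both results agree with the Gene Length (1221 bp) and Protein Length (406 aa) fields of the record.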
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0000134735 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGCAGTA ATTTACATCA GGCTGGGGTG AAGCTACTCA AGCAGTTAGG TCGCCATGCT GACATCATTA TGGACGCCTA TTTAGCGGGT TCCTTAAACG AGGATGCCCA TGATCCCGCC GTAGTTGAAA AACTCAAGCA GGCGGGGATT TTATGGCGCC CAGAGCCTGA CCAAGAGCTG CGCCTTAAAC GCTCGGTGCG TGCCTTACTC GAAGAGGGCT TAAGTGATGA ACGCAATCGC CAAATCGACT CCAACGTCGG CTCGGCGCTC GCCACCATTA AGACCTTGGC CGACCACTAT AAAGAAGCGC GCCACAGCTC AGATTACAGT GCCGCCGAGG CGTATCTGTC TGATTTAAGT GAGCATGTGT ATAGCTTTGC CGACAGTTTA CGTTACTCCA TCCGCGTGTT GTGGGGGCGC ATCAACAACG AGTTCGGTTA TGTCGGTACC ATTAACGCTA AGATCCGTGA AAACGAACTC GCTCAAAGCC AAGTATCTGA ATTACTTAAT GGTTTGGAGA TGTTCCAGTT TAGCGAATTA GGTGAAATCG CCGGTGATAT CCGTGAGCTG CGTAAGCTGC TGGTGACGAC TTTGCAGGAA ACCATGAGCG ACTGCGCCCA GGAACTCAGT GTGGTGCAGG GCAGGTTGCT GGAACTCCTC GGCCGCTTTA GGCAAATTCG CGGCCGTACC CGCTTGCTTA AGGGCTGGTT ACTGTACACC GATTTGCATC CGGATTATCG CCCTGCGGAC CATGTGTCCC ACAAGGAAAT CCCGAGTTTT TTCAATCGCG CCGAAGTGCT GTTGGCTCCA GCATCTGTGG ATGTGCATAA CGCCAGCCAA GAGTTTGAGT TGATGAACAT TGTCGCCCAT ATCAAGGCGA TTAGCCGTCA GGGCATAGTC GAAACGGTGC GCGAGCAGGA TGTGGCCGTG CCGCTGACGC AGAATGAAGA CTTTGATATT CCTGATAATC CACTCAAGCA AGCGGTCGAC ACTTACTTTG TCGATGTGAT TGAGTCGGGC TTACGCCAGT CGGCGCTCGA TTACTTAGCC GAAAAAGCGC TGCCGTGGGA TGCCGAAAGC TGGATTTATC AAGTGATTGG CGGCTACGAA GGCTTACCCG ATGAGCATAA GGCTTACTTC GAGTTAGAAC CCTTAGGTGA ACCGCACCCC ATCTACAGCG GTAACTTTAT TATCCGCGAC GTGGAATTAT GGCTCGCCTA G
|
Protein sequence | MSSNLHQAGV KLLKQLGRHA DIIMDAYLAG SLNEDAHDPA VVEKLKQAGI LWRPEPDQEL RLKRSVRALL EEGLSDERNR QIDSNVGSAL ATIKTLADHY KEARHSSDYS AAEAYLSDLS EHVYSFADSL RYSIRVLWGR INNEFGYVGT INAKIRENEL AQSQVSELLN GLEMFQFSEL GEIAGDIREL RKLLVTTLQE TMSDCAQELS VVQGRLLELL GRFRQIRGRT RLLKGWLLYT DLHPDYRPAD HVSHKEIPSF FNRAEVLLAP ASVDVHNASQ EFELMNIVAH IKAISRQGIV ETVREQDVAV PLTQNEDFDI PDNPLKQAVD TYFVDVIESG LRQSALDYLA EKALPWDAES WIYQVIGGYE GLPDEHKAYF ELEPLGEPHP IYSGNFIIRD VELWLA
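The relationship between the gene sequence and the protein sequence can be sketched with a small translation routine. This is a minimal illustration and not the database's own pipeline. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in which start codons are permitted; since this gene starts with ATG, no special start handling is needed. The names `CODON_TABLE` and `translate` are hypothetical helpers introduced for this example.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G); "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace is removed first, so the 10-base blocks used in the record
    can be pasted in unchanged.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate("ATGAGCAGTA ATTTACATCA GGCTGGGGTG"))  # MSSNLHQAGV
```

Passing the full gene sequence from the record to `translate` should give a protein to compare against the Protein sequence field; the first ten residues shown here match its opening block.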