Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Shewmr4_0122 |
Symbol | |
ID | 4251001 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella sp. MR-4 |
Kingdom | Bacteria |
Replicon accession | NC_008321 |
Strand | - |
Start bp | 133332 |
End bp | 134990 |
Gene Length | 1659 bp |
Protein Length | 552 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 638116665 |
Product | hypothetical protein |
Protein accession | YP_732260 |
Protein GI | 113968467 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.785107 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.000543761 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCTCTATC TTCTCGCTAG CTGTATCGCA CTATTAATCG GACCACTGTT TTATCGCTAC TTCTCATCGG GCAGCGGCTT ACAGAAGGGA CTCGATGGCT TTATCTTCGT CTCTTTAGGC GGGCTAGTGC TGATCCATAT TTTGCCTGAA TTGCTCGAGC ACGGCGGCCT ATTGGCCATC GTTTTTGTTG TCCTCGGCCT CTGGGGACCG ACCGCCAGCG AACGCTTATT CCATCGCTAC TCAGAGATCA CCCATAATCT CACCCTATCC CTCGGGATTG GCGGTCTTTT ACTGCATACC ATCACAGATG GCGGCGCTAT GGTGCTGGCG CAGCAGGATG GCAATTCCAG CCTATTAGCG CTCGGGGTGA TATTACACCG ACTACCCGTG GGTCTCGCCA TTTGGTGGTT ACTCAAACCT CAAGTCGGCA CCCGTTGGGC GAGTTTAGTA TTAGTCGCCA TGATGCTGCT CACAGGCGTG GGTTATTTCG CGGGCGAGCA ATTATTATCC CAACTCAGTC TAGATAATAC CGTTTATTTA CAAGCCTTTG TGACAGGTTC AATCCTGCAC GTCGTGCTGC ATCAACCCCA TGGTCAACAC GACACCGATA AGCAAGGCCA GTATGAATAT CAGGCCGGTA TCGGCAGTTT ACTGGGGATT GGATTGTTGA TGGTGCTGTT ACTGATGGAT TCTGGCGGCC ATGAACATGC CCACCACGAT CACAGCACCG AACAGCTAAC CACTTGGTTA ATGACCATAG CGCCGGTACT TTTACTCAGC TATGCCGCGG CCGCGCTGCG ATTCCAGTTT GGTTTAACTC CGCAGGATAA CAGCCTTGCT CGCCGTTGGT TCCAACGCTT AGCGGGCCCA GAGGCGCTTG TGCTCACCGC GCTACTGCTT GGCCCTTGGC TGGCACTATT CCAACTACTC GTGGTGTTTA TTATGAGTGC TTATCTGGCC CATGCTCGGG TCGAGATAAC CGATCCCCAC AACAAGTTGC CCAACAATGC GCTACGCTTT GGCTTTGCCC ATCTGGTTGA TCGCAGTGCC CCTTGGGTGC TCCTCAGCCT AGTGCTAGTC AATCTGATCG GCCATCCTTC GGTACCGTTA AGCAATCCTA TGTTGCAGAT TGCCGTACTC TTGTTGGTCT TTTTACCTAT GCGCTTCTGT AATTTAGGTG CAGCAGTACT GTCGATTGCC CTCGCCTACA GCGGCTGGAG CCCAATTGCC ATCATACTGC CCCTGATTGC AGCACCCGTA CTGAACATCG CCCAACTTAA ACTCATGAGC TGGCCACAGC GCGGTATTTT GCTGGCAATA ATCGCCGTAT CACTTGTCGC AGCGCTTAAG TTGCCCATGT GGTTCTCGAT GATCACTTTA CCCGAGGCGA TTAACTTAGC TGCACTATTT ATCCTATCTG CCTTATTTGC CGCGAGCTTA CTGCGCCTAG GACCTCGTAA GTTCTTGCGT CGTTTAATGT TGCTAAAACC TGCACCCCAT GGTCATCATC ACGGACACGC TCATGCTCAC GCGCATACCC AGGCAGCAGA AGCGTCACAT GGGCACGGAC ATAGCCATAC ACAGTCTCAT ACACACAGTC AGAACCATAG TCACGACCAT GATCATTCCC ATGGTGATGG CAATAAGCAC CACCACTAA
|
Protein sequence | MLYLLASCIA LLIGPLFYRY FSSGSGLQKG LDGFIFVSLG GLVLIHILPE LLEHGGLLAI VFVVLGLWGP TASERLFHRY SEITHNLTLS LGIGGLLLHT ITDGGAMVLA QQDGNSSLLA LGVILHRLPV GLAIWWLLKP QVGTRWASLV LVAMMLLTGV GYFAGEQLLS QLSLDNTVYL QAFVTGSILH VVLHQPHGQH DTDKQGQYEY QAGIGSLLGI GLLMVLLLMD SGGHEHAHHD HSTEQLTTWL MTIAPVLLLS YAAAALRFQF GLTPQDNSLA RRWFQRLAGP EALVLTALLL GPWLALFQLL VVFIMSAYLA HARVEITDPH NKLPNNALRF GFAHLVDRSA PWVLLSLVLV NLIGHPSVPL SNPMLQIAVL LLVFLPMRFC NLGAAVLSIA LAYSGWSPIA IILPLIAAPV LNIAQLKLMS WPQRGILLAI IAVSLVAALK LPMWFSMITL PEAINLAALF ILSALFAASL LRLGPRKFLR RLMLLKPAPH GHHHGHAHAH AHTQAAEASH GHGHSHTQSH THSQNHSHDH DHSHGDGNKH HH
|
| |