Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Shewmr4_3713 |
Symbol | |
ID | 4254276 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella sp. MR-4 |
Kingdom | Bacteria |
Replicon accession | NC_008321 |
Strand | - |
Start bp | 4438748 |
End bp | 4439653 |
Gene Length | 906 bp |
Protein Length | 301 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 638120358 |
Product | DNA binding domain-containing protein |
Protein accession | YP_735833 |
Protein GI | 113972040 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1910] Periplasmic molybdate-binding protein/domain |
TIGRFAM ID | [TIGR01764] DNA binding domain, excisionase family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.017295 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.392744 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACATCCG CCAGTGAATT GATTTACATG AGCGCGAAAC AGGTTGCTGA GTATTTAGAT CTGAACGAGA AGAAGGTCTA CGCCATGGCG AATGACCGAA TTCTGCCAGC AACGAAAATC ACCGGTAAAT GGCTATTCCC TAAGGTGCTA ATTGATCGCT GGGTGATGGA CTCTTGCCAC AGCGGCATGT TAACTGATCG ACTCTTGATC ACTGGCAGCG ATGATCCTCT GCTTTCTATG TTGGTCGCAC GCTTAATGGC GCAGGTTGGT AGTCGTGAGC TGATCAGCTA CAGCGCAACA GGCTCGCGCC TCGGGTTAGA GCTATTAGCA AAGGGATATG CTGATGTTTG TACTCTACAC TGGGGCAGCA TGGAGGATAG AAATATCCGC CATCCAGCCT TACTCAAAGG GTATCAGAAC CATCAGCAAT GGATCATGGT CCATGGCTAT TCCCGTCAGC AAGGTTTGAT CATGCGAACC GATATGCACC ACAGATGCCA AGAAGAGGAC AAGGTACTCA CCTTACCCTG GCGCTGGGTC AGTCGCCAAG GTGGCGCAGG GAGCCAACAA CATTTAGAAC AATGGCTATT AAAGCAAGGA GCACGTTTAG ACCAACTCAA TGTGGTGCTA ACAGCCTATA GCGAACGTGA ACTTGCGGGA TATATAGCAC GTGGAGATGC CGATATTGGT TTTGGTTGCC AATCCGTTGC CTTGGAAAGT GGCTTAAGCT TTGTGCCGCT CGTCAAAGAG TCCTTCGATT TTGTTATGCC GCAAAGCATT TACTTCCGTA GGCAGCTGCA ACAGCTCTTT ACTATGTTAA GCAGCGGGCA TACACGTCAA ATGGCGGCGT TGTTAGGTGG ATACGACCTT ACCGATTGCG GCCAGTTATT GTGGAGTGCT AACTAA
|
Protein sequence | MTSASELIYM SAKQVAEYLD LNEKKVYAMA NDRILPATKI TGKWLFPKVL IDRWVMDSCH SGMLTDRLLI TGSDDPLLSM LVARLMAQVG SRELISYSAT GSRLGLELLA KGYADVCTLH WGSMEDRNIR HPALLKGYQN HQQWIMVHGY SRQQGLIMRT DMHHRCQEED KVLTLPWRWV SRQGGAGSQQ HLEQWLLKQG ARLDQLNVVL TAYSERELAG YIARGDADIG FGCQSVALES GLSFVPLVKE SFDFVMPQSI YFRRQLQQLF TMLSSGHTRQ MAALLGGYDL TDCGQLLWSA N
|
| |