Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Shewmr4_3133 |
Symbol | |
ID | 4253704 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella sp. MR-4 |
Kingdom | Bacteria |
Replicon accession | NC_008321 |
Strand | + |
Start bp | 3751287 |
End bp | 3752450 |
Gene Length | 1164 bp |
Protein Length | 387 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 638119775 |
Product | twin-arginine translocation pathway signal |
Protein accession | YP_735261 |
Protein GI | 113971468 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2355] Zn-dependent dipeptidase, microsomal dipeptidase homolog |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGGCTG TCAATCAAGA TCGTCGAACG CTGTTAAAAG GGATTGGCGC CGCCACGCTA CTTTGCCCCT TTGCCAGTTT CTCCAGCCTT GCAGCGCCGC GCTCTCGGCG CTTATATATA GATGGTTTAT CCTTTCTGCC CGATGACTTA GCCGATGTTC CCGCATCGGG GCTCGATGCC TTTTTATGTG ATATCTCTGC CATCGAAACC ATTGAACAAG CCGATGGCAC CTTAAACTAC AAACGCACTT ACAAAGCCTG TATGGAAAGC ATCCAGCAAG CCGCGAAACG CGTCAGCGAA CACTCAGACA TTCTCCTACA AGGCTTAACT GGACGCGATA TCAAACTGGC AAGAGAGAGC AATCGCACCG CGGTTTTCTT CCAGATCCAA GGCGCGGATT GCGTGGAAGA AGACAGCGAG GCCAACCAAT GGGCCCGTGT CGATGAGTTT CACCGCCAAG GTCTGCGAGC ACTGCAGCTG ACCCACCATT ACGGCAATAC CTTTGCGGGC GGCGCCCTCG ATAACGATGC CAATGGCGGG CTCAATAAAC CTCTTACCGC CCATGGTCGT GCACTGATTG AAAAACTCAA TCATGCCAAT ATCTTAGTCG ATGTCAGCCA CTCGAGCGCC CAAACCGCGT TAGATGCTGC CAAACTCAGC CGCGCGCCCA TAGTCCAAAG CCATGGCGCG GCGCGCGGTA TTGTCAAACA TGCCCGTTGT AGCCCCGATG AAGTGATCCG CGCCATTGCC GACTCAGGCG GCGTATTCGG GGTCTTTATG ATGAGCTTTT GGCTCACCAA TAATGCCGTT CCAACTGTCG ATGATTATAT CCGCCAGTTA GAATATGTCA CCCGTATCGG CGGGGTCGAT TGTGTTGCCA TCGCCAATGA TTATCCGCTC AGAGGCCAAG AAAATCTGTT AGCCCTGAAT AACGACAATG CCCAGGGCGT GAAGGAATAT CAGGAATGGT GGTACAGCCT AAGGGCTAAG CAAGTGTTAG GTTTTGATGC CGAACCAAGG CATGTGGTGA TTCCCGAGCT AAACCATATC GAGCGTATGA GCCGTATCGA CGATGCATTA GCTAAGGCCC GTTTTAAGTC GACCGATCGC GACCGCTTTA TGGGCGGAAA CTGGCAAAGA GTGCTCAATC AGGTACTCAT CTAA
|
Protein sequence | MKAVNQDRRT LLKGIGAATL LCPFASFSSL AAPRSRRLYI DGLSFLPDDL ADVPASGLDA FLCDISAIET IEQADGTLNY KRTYKACMES IQQAAKRVSE HSDILLQGLT GRDIKLARES NRTAVFFQIQ GADCVEEDSE ANQWARVDEF HRQGLRALQL THHYGNTFAG GALDNDANGG LNKPLTAHGR ALIEKLNHAN ILVDVSHSSA QTALDAAKLS RAPIVQSHGA ARGIVKHARC SPDEVIRAIA DSGGVFGVFM MSFWLTNNAV PTVDDYIRQL EYVTRIGGVD CVAIANDYPL RGQENLLALN NDNAQGVKEY QEWWYSLRAK QVLGFDAEPR HVVIPELNHI ERMSRIDDAL AKARFKSTDR DRFMGGNWQR VLNQVLI
|
| |