Gene Shewmr4_3133 details


Gene Information

Locus tag: Shewmr4_3133
Symbol:
ID: 4253704
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella sp. MR-4
Kingdom: Bacteria
Replicon accession: NC_008321
Strand:
Start bp: 3751287
End bp: 3752450
Gene Length: 1164 bp
Protein Length: 387 aa
Translation table: 11
GC content: 53%
IMG OID: 638119775
Product: twin-arginine translocation pathway signal
Protein accession: YP_735261
Protein GI: 113971468
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2355] Zn-dependent dipeptidase, microsomal dipeptidase homolog
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
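
The coordinate and length fields above are mutually consistent, which can be checked with a small sketch. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; both are assumptions that happen to fit the numbers in this record. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    # The stop codon occupies one codon but encodes no amino acid.
    return codons - 1 if includes_stop else codons


length = gene_length_bp(3751287, 3752450)
print(length, protein_length_aa(length))
```

With the values from this record, the sketch reproduces the listed gene length of 1164 bp and protein length of 387 aa.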


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGGCTG TCAATCAAGA TCGTCGAACG CTGTTAAAAG GGATTGGCGC CGCCACGCTA 
CTTTGCCCCT TTGCCAGTTT CTCCAGCCTT GCAGCGCCGC GCTCTCGGCG CTTATATATA
GATGGTTTAT CCTTTCTGCC CGATGACTTA GCCGATGTTC CCGCATCGGG GCTCGATGCC
TTTTTATGTG ATATCTCTGC CATCGAAACC ATTGAACAAG CCGATGGCAC CTTAAACTAC
AAACGCACTT ACAAAGCCTG TATGGAAAGC ATCCAGCAAG CCGCGAAACG CGTCAGCGAA
CACTCAGACA TTCTCCTACA AGGCTTAACT GGACGCGATA TCAAACTGGC AAGAGAGAGC
AATCGCACCG CGGTTTTCTT CCAGATCCAA GGCGCGGATT GCGTGGAAGA AGACAGCGAG
GCCAACCAAT GGGCCCGTGT CGATGAGTTT CACCGCCAAG GTCTGCGAGC ACTGCAGCTG
ACCCACCATT ACGGCAATAC CTTTGCGGGC GGCGCCCTCG ATAACGATGC CAATGGCGGG
CTCAATAAAC CTCTTACCGC CCATGGTCGT GCACTGATTG AAAAACTCAA TCATGCCAAT
ATCTTAGTCG ATGTCAGCCA CTCGAGCGCC CAAACCGCGT TAGATGCTGC CAAACTCAGC
CGCGCGCCCA TAGTCCAAAG CCATGGCGCG GCGCGCGGTA TTGTCAAACA TGCCCGTTGT
AGCCCCGATG AAGTGATCCG CGCCATTGCC GACTCAGGCG GCGTATTCGG GGTCTTTATG
ATGAGCTTTT GGCTCACCAA TAATGCCGTT CCAACTGTCG ATGATTATAT CCGCCAGTTA
GAATATGTCA CCCGTATCGG CGGGGTCGAT TGTGTTGCCA TCGCCAATGA TTATCCGCTC
AGAGGCCAAG AAAATCTGTT AGCCCTGAAT AACGACAATG CCCAGGGCGT GAAGGAATAT
CAGGAATGGT GGTACAGCCT AAGGGCTAAG CAAGTGTTAG GTTTTGATGC CGAACCAAGG
CATGTGGTGA TTCCCGAGCT AAACCATATC GAGCGTATGA GCCGTATCGA CGATGCATTA
GCTAAGGCCC GTTTTAAGTC GACCGATCGC GACCGCTTTA TGGGCGGAAA CTGGCAAAGA
GTGCTCAATC AGGTACTCAT CTAA
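
The gene sequence is shown in blocks of 10 bases separated by spaces and line breaks. A minimal sketch for stripping that layout and computing a GC percentage follows; the record reports 53% GC for this gene. The helper names `clean_sequence` and `gc_content` are hypothetical and not part of any database API.

```python
def clean_sequence(raw: str) -> str:
    """Drop spaces, line breaks and digits from a block-formatted sequence."""
    return "".join(ch for ch in raw.upper() if ch.isalpha())


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, used only as a short demonstration.
first_line = "ATGAAGGCTG TCAATCAAGA TCGTCGAACG CTGTTAAAAG GGATTGGCGC CGCCACGCTA"
print(len(clean_sequence(first_line)), round(gc_content(first_line), 1))
```

Passing the full sequence through `gc_content` and rounding to a whole number gives a value that can be compared with the GC content field of the record.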
 
Protein sequence
MKAVNQDRRT LLKGIGAATL LCPFASFSSL AAPRSRRLYI DGLSFLPDDL ADVPASGLDA 
FLCDISAIET IEQADGTLNY KRTYKACMES IQQAAKRVSE HSDILLQGLT GRDIKLARES
NRTAVFFQIQ GADCVEEDSE ANQWARVDEF HRQGLRALQL THHYGNTFAG GALDNDANGG
LNKPLTAHGR ALIEKLNHAN ILVDVSHSSA QTALDAAKLS RAPIVQSHGA ARGIVKHARC
SPDEVIRAIA DSGGVFGVFM MSFWLTNNAV PTVDDYIRQL EYVTRIGGVD CVAIANDYPL
RGQENLLALN NDNAQGVKEY QEWWYSLRAK QVLGFDAEPR HVVIPELNHI ERMSRIDDAL
AKARFKSTDR DRFMGGNWQR VLNQVLI
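
The protein sequence corresponds to a translation of the gene sequence with translation table 11. The sketch below builds a codon table from the standard compact NCBI layout (bases ordered T, C, A, G); table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in the set of permitted start codons, which this sketch does not model. The `translate` function is illustrative and assumes the input starts in frame.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate an in-frame nucleotide sequence, stopping at the first stop codon."""
    seq = "".join(ch for ch in seq.upper() if ch.isalpha())
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# The first 30 bases and the last 12 bases of the gene sequence above.
print(translate("ATGAAGGCTG TCAATCAAGA TCGTCGAACG"))
print(translate("GTACTCATCTAA"))
```

The two fragments translate to the first ten and last three residues of the listed protein, `MKAVNQDRRT` and `VLI`.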