Gene Sama_2521 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSama_2521 
Symbol 
ID4604768 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella amazonensis SB2B 
KingdomBacteria 
Replicon accessionNC_008700 
Strand
Start bp3029669 
End bp3030796 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content59% 
IMG OID639781916 
Productagmatine deiminase 
Protein accessionYP_928393 
Protein GI119775653 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2957] Peptidylarginine deiminase and related enzymes 
TIGRFAM ID[TIGR03380] agmatine deiminase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.415458 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAGCAC GGGGATATAA GGACTTTGCG ATGAGCATCA ACACCCTAAA TTCGACCCCC 
GCCGCCGATG GCTTCACCAT GCCAGCCGAA TGGGCACACC AGCAGGCTGT TTGGATGATC
TGGCCCTACC GCCCTGACAA CTGGCGCGAA GCCGGTCGTT TTGCACAGGC GACCTTTGCC
AAGGTTGCCG ATGCCATTGG CGGCGCCACG CCTGTGTTTA TGGGGGTTCC GGCCGAGTTT
ATGGATAAGG CCCGCGCCAT CATGCCTGCC CACGTAACTC TGGTTGAGAT GAACAGCGAC
GACTGCTGGG CCCGCGATAC AGGCCCCACG GTAGTGACCA ATGGAGCCGG CGAGTGCCGT
GGCATTGACT GGGGCTTTAA CGCTTGGGGC GGCCACAAGG GCGGTCTCTA TTTCCCCTGG
GATCAGGATG AAAAAGTGGC AGCCCAAATG CTGGCCCAGC ACGGCATGGA CAGATACGCC
GCGCCGCTTA TCCTGGAAGG CGGCTCAATA CACGTGGACG GCGAAGGCAC CTGTATGACC
ACCGCCGAGT GCCTTTTGAA TGAAAACCGC AACCCGCATC TCACCAAAGA GCAAATCGAA
GCTCATCTTC GCGATTATCT TGGTGTAACC AGCTTTATCT GGCTGGGTGA CGGCGTGTAT
ATGGACGAGA CCGACGGTCA TATCGACAAC ATCTGCTGTT TTGTCCGCCC CGGCGAAGTG
GCCCTGCACT GGACCGACGA TGTGAACGAC CCACAATACG AGCGCTCTGT GGCTGCGCTC
AAGCTGCTTG AAGCCGCCGT GGATGCCAAA GGCCGCAAAC TGAAGGTGTG GAAGCTGCCT
CAGCCAGGCC CGCTGTACTG CACCGAAGAA GAGTCGGCCG GCGTTGAGAG CGGCACCGGT
GTGCCCCGTG AGGCCGAAGG TCGTCTGGCG GGTTCCTATG TGAACTTCCT TATCACCAAC
GACCGTATCG TCTACCCGCT TTTGGACGAA GCCACCGACG GTGAAGCGCA GCGCATTCTG
GAAGACATCT TCCCTGAGTA TCAGGTTATC GGCGTGCCTG CCCGTGAGAT CCTGCTTGGC
GGTGGCAACA TCCACTGCAT TACCCAGCAA ATCCCATCCG GCAAATAA
 
Protein sequence
MPARGYKDFA MSINTLNSTP AADGFTMPAE WAHQQAVWMI WPYRPDNWRE AGRFAQATFA 
KVADAIGGAT PVFMGVPAEF MDKARAIMPA HVTLVEMNSD DCWARDTGPT VVTNGAGECR
GIDWGFNAWG GHKGGLYFPW DQDEKVAAQM LAQHGMDRYA APLILEGGSI HVDGEGTCMT
TAECLLNENR NPHLTKEQIE AHLRDYLGVT SFIWLGDGVY MDETDGHIDN ICCFVRPGEV
ALHWTDDVND PQYERSVAAL KLLEAAVDAK GRKLKVWKLP QPGPLYCTEE ESAGVESGTG
VPREAEGRLA GSYVNFLITN DRIVYPLLDE ATDGEAQRIL EDIFPEYQVI GVPAREILLG
GGNIHCITQQ IPSGK