Gene SbBS512_E3370 details

Gene Information

Locus tag: SbBS512_E3370
Symbol: speB
ID: 6271024
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shigella boydii CDC 3083-94
Kingdom: Bacteria
Replicon accession: NC_010658
Strand:
Start bp: 3136097
End bp: 3137017
Gene Length: 921 bp
Protein Length: 306 aa
Translation table: 11
GC content: 55%
IMG OID: 641727261
Product: agmatinase
Protein accession: YP_001881711
Protein GI: 187730456
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0010] Arginase/agmatinase/formiminoglutamate hydrolase, arginase family
TIGRFAM ID: [TIGR01230] agmatinase
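
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS length includes the stop codon; both are assumptions that happen to agree with the values listed here, not something the record states. The function names are hypothetical.

```python
def gene_length(start_bp, end_bp):
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count, assuming the CDS ends with one stop codon (an assumption)."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon does not encode a residue


length_bp = gene_length(3136097, 3137017)
print(length_bp)                  # 921
print(protein_length(length_bp))  # 306
```

Both results agree with the Gene Length and Protein Length fields of the record.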


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value: 0.402836
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCACCT TAGGTAATCA ATACGATAAC TCACTGGTTT CCAACGCCTT TGGTTTTTTA 
CGCCTGCCGA TGAACTTCCA GCCGTATGAC AGCGATGCTG ACTGGGTGAT TACTGGTGTG
CCGTTCGATA TGGCCACTTC TGGTCGTGCG GGGGGACGCC ACGGTCCGGC AGCGATCCGT
CAGGTTTCGA CGAATCTTGC CTGGGAACAC AATCGCTTCC CGTGGAATTT CGACATGCGT
GAGCGCCTGA ACGTCGTGGA CTGCGGCGAT CTGGTATATG CCTTTGGCGA TGCCCGTGAG
ATGAGCGAAA AACTGCAGGC ACACGCCGAG AAGCTGCTGG CTGCCGGTAA GCGTATGCTC
TCTTTCGGTG GTGACCACTT TGTTACGCTG CCGCTGCTGC GTGCTCATGC GAAGCATTTC
GGCAAAATGG CGCTGGTACA CTTTGACGCC CACACCGATA CCTACGCGAA CGGTTGTGAA
TTTGACCACG GCACCATGTT CTATACCGCG CCGAAAGAAG GTCTGATCGA CCCGAATCAT
TCCGTGCAGA TTGGTATTCG TACCGAGTTT GATAAAGACA ACGGTTTTAC CGTGCTGGAC
GCCTGCCAGG TGAACGATCG CAGCGTGGAT GACGTTATCG CCCAAGTGAA ACAGATTGTG
GGTGATATGC CGGTTTACCT GACCTTTGAT ATCGACTGCC TGGATCCTGC TTTCGCGCCG
GGAACCGGTA CGCCAGTGAT TGGCGGCCTG ACCTCCGATC GCGCCATTAA ACTGGTACGC
GGCCTGAAAG ATCTCAACAT TGTTGGGATG GACGTAGTGG AAGTGGCTCC GGCATACGAT
CAGTCGGAAA TCACCGCTCT GGCTGCGGCG ACGCTGGCGC TGGAGATGCT GTATATTCAG
GCGGCGAAAA AGGGCGAGTA A
 
Protein sequence
MSTLGNQYDN SLVSNAFGFL RLPMNFQPYD SDADWVITGV PFDMATSGRA GGRHGPAAIR 
QVSTNLAWEH NRFPWNFDMR ERLNVVDCGD LVYAFGDARE MSEKLQAHAE KLLAAGKRML
SFGGDHFVTL PLLRAHAKHF GKMALVHFDA HTDTYANGCE FDHGTMFYTA PKEGLIDPNH
SVQIGIRTEF DKDNGFTVLD ACQVNDRSVD DVIAQVKQIV GDMPVYLTFD IDCLDPAFAP
GTGTPVIGGL TSDRAIKLVR GLKDLNIVGM DVVEVAPAYD QSEITALAAA TLALEMLYIQ
AAKKGE
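
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It is a minimal example, not the database's own method: the helper names are hypothetical, the codon table is built from the NCBI ordering of bases (T, C, A, G), and only the amino acid assignments of translation table 11 are modelled (they are the same as in the standard table; the alternative start codons of table 11 are ignored). The 10-base grouping used in this record is removed before translating.

```python
# Sketch only: names below are hypothetical and not part of the source record.
BASES = "TCAG"
# Amino acids in NCBI order: first base T, C, A, G, then second, then third.
# Table 11 shares these assignments with the standard table; its alternative
# start codons are not modelled here.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the whitespace used for 10-base grouping and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases and last 21 bases of the gene sequence above.
first_block = "ATGAGCACCT TAGGTAATCA ATACGATAAC"
last_block = "GCGGCGAAAA AGGGCGAGTA A"

print(translate(first_block))  # MSTLGNQYDN
print(translate(last_block))   # AAKKGE
```

The two outputs match the first ten and last six residues of the protein sequence listed in this record.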