Gene Namu_4832 details

Gene Information

Locus tag: Namu_4832
Symbol:
ID: 8450462
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 5389110
End bp: 5390246
Gene Length: 1137 bp
Protein Length: 378 aa
Translation table: 11
GC content: 71%
IMG OID: 645043871
Product: agmatinase
Protein accession: YP_003204096
Protein GI: 258654940
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0010] Arginase/agmatinase/formimionoglutamate hydrolase, arginase family
TIGRFAM ID: [TIGR01227] formimidoylglutamase; [TIGR01230] agmatinase


Plasmid Coverage information

Num covering plasmid clones: 50
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGGCG GGAGCAAGCC GGCAACGGCC GGCCAGTCGG AGAACGACGA CGAGGGTCTG 
CCCACGTTCA GTTCCCGGGA CTGGGTGGCC ACCGCGGCCG GACTGGCCCA GCGCCGGCCG
GACGTGGCGA TCGTCGGCGC GCCGATGGAC ATCAACACGA CCTACCGGCC GGGAGCCCGG
TTCGGTCCGA AGTACATGCG GTCCAACGCC TATGACCCCG GCACCTACCA CCTGGACCTG
GGTCTGGACA TCTTCGAGTG GCTGGACGTG GTGGACGCCG GCAACGCCTA CTGCCCGCAC
GGCCAGTCGG CGCGGTCGCA ACGCAACATC GAGGCCAAGG TCACCGACGT GCTGCGGGCC
GACGCCTTCC CGATGATCAT CGGCGGCGAC CACTCGATCA CCTACCCGGC GGCCACCGCG
GTCGCCCGCA AGTACGGCTG GGGCAAGGTC GGCCTGCTGC ACTTCGACGC GCACGCCGAC
ACCGCGGACA GCATCGAGGG GCACCTGCAC TCGCACGGCA CCCCGATGCG CCGGTTGATC
GAGTCGGGCG CGATCCGCGG ACGCAATTTC GTCCAGGTCG GGCTGCGCGG CTACTGGCCG
CCGCCGGAGG TCTTCGACTG GATGCGCGAG CAGGAGATGA CGTGGCACCT GATGCACGAC
GTGTGGGACC GGGGCATGCG GCCGGTCATC GCCGACGCCA TCGCCCGGGC CGGCGACGGG
TGCGACTGGC TCTACCTGTC GGTCGACATC GACGTGCTCG ACCCGGGTTT CGCCCCCGGT
ACGGGAACTC CGGAGCCGGG CGGGATGAAC CCGGCGGACC TGCTGCGGGC GGTCCGGCAG
ATCGCGCTGG AGACCCCGCT GGTCGCGATG GACGTGGTCG AGGTCTCGCC GCCGTACGAC
CACGCCGACA ACACCGTCAA CAACGCGCAC CGGGTCATCC TGGAGGCGCT GGGCGCGCTG
GCCACCAAGA AGCGGGAGCG GGCCGGCGGC GCGGTGACCC GGCCGGGCAG CCGCCCCGAT
CCCGCCCGGC TGCGCTATCC GGTCGAGCCC ACCGAGTGGT CGCGCCCCGG TGACGGGACG
AACACCTATA CCGACGCGGA CGCCCACCTG CAGGGCGAGC ACGACGACCA CACCTGA
 
Protein sequence
MSGGSKPATA GQSENDDEGL PTFSSRDWVA TAAGLAQRRP DVAIVGAPMD INTTYRPGAR 
FGPKYMRSNA YDPGTYHLDL GLDIFEWLDV VDAGNAYCPH GQSARSQRNI EAKVTDVLRA
DAFPMIIGGD HSITYPAATA VARKYGWGKV GLLHFDAHAD TADSIEGHLH SHGTPMRRLI
ESGAIRGRNF VQVGLRGYWP PPEVFDWMRE QEMTWHLMHD VWDRGMRPVI ADAIARAGDG
CDWLYLSVDI DVLDPGFAPG TGTPEPGGMN PADLLRAVRQ IALETPLVAM DVVEVSPPYD
HADNTVNNAH RVILEALGAL ATKKRERAGG AVTRPGSRPD PARLRYPVEP TEWSRPGDGT
NTYTDADAHL QGEHDDHT
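
Consistency check (illustrative)

The record's coordinates, lengths, and sequences can be cross-checked with a short script. The following is a minimal Python sketch, not part of the original record. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical, and it assumes that translation table 11 assigns the same amino acids to codons as the standard code, differing only in the permitted start codons. Only the first row of the gene sequence is embedded here; pasting the full sequence should work the same way.

```python
from itertools import product

# Codon -> amino acid mapping in NCBI "TCAG" order.
# Assumption: table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces/newlines used for 10-base blocks and upper-case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Coordinates taken from the record above.
start_bp, end_bp = 5389110, 5390246
gene_length = end_bp - start_bp + 1      # inclusive coordinates
protein_length = gene_length // 3 - 1    # one codon is the stop codon

# First row of the gene sequence from the record.
first_row = "ATGAGCGGCG GGAGCAAGCC GGCAACGGCC GGCCAGTCGG AGAACGACGA CGAGGGTCTG"

print(gene_length)            # 1137
print(protein_length)         # 378
print(translate(first_row))   # MSGGSKPATAGQSENDDEGL
print(round(gc_percent(first_row), 1))
```

The inclusive coordinate span reproduces the listed gene length of 1137 bp, and subtracting the stop codon gives the listed 378 aa. The translated first row matches the first 20 residues of the protein sequence.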