Gene Noca_3224 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_3224 
Symbol 
ID4599162 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp3424882 
End bp3425883 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content73% 
IMG OID639777830 
Productagmatinase 
Protein accessionYP_924413 
Protein GI119717448 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0010] Arginase/agmatinase/formimionoglutamate hydrolase, arginase family 
TIGRFAM ID[TIGR01227] formimidoylglutamase
[TIGR01230] agmatinase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.583379 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCCGGT ACGGCGCCCA GTTCGGTCCC GACATCACCT TCCTCGGGGT CGACCCGATC 
GACCTCGACG ACGCCGACGG GCTGGCCGCC GCCGACGTGG TCGTCCTCGG TGCGCCGTTC
GACGGCGGCA CCTCGCACCG GCCCGGCACC CGGTTCGGGC CCAGCGCGAT CCGGCAGACC
GACTACCTGC CCCAGGACGG ATCGCGGCCG CACCTCGCGC TGCGGGTCGA CGCCCTGCGC
GACCTGCGGG TGGTCGATGC GGGCGACGTC GAGATGCCGC CCGGAGACAT CACCCGGGCC
CTCGGCAACC TCGAGGAGGC CGTCTACGCC GTCGCTCGCT CCGGTGCCGT CCCGCTGGTC
CTCGGCGGCG ACCACTCGAT CGCGCTCCCC GATGCGACCG GGGTGGCCCG CCACCTCGGC
TTCGGCCGGG TCTCGATGAT CCACTTCGAC GCGCACGCCG ACACCGGCCA CATCGAGTTC
GGGTCGCTCT ATCGCCACGG CCAGCCGATG CGCCGGCTGA TCGAGTCGGG CGCGCTGCGC
GGGGACCGGT TCCTCCAGAT GGGGCTGCGC GGCTACTGGC CCGGCCCCGA GACGCTCGAC
TGGATGGCGG CGCAGCACAT GCGCTCCTAC GAGATGACCG AGATCGGCCG GCGCGGCCTC
GAGGAGTGCC TGGACGAGGC CTTCGAGATC GCCCTCGACG AGTGCGATGC GGTCTTCCTC
TCCGTCGACA TCGACGTGTG CGACCCCGGC CACGCACCCG GCACCGGCAC GCCGGAGCCC
GGTGGGCTCT CCAGCCGCCA GCTCCTGGAC GCGGTCCGCC GGATCTGTCG CGAGCTCCCG
GTCGCCGGCA TCGACGTGGT CGAGGTGTCC CCGCCGTACG ACCACGCCGA GATCACCGCG
TACCTCGCCA ACCGGGTCTG CCTCGAGGCC CTCTCCGGCC TGGCGGCCCG CCGCCACGGC
ATCTCGCACG ATCCGGCCGG CCCGCTGCTG GAAGGTCGCT GA
 
Protein sequence
MTRYGAQFGP DITFLGVDPI DLDDADGLAA ADVVVLGAPF DGGTSHRPGT RFGPSAIRQT 
DYLPQDGSRP HLALRVDALR DLRVVDAGDV EMPPGDITRA LGNLEEAVYA VARSGAVPLV
LGGDHSIALP DATGVARHLG FGRVSMIHFD AHADTGHIEF GSLYRHGQPM RRLIESGALR
GDRFLQMGLR GYWPGPETLD WMAAQHMRSY EMTEIGRRGL EECLDEAFEI ALDECDAVFL
SVDIDVCDPG HAPGTGTPEP GGLSSRQLLD AVRRICRELP VAGIDVVEVS PPYDHAEITA
YLANRVCLEA LSGLAARRHG ISHDPAGPLL EGR