Gene Mflv_3303 details


Gene Information

Locus tag: Mflv_3303
Symbol:
ID: 4974624
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium gilvum PYR-GCK
Kingdom: Bacteria
Replicon accession: NC_009338
Strand:
Start bp: 3488963
End bp: 3489985
Gene Length: 1023 bp
Protein Length: 340 aa
Translation table: 11
GC content: 70%
IMG OID: 640457526
Product: putative agmatinase
Protein accession: YP_001134568
Protein GI: 145223890
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0010] Arginase/agmatinase/formiminoglutamate hydrolase, arginase family
TIGRFAM ID: [TIGR01227] formimidoylglutamase; [TIGR01230] agmatinase
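
The gene and protein lengths listed above follow from the start and end coordinates. A minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive (the record does not state this) and that the final codon is a stop codon that is not counted in the protein length:

```python
# Values copied from the record; the variable names are illustrative only.
# Assumption: coordinates are 1-based and inclusive.
start_bp = 3488963
end_bp = 3489985

gene_length = end_bp - start_bp + 1   # inclusive span in base pairs
codons = gene_length // 3             # total codons, including the stop codon
protein_length = codons - 1           # the stop codon is not translated

print(gene_length, protein_length)    # prints: 1023 340
```

Under these assumptions the result agrees with the listed Gene Length (1023 bp) and Protein Length (340 aa).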


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.345323
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.151363
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGGCCGAAC AACTGGAGCT GGCGTACGCC GGGATGGCGT CCTTCGGGCA TCGCCCGTTC 
CTGACCGAGG TCGAGCAGCT CGACTCCTGG AGGCCCGACG CGGCGATCGT CGGCGCACCG
TTCGACGTGG GGACCACCAA CAGACCCGGC GCGCGCTTCG GGCCCAGGGC CATCCGCGCC
ACCGCCTATG AGCCCGGCAC CTATCACATG GACCTCGGGC TGGAGATCTT CGACTGGCTC
GAGGTCGTCG ACTTCGGTGA CGCCTACTGC CCACACGGCC AGACCGAGGT GTCGCACAAC
AACATTCGCG AACGCGTGCA CATGCTCGCC TCCCGCGGCA TCGTGCCCGT GGTGCTCGGC
GGCGACCACT CCATCACCTG GCCCGCGGCG ACCGCGGTCG CCGACGTCCA CGGCTACGGC
AACGTCGGCA TCGTGCACTT CGACGCCCAC GCCGACACCG CCGACGAGAT CGAGGGCAAC
CTCGCCAGCC ACGGCACTCC GATGCGCCGG CTGATCGAGT CCGGTGCGGT GCCCGGGTCA
CATTTCGTTC AGGTCGGCCT ACGCGGCTAC TGGCCGCCCC GCGACACGTT CGACTGGATG
CTCGAGCAGA AGATGACCTG GCACACGATG CAGGAGATCT GGGAGCGAGG GTTCAAGGCG
GTGATGGCCG ACGCGGTCGG CGAGGCCCTG GCCAAGGCCG ACAAACTCTA CGTCTCCGTG
GACATCGACG TGCTCGATCC CGCGCACGCA CCGGGCACGG GAACGCCGGA GCCGGGCGGC
ATCACCAGCG CCGACCTGCT GCGCATGGTG CGGCAACTCT GCTACGAACA CGACGTCGCC
GGCGTGGACG TCGTCGAGGT GGCACCGGCC TACGACCACG CCGAGCTCAC GGTCAACGCC
GCGCACCGGG TGGTGTTCGA GGCACTCGCC GGCATGGCGG CGCGATGCCG GGACGCCGCG
AACGGCGAGG TGGGCCAACC GGCGCGGTCC TACCGGGACC GGGACGCTAC TTCGCGAGAG
TGA
 
Protein sequence
MAEQLELAYA GMASFGHRPF LTEVEQLDSW RPDAAIVGAP FDVGTTNRPG ARFGPRAIRA 
TAYEPGTYHM DLGLEIFDWL EVVDFGDAYC PHGQTEVSHN NIRERVHMLA SRGIVPVVLG
GDHSITWPAA TAVADVHGYG NVGIVHFDAH ADTADEIEGN LASHGTPMRR LIESGAVPGS
HFVQVGLRGY WPPRDTFDWM LEQKMTWHTM QEIWERGFKA VMADAVGEAL AKADKLYVSV
DIDVLDPAHA PGTGTPEPGG ITSADLLRMV RQLCYEHDVA GVDVVEVAPA YDHAELTVNA
AHRVVFEALA GMAARCRDAA NGEVGQPARS YRDRDATSRE
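
The protein sequence corresponds to the gene sequence read codon by codon. The sketch below illustrates this on the first 60 bases only, and also computes the GC fraction of that fragment. It builds the codon table by hand; the codon-to-amino-acid assignments are those of the standard genetic code, which translation table 11 shares (the tables differ in permitted start codons, which this sketch does not model). The function names `translate` and `gc_fraction` are hypothetical helpers, not part of any database tool.

```python
from itertools import product

# Standard codon assignments (also used by translation table 11),
# listed in TCAG order with the third base varying fastest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(dna):
    """Remove the spaces/newlines used for display and upper-case the bases."""
    return "".join(dna.split()).upper()


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    seq = clean(dna)
    return sum(base in "GC" for base in seq) / len(seq)


# First 60 bases of the gene sequence, as displayed in the record.
first_row = "ATGGCCGAAC AACTGGAGCT GGCGTACGCC GGGATGGCGT CCTTCGGGCA TCGCCCGTTC"

print(translate(first_row))              # prints: MAEQLELAYAGMASFGHRPF
print(round(gc_fraction(first_row), 3))  # GC fraction of this fragment only
```

The translated fragment matches the first 20 residues of the protein sequence. The GC fraction printed here covers only the 60-base fragment, so it need not equal the 70% reported for the whole gene.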