Gene Mvan_3024 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_3024 
Symbol 
ID4647206 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp3189601 
End bp3190665 
Gene Length1065 bp 
Protein Length354 aa 
Translation table11 
GC content69% 
IMG OID639806502 
Productputative agmatinase 
Protein accessionYP_953833 
Protein GI120404004 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0010] Arginase/agmatinase/formimionoglutamate hydrolase, arginase family 
TIGRFAM ID[TIGR01227] formimidoylglutamase
[TIGR01230] agmatinase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.197883 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.651701 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCCACG ATCACCGACC TCACCGTGAG CTCGCACCGG GTATGGCCGA GCAGCTGGAC 
CTTCCCTACG CCGGGGTGGT GTCCTTCGGC CACCGGCCCT TTCTCACCGA GTCCGAACAG
CTCGACTCGT GGAAGCCGGA CGTTGCCGTC GTCGGCGCGC CGTTCGACGT CGGGACCACC
AACCGTCCCG GCGCCCGTTT CGGTCCGCGG GCGATCCGCG CGACGGCCTA TGAACCCGGG
ACGTACCACA TGGATCTGGG TCTGGAGATC TTCGACTGGC TCGAGGTCGT CGACTTCGGC
GACGCCTACT GCCCGCACGG CCAGACGGAG GTGTCACACC GCAACATCCG GGAGCGGGTG
CACGCGGTGG CCGACCGAGG GATCGTCCCG GTCATCCTCG GCGGCGACCA TTCGATCACC
TGGCCCGCCG CCACGGCCGT TGCCGACGTG CACGGCTACG GCAACGTCGG CATCGTGCAC
TTCGACGCCC ACGCCGACAC CGCCGACGAG ATCGAAGGCA ACCTCGCCAG CCACGGCACG
CCGATGCGCC GGCTGATCGA ATCGGGCGCC GTGCCCGGTT CGCATTTCGT CCAGGTCGGG
CTGCGCGGTT ACTGGCCGCC CCAGGATACT TTCGAGTGGA TGCTCGAACA GAAGATGACC
TGGCACACCA TGCAGGAGAT CTGGGAGCGC GGCTTCAAGG CGGTGATGGC CGACGCGGTC
GCCGAGGCGC TGGCCAAGGC CGACAAGCTG TACGTCTCGG TCGACATCGA CGTGCTGGAC
CCGGCCCACG CACCGGGCAC CGGGACCCCG GAGCCCGGCG GCATTACCAG CGCAGACCTG
TTGCGCATGG TGCGGCAACT CTGTCACGAG CACGACGTCG TCGGGGTGGA CGTGGTCGAG
GTGGCGCCCG CCTACGACCA CGCCGAGCTC ACGATCAACG CCGCGCACCG GGTGGTGTTC
GAGGCGCTCG CCGGGATGGC GGCCAGGCGC CGCGACGCAG CCGACGGCGA GGTGGGACAG
CCGGCCCGGT CCTACCGGGA CCGGGGCGTC ACTTCTCCAG AGTGA
 
Protein sequence
MGHDHRPHRE LAPGMAEQLD LPYAGVVSFG HRPFLTESEQ LDSWKPDVAV VGAPFDVGTT 
NRPGARFGPR AIRATAYEPG TYHMDLGLEI FDWLEVVDFG DAYCPHGQTE VSHRNIRERV
HAVADRGIVP VILGGDHSIT WPAATAVADV HGYGNVGIVH FDAHADTADE IEGNLASHGT
PMRRLIESGA VPGSHFVQVG LRGYWPPQDT FEWMLEQKMT WHTMQEIWER GFKAVMADAV
AEALAKADKL YVSVDIDVLD PAHAPGTGTP EPGGITSADL LRMVRQLCHE HDVVGVDVVE
VAPAYDHAEL TINAAHRVVF EALAGMAARR RDAADGEVGQ PARSYRDRGV TSPE