Gene Mvan_4810 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_4810 
Symbol 
ID4646900 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp5151276 
End bp5152484 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content68% 
IMG OID639808280 
Productarginine deiminase 
Protein accessionYP_955589 
Protein GI120405760 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2235] Arginine deiminase 
TIGRFAM ID[TIGR01078] arginine deiminase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACGGATG CGACGCTGGG ATGCAATTCG GAGGTCGGCA GGCTTCGGGT CGTCATCCTG 
CATCGCCCGG GGCCTGAGTT GCAGCGGTTG ACTCCCCGCA ACAACGACAC CCTGCTCTTC
GACGGGCTGC CCTGGGTGGC AAGGGCTCAG CAGGAGCATG ACGCGTTCGC CGAGCTGCTG
CGGTCGCGGG GGGTCGAGGT GCTGCTGCTC GGTGTGCTGT TGACCGAGGC GCTGTCCAAC
AGCGGCGCGG CCCGCATGCA CGGCATCTCC GCTGCCGTCG ATTCCCGCCG TCTCGGTGTG
CCGCTGGCCC AGGAACTTTC GGCGTACCTG CGCACACTGG ACGCGGCTGC GCTGGCCCGC
ATCCTGATGG CGGGCATGAC GTTCGACGAG TTGCCGTTCG GGGAGAACGA GTTGTCGTTG
GTGCGGCGCA TGCACCACGG TGCGGACTTC GTCATCGACC CACTGCCCAA CCTGCTGTTC
ACCCGCGACT CGTCGTTCTG GATCGGTCCG CGGGTGGCGA TCACCTCGCT GTCGATGCAC
GCGCGGGTGC GGGAGACGTC GCTGACCGAT CTGATCTATG CCCACCATCC CCGCTTTCTC
GGGGTGCGGC GGGCCTACGA GTCGCGGTCG GCACCGATCG AGGGCGGCGA CGTGCTGCTG
CTCGCGCCCG GTGTGGTGGC GGTCGGCGTG GGGGAGCGCA CCACACCTGC CGGGGCGGAA
GCGTTGGCAC GCAGCCTGTT CGACGACGAC CTCGCGCATA CGGTGCTGGC GGTGCCGATC
GCCCAGGAGC GCGCCCAGAT GCATCTGGAC ACGGTGTGCA CGATGGTCGA CACCGATGCG
GTGGTGATGT ACCCGAACAT CCAGGACTCG TTGACCGCCT TCACGATTCG CCGTGAGTCG
GGCGGGGTGA AGATCGACCG TGCCGCACCG TTCGTCGACG CGGCCGCCGA CGCGATGGGA
ATCGCCAAGC TGCGGGTGAT CGACACCGGG CTGGATCCCG TCACCGCCGA GCGCGAGCAG
TGGGACGACG GCAACAACAC TTTGGCGGTA GCGCCCGGCG TGGTGGTCGC CTACGAGCGC
AACACCGAAA CCAATGCGCG CCTGGCAGAT TCGGGTATCG AGGTGCTGCC GATCTCGGCC
TCGGAACTCG GTACCGGCCG CGGCGGGCCG CGCTGTATGT CCTGCCCGGC CGGCCGCGAC
CCGCTCTAG
 
Protein sequence
MTDATLGCNS EVGRLRVVIL HRPGPELQRL TPRNNDTLLF DGLPWVARAQ QEHDAFAELL 
RSRGVEVLLL GVLLTEALSN SGAARMHGIS AAVDSRRLGV PLAQELSAYL RTLDAAALAR
ILMAGMTFDE LPFGENELSL VRRMHHGADF VIDPLPNLLF TRDSSFWIGP RVAITSLSMH
ARVRETSLTD LIYAHHPRFL GVRRAYESRS APIEGGDVLL LAPGVVAVGV GERTTPAGAE
ALARSLFDDD LAHTVLAVPI AQERAQMHLD TVCTMVDTDA VVMYPNIQDS LTAFTIRRES
GGVKIDRAAP FVDAAADAMG IAKLRVIDTG LDPVTAEREQ WDDGNNTLAV APGVVVAYER
NTETNARLAD SGIEVLPISA SELGTGRGGP RCMSCPAGRD PL