Gene Mvan_4899 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_4899 
Symbol 
ID4648832 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp5247973 
End bp5250285 
Gene Length2313 bp 
Protein Length770 aa 
Translation table11 
GC content64% 
IMG OID639808370 
Producthydantoinase B/oxoprolinase 
Protein accessionYP_955678 
Protein GI120405849 
COG category[E] Amino acid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0146] N-methylhydantoinase B/acetone carboxylase, alpha subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.313887 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCACAG TCAACGAACG CCCGGCCGGC AAGCTCACCG CAACTGAGCA ACGGTGGGTC 
GATCAGTTCA TGGACGAGAC CACGCTGTTC CTGGGCCCGG ACCCGGAAAT CATGCGCAGC
CACCGCATCT CGGAACGCTC GCTGCACGAG GAGGTCGCCA TCGCGGCCGG CGTGGACCGG
CTGCAGGTGG AGCGCATCCG AAAGCGCATC GCCGGCGCCC TGGACGAGGG CTACGAGATG
TGCGAGGCAC AGGGTGCGGC CCCAGGTGCC AAGTGGGGTG ACCTGACCAC CGCCATCTAC
ACGGGCTCCG GCGATGTCTC CTACCTGTCG TGCCACGGCG TCATCGCCTT CAGTGCGATC
CTGCACCACC CGATCCGATA CATCATGAAG TACTGGAAGG ACGAGCCGAC CGTCGGGATC
AACCCTGGTG ACGGCTTCAT CCACAACGAC GCCCGATTCG GCAACGTGCA CAACACCGAC
CAGTCGATGA TCATGCCGAT CATCCGCGAC GACGAGATCA TCGCGTGGGT GGCAGCCACC
ATCCACGAGG GCGAGAACGG GGCGTGTGAA CCCGGCGGCA TGCCCTCGGG CTCGGAGACG
GCGTTCGACG ACGGGCTGCG GATGAGCCCG TTCAAGATCG TCGAGCGCGG CGAGTTGCGC
CGGGATCTGC TGACCTTCCT GCAGCATTCG GTGCGCGACC CGAAGCTCCA GCTGGCCGAC
CTGAAGGTGA AGATCACCGC GGTGCGCAAG ATCATGGAGC GCATCGACAA GGTGATCGAC
GAGGTGGGGG TCGACACCTT CGTGGCAGCA CTGCGCACCA CCGTCGAGGA CGTCGACGCC
GAGGTGCGCC GCCGCATCGC CGAACTGCCG GATGGGACCT ACCGGTTCGA CCAGTTCATG
GACAGCACGC TCAAGGAGAA CATCCTCATC AAGTTCGCGT GCAAGATCAC GGTCAAGGGC
GACCACATGA CCGTGGATCT GCGCGGAACG GGGCCCGAAA TCCTCAACCG GGCTATCAAT
TCGCCGTTGT GTTCGGTGAA ATCGATGATG ATGCAGGCGA TCCTGGCGTT CTGGTGGCCG
GACCTGCCGC GCTGCACCGC GGCGATGAGT TGCATCGACA TCATCTCCGA CGAGGGCACC
TGGGCGGATG CCTCCTACGA CGCCCCGATG GGGCAGTCGC TGCAGGCTTC GTTCCGCGGC
TTCTCCTGCA TGCAGGCGCT CTACAGCCGG ATGTCGTTCT CCACCCCGCA CAAGTACTCC
AACGTGGTGG CCAACTGGTT CAACCAGATC AACACGTTCT TGTGGGGCGG CATCACCCAG
CACGGCGACA TGGTGGGCAA CCTGTGCGCC GACCTCAACG GAATGCCAGG TGGCGCCAAG
CCGTTCCACG ACGGCGAGGA CGCCGTCTCA CCACTGTTCT GTGCGATGGC CGACACGGCC
GAGCAGGAGG TGATGGAGGA AGAGGTGCCG TTCATGCAAC TGGTCGCCAA GCGCCTGGTC
CGCGACAACA TGGGATTCGG CAAGTTCACC GGCGGGATGG GTTACGAGAT GATCGTGGCC
GCCGAGGGAA CGCCTCAGTG GGGATTCATG ACGGTGACCT CCGGTGCGAA GTTCTCGTCG
ATCTACGGCA TGTACGGCGG ATACGGCTGC GGCACCTACC CGTTGGCGAT GGTCAAAGGC
ACCAATGTCT ATGAACATTT CCGACGTGAC AACAAGAAGT TCGACCTGTC GATCGAAAAG
GTGATGAACG AACGCCCCTT CCCCGACGGC CGCTACTCCA CGTATCACAT GGGTCTACAG
TTCGACCTGG CCAAGGACGG CGAGCTGTAC ATGATCAGCC AGGGCGCCGG GGGCGGTTAT
GGAGATCCCC TGGAGAGGCT GCCCGAGTCG GTGGTCCGCG ACGCTGAGCT CGGCCGGATC
AGCCAGAAGG TGGCCGAAGA CATCTTCGGA GTTCGTTACG AGCCGAAGAC GTTTCGCCTG
GACGTGGCCG GCACCGAGGC GGCACGCGCC GCGGCGCGCA AGAACCGGCT ACAGCGGGGG
AAGCCGTTTG CCGCGTTCTG CGAGGAGTTC GTCACGCCCG AACCGCCGAA GGATCTGCCC
TACTACGGAT CGTGGGGAAC CTGGACCGAC GAGAACCAGG ACATCACCGC CACCGTGCAC
TCCATCGACG GTCCCGAGCG GGTGTGCGCA CCGCTGAGCG CACTTCCGAT CGTGATGGTT
CCTGACCGCC GCGAAGTCAA GATCGGCAAG CTCGAGGCCC GCATTGCCGA GCTTGAAGCC
AAGCACGGCG AGAAGGTCAC CCGACTCACC TGA
 
Protein sequence
MTTVNERPAG KLTATEQRWV DQFMDETTLF LGPDPEIMRS HRISERSLHE EVAIAAGVDR 
LQVERIRKRI AGALDEGYEM CEAQGAAPGA KWGDLTTAIY TGSGDVSYLS CHGVIAFSAI
LHHPIRYIMK YWKDEPTVGI NPGDGFIHND ARFGNVHNTD QSMIMPIIRD DEIIAWVAAT
IHEGENGACE PGGMPSGSET AFDDGLRMSP FKIVERGELR RDLLTFLQHS VRDPKLQLAD
LKVKITAVRK IMERIDKVID EVGVDTFVAA LRTTVEDVDA EVRRRIAELP DGTYRFDQFM
DSTLKENILI KFACKITVKG DHMTVDLRGT GPEILNRAIN SPLCSVKSMM MQAILAFWWP
DLPRCTAAMS CIDIISDEGT WADASYDAPM GQSLQASFRG FSCMQALYSR MSFSTPHKYS
NVVANWFNQI NTFLWGGITQ HGDMVGNLCA DLNGMPGGAK PFHDGEDAVS PLFCAMADTA
EQEVMEEEVP FMQLVAKRLV RDNMGFGKFT GGMGYEMIVA AEGTPQWGFM TVTSGAKFSS
IYGMYGGYGC GTYPLAMVKG TNVYEHFRRD NKKFDLSIEK VMNERPFPDG RYSTYHMGLQ
FDLAKDGELY MISQGAGGGY GDPLERLPES VVRDAELGRI SQKVAEDIFG VRYEPKTFRL
DVAGTEAARA AARKNRLQRG KPFAAFCEEF VTPEPPKDLP YYGSWGTWTD ENQDITATVH
SIDGPERVCA PLSALPIVMV PDRREVKIGK LEARIAELEA KHGEKVTRLT