Gene Mflv_5171 details


Gene Information

Locus tag: Mflv_5171
Symbol:
ID: 4976482
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium gilvum PYR-GCK
Kingdom: Bacteria
Replicon accession: NC_009338
Strand:
Start bp: 5511509
End bp: 5513350
Gene Length: 1842 bp
Protein Length: 613 aa
Translation table: 11
GC content: 69%
IMG OID: 640459401
Product: dihydroxy-acid dehydratase
Protein accession: YP_001136425
Protein GI: 145225747
COG category: [E] Amino acid transport and metabolism; [G] Carbohydrate transport and metabolism
COG ID: [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase
TIGRFAM ID: [TIGR00110] dihydroxy-acid dehydratase
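
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends with a single stop codon; the function names are hypothetical and not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of residues for a CDS whose last codon is a stop codon (assumption)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # subtract the terminal stop codon


# Values copied from the record above.
START_BP = 5511509
END_BP = 5513350

gene_len = span_length(START_BP, END_BP)
protein_len = expected_protein_length(gene_len)
print(gene_len, protein_len)  # 1842 613
```

Both results agree with the Gene Length (1842 bp) and Protein Length (613 aa) fields listed in the record.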


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.72822
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.406258
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCTGAGT TGAGGTCGCG GACGGTCACG CACGGTCGGA ACATGGCGGG AGCGCGCGCG 
TTGCTGCGTG CCGCGGGGGT GGACGGCGCC GACATCGGCA AGCCGATCGT CGCCGTCGCC
AACAGCTTCA CCGAGTTCGT CCCCGGGCAC ACGCATCTTC AACCGGTGGG CCGGATCGTG
TCGGAGGCGA TCAGGGCGGC CGGCGGGGTG CCGCGCGAGT TCAACACGAT CGCCGTCGAC
GACGGCATCG CGATGGGCCA CGGCGGCATG CTCTACTCCC TGCCCAGCCG GGAGCTCATC
GCCGACTCTG TGGAGTACAT GATCAACGCG CACTGCGCCG ACGCGATGGT GTGTATCTCC
AACTGCGACA AGATCACTCC GGGCATGGTG ATGGCCGCGC TGCGCCTGGA CATCCCGACG
GTGTTCGTCT CGGGCGGCCC GATGGAGGGT GGCTCCGCGG TGCTGGTCGA CGGCACGGTG
CGCACCCGGC TGAATCTGGT CAGTGCGATC GCCGATGCGG TCGACAGCGG GGTGTCGGAT
CCGGACCTGG CCCGTATCGA GGAGTCGGCG TGCCCGACGT GCGGGTCGTG CTCGGGCATG
TTCACCGCGA ACTCGATGAA CTGCCTGACC GAGGCACTCG GGCTGGCGCT GCCCGGCAAC
GGGTCGGTGC TGGCCACCCA CACTGCGCGT CGCGCGCTGT ACGAGAACGC CGGCGCGACG
GTGATGGACC TGTGTCGCCG CTACTACGAC GAGGACGACA CCAGCGTGCT GCCGCGCGCG
ATCGCGAACC GCGAGGCGTT CGACAACGCG ATGGCGATGG ACATGGCGAT GGGTGGCTCG
ACGAACACGA TTCTTCATCT GCTGGCCGCC GCACGGGAGG CGGAGCTGGA CTACACGCTC
GAAGACATCG AGAAGCGCAG CCGCCAGATC CCGTGTCTGT GCAAGGTCGC CCCGAACGGC
CACTATCTGA TGGAAGACGT GCACCGCGCC GGCGGCATCC CGGCGATCCT GGGTGAGCTG
TGGCGCGGGG GGCACCTGCA CGAGACTGTC CACTCGGTGC ATGCGGGCTC GCTTCCGGAA
TGGTTGCGCC GCTGGGATGT TCGCGGCGGG CAGGCGTGCG AGGAGGCGAT CGAACTGTTC
CATGCCGCGC CGGGTTGTGT CCGTTCGGCG TCGGCGTTCA GCCAGTCCGA GCGGTGGGAG
TCGCTGGACA CCGACGCGTC CACGGGTTGC ATCCGCGATG TGCGCCATGC CTATTCGGAG
GACGGCGGGC TGGCGATCCT GCGCGGCAAC CTGTCCCGCG ACGGCTGCAT CGTCAAGACC
GCGGGCGTCG ACGAATCGAT CTGGACATTC TCCGGCCCCG CGGTGGTGGT CGAATCCCAG
GAGGACGCCG TCGACGCGAT CCTGAACGGA CGGGTGCAGC CCGGCGACGT CGTCGTGGTG
CGCTTCGAGG GCCCCAAGGG CGGTCCGGGC ATGCAGGAGA TGCTGTACCC CACCTCATAT
TTGAAGGGCC GCGGGCTGGG CAAGGTGTGC GCGCTGGTGA CCGACGGCAG GTTCTCGGGC
GGGTCGTCGG GACTCGCGGT CGGCCACGTG TCCCCCGAGG CGGCGGCTGG TGGCACCATC
GCACTCGTCG AGGACGGCGA CCGGATCACC ATCGACATCC CGCGCCGCAC GATCGGGCTC
GACGTTGCCG ACGACGAATT GGACCGGCGC CGTGCGGCAT TGGAGGGCGG CGGATATCGG
CCCCGCCACC GGGAGCGACC GGTGTCAGCG GCGCTGCGCG CCTATGCCGC GTTGGCGCTG
TCGGCGGACA AGGGCGCGGT ACGTGATGTC GGCAGGTCCT GA
 
Protein sequence
MPELRSRTVT HGRNMAGARA LLRAAGVDGA DIGKPIVAVA NSFTEFVPGH THLQPVGRIV 
SEAIRAAGGV PREFNTIAVD DGIAMGHGGM LYSLPSRELI ADSVEYMINA HCADAMVCIS
NCDKITPGMV MAALRLDIPT VFVSGGPMEG GSAVLVDGTV RTRLNLVSAI ADAVDSGVSD
PDLARIEESA CPTCGSCSGM FTANSMNCLT EALGLALPGN GSVLATHTAR RALYENAGAT
VMDLCRRYYD EDDTSVLPRA IANREAFDNA MAMDMAMGGS TNTILHLLAA AREAELDYTL
EDIEKRSRQI PCLCKVAPNG HYLMEDVHRA GGIPAILGEL WRGGHLHETV HSVHAGSLPE
WLRRWDVRGG QACEEAIELF HAAPGCVRSA SAFSQSERWE SLDTDASTGC IRDVRHAYSE
DGGLAILRGN LSRDGCIVKT AGVDESIWTF SGPAVVVESQ EDAVDAILNG RVQPGDVVVV
RFEGPKGGPG MQEMLYPTSY LKGRGLGKVC ALVTDGRFSG GSSGLAVGHV SPEAAAGGTI
ALVEDGDRIT IDIPRRTIGL DVADDELDRR RAALEGGGYR PRHRERPVSA ALRAYAALAL
SADKGAVRDV GRS
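
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the standard codon assignments, which translation table 11 shares with the standard code for amino acids (the two differ in permitted start codons). The helper name `translate` and the table-building approach are illustrative choices, and only the first three 10-base blocks of the gene sequence are used as input.

```python
# Build the codon table from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence, as formatted in the record.
first_block = "ATGCCTGAGT TGAGGTCGCG GACGGTCACG"
print(translate(first_block))  # MPELRSRTVT
```

The output matches the first ten residues of the protein sequence listed above. Passing the full gene sequence to the same function would be the natural next check.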