Gene Mflv_4233 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMflv_4233 
Symbol 
ID4975546 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium gilvum PYR-GCK 
KingdomBacteria 
Replicon accessionNC_009338 
Strand
Start bp4492323 
End bp4493333 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content67% 
IMG OID640458460 
Product3-isopropylmalate dehydrogenase 
Protein accessionYP_001135490 
Protein GI145224812 
COG category[C] Energy production and conversion
[E] Amino acid transport and metabolism 
COG ID[COG0473] Isocitrate/isopropylmalate dehydrogenase 
TIGRFAM ID[TIGR00169] 3-isopropylmalate dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.397196 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.260479 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACTCG CGGTGATCGC CGGCGACGGC ATCGGACCCG AAGTCATCGG CGAAGCCCTC 
AGGGTGCTCG ACGCCGTCGT GCCGGGGGTG GAGAAGACCG AGTACGACCT CGGCGCCCGG
CTGTATCACC GCACAGGTGA GGTGCTGCCC GACTCGGTGC TCGACGAGCT CAAGGGTCAC
GACGCGATCC TGCTCGGCGC GATCGGCGAT CCGTCGATGC CCAGCGGTGT GCTGGAACGT
GGCCTGTTGC TGCGCATCCG TTTCGAGCTC GACCACCACA TCAACCTGCG TCCCGGACGT
CTCTACCCGG GTGTGCAGAG TCCGCTGGCC GGGAATCCCG AGATCGACTT CGTCGTGGTC
CGGGAGGGCA CCGAGGGTCC GTACACCGGT AACGGCGGCG CGATCCGGGT CGGTACCCCG
CACGAGATCG CGACCGAGGT CAGTGTCAAC ACCGCCTACG GTGTGCGCCG TGTCGTGCAG
GACGCGTTCA AGCGTGCCCA GCAGCGGCGC AAGCATCTGA CGTTGGTGCA CAAGAACAAT
GTGCTGACCA ACGCCGGGTC CCTGTGGTGG CGCACCGTGC AGGCGGTCGC CGCGGAGTAC
CCGGAGGTCG AGGTCGCCTA CCAGCACGTC GACGCCGCCA CCATTCACAT GGTCACCGAC
CCGGGCCGCT TCGATGTGAT CGTCACCGAC AACCTGTTCG GCGACATCAT CACCGACCTC
GCCGCGGCGG TGTGTGGTGG TATCGGCCTG GCGGCCAGCG GCAACATCGA TGCGACGCTG
ACGAACCCGT CGATGTTCGA ACCCGTGCAC GGCAGCGCGC CCGATATCGC CGGGCAGGGC
ATCGCTGACC CGACGGCCGC GATCATGTCG GTGTCGCTGC TGCTGGCCCA CATGGCCGAG
ATCGATGCGG CGGCCCGGGT CGACAAGGCC GTCGCCGAGC ACCTGGCCAC CCGCGGGGAC
GAGAAGCTCT CGACCACTCA GGTGGGCGAT CGGATCCTCG GAAAGCTGTA G
 
Protein sequence
MKLAVIAGDG IGPEVIGEAL RVLDAVVPGV EKTEYDLGAR LYHRTGEVLP DSVLDELKGH 
DAILLGAIGD PSMPSGVLER GLLLRIRFEL DHHINLRPGR LYPGVQSPLA GNPEIDFVVV
REGTEGPYTG NGGAIRVGTP HEIATEVSVN TAYGVRRVVQ DAFKRAQQRR KHLTLVHKNN
VLTNAGSLWW RTVQAVAAEY PEVEVAYQHV DAATIHMVTD PGRFDVIVTD NLFGDIITDL
AAAVCGGIGL AASGNIDATL TNPSMFEPVH GSAPDIAGQG IADPTAAIMS VSLLLAHMAE
IDAAARVDKA VAEHLATRGD EKLSTTQVGD RILGKL