Gene Mvan_4020 details

Gene Information

Locus tag: Mvan_4020
Symbol:
ID: 4647476
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium vanbaalenii PYR-1
Kingdom: Bacteria
Replicon accession: NC_008726
Strand:
Start bp: 4301177
End bp: 4302205
Gene Length: 1029 bp
Protein Length: 342 aa
Translation table: 11
GC content: 67%
IMG OID: 639807482
Product: inositol 2-dehydrogenase
Protein accession: YP_954803
Protein GI: 120404974
COG category: [R] General function prediction only
COG ID: [COG0673] Predicted dehydrogenases and related proteins
TIGRFAM ID:
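
The coordinate and length fields in this record can be cross-checked against each other. The sketch below assumes that the coordinates are 1-based and inclusive (the record does not state its convention) and that the CDS ends with a single stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the gene record.
start_bp = 4301177
end_bp = 4302205
gene_length_bp = 1029
protein_length_aa = 342

# Assumption: coordinates are 1-based and inclusive on both ends.
span_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codon_count = span_bp // 3
expected_protein_aa = codon_count - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("span is a whole number of codons:", span_bp % 3 == 0)
print("protein length matches:", expected_protein_aa == protein_length_aa)
```

Under these assumptions all three checks should agree with the listed values.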


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.141873
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.256136
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTGAGC TTCGTGTGGC CGTGTTGGGT GTGGGCGTGA TGGGCGCCGA CCACGTCGCC 
CGGATCACCT CCCGGATCTC CGGGGCCCGG GTGTCGGTCG TCAACGACTA CGTCACCGAG
AAGGCCGAGC AGATCGCCTC GGAGGTCGAC GGCTGTCGCG CGGTGGTCGA TCCGCTGGAT
GCGATCGCCG ATCCCGAGGT GGACGCGGTG GTGCTGGCCA CTCCCGGCAG CACCCACGAG
AAGCAGCTGC TGGCCTGCCT GGATCACAGA AAACCGGTGA TGTGCGAGAA GCCGCTCACC
ACCGATGTTT TCACCTCACT GGAGATCGCC CGGAGGGAGG CGGAGCTCGA GTGCCCGCTG
ATCCAGGTCG GGTTCATGCG CCGGTTCGAC GACGAGTACA TGCGTCTCAA GGCACTGCTC
GACGGCGGCG AACTCGGACA GCCCCTGGTG ATGCACTGCG TGCATCGCAA CCCGGGCGTG
CCGTCGTACT TCGACAGTTC GCTGATCGTC AAGGACTCCC TGGTTCACGA GGTCGACGTG
ACGCGGTACC TGTTCGGCGA AGAGATCGCC AGCGTGCAGA TCGTCAGACC CGTCTCGAAT
CCCGCTGCGC CAGAAGGGGT CATCGACCCG CAGATCGCGA TCCTGCGCAC CGTCTCCGGG
CGGCACGTGG ACGTGGAACT GTTCGTGACC ACCGGTGTCG CCTATGAGGT CCGCACCGAG
GTGGTCGGCG AACGCGGCAG CGCGATGATC GGCTTGGACG TCGGGCTCAT CCGCAAGAGT
GCACCCGGCA CGTGGGGCGG TCTGATCGCC CCCGGCTTCC GGGAGCGCTT CGGCCGCGCG
TACGACACCG AAATCCAGCG CTGGGTCGAC GCGGTGCGGG CCGGCACCAA CATCGACGGT
CCGACCGCCT GGGACGGTTA CGCCGCCGCG GCGGTGTGCG CCGCGGGCGT CGAATCACTC
GAGTCGGGAT TGCCCGTCCC GGTGCACCTT GCTGAACGAC CTGACCGCTC CACGATCAGG
CCCCGTTGA
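
The GC content field in the record can be recomputed from a nucleotide sequence. The following is a minimal sketch; `gc_percent` is a hypothetical helper name, and the record does not say how its percentage was rounded. For brevity the example uses only the first 60 bases of the gene sequence.

```python
def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First 60 bases of the gene sequence, with the record's spacing preserved.
first_line = "ATGTCTGAGC TTCGTGTGGC CGTGTTGGGT GTGGGCGTGA TGGGCGCCGA CCACGTCGCC"
print(round(gc_percent(first_line)))
```

Passing the full 1029 bp sequence to the same function allows a direct comparison with the GC content listed in the record.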
 
Protein sequence
MSELRVAVLG VGVMGADHVA RITSRISGAR VSVVNDYVTE KAEQIASEVD GCRAVVDPLD 
AIADPEVDAV VLATPGSTHE KQLLACLDHR KPVMCEKPLT TDVFTSLEIA RREAELECPL
IQVGFMRRFD DEYMRLKALL DGGELGQPLV MHCVHRNPGV PSYFDSSLIV KDSLVHEVDV
TRYLFGEEIA SVQIVRPVSN PAAPEGVIDP QIAILRTVSG RHVDVELFVT TGVAYEVRTE
VVGERGSAMI GLDVGLIRKS APGTWGGLIA PGFRERFGRA YDTEIQRWVD AVRAGTNIDG
PTAWDGYAAA AVCAAGVESL ESGLPVPVHL AERPDRSTIR PR
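
The protein sequence can be checked against the gene sequence by translating codon by codon. The sketch below builds a codon table from the standard NCBI ordering of bases (T, C, A, G). Translation table 11, which the record lists, shares the standard amino-acid assignments and differs only in permitted start codons; this gene begins with ATG, so that difference does not apply here. `translate` is a hypothetical helper name.

```python
from itertools import product

# Amino-acid assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a DNA sequence, ignoring whitespace and stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    protein = []
    # Step through complete codons only; trailing bases are ignored.
    for i in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 20 bases and last 9 bases of the gene sequence.
print(translate("ATGTCTGAGC TTCGTGTGGC"))
print(translate("CCCCGTTGA"))
```

The two fragments should correspond to the start and the end of the protein sequence shown in the record.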