Gene Mflv_2172 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMflv_2172 
Symbol 
ID4973494 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium gilvum PYR-GCK 
KingdomBacteria 
Replicon accessionNC_009338 
Strand
Start bp2248080 
End bp2251502 
Gene Length3423 bp 
Protein Length1140 aa 
Translation table11 
GC content71% 
IMG OID640456381 
Productaldehyde dehydrogenase 
Protein accessionYP_001133438 
Protein GI145222760 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID[TIGR01413] Dyp-type peroxidase family 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.387028 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0145109 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCGCA TCGGCGGCGA CGAGCACCTG GTGTCCGAAG TCGAAACCCT GGTCCGGCGC 
TGGCTCGACG ACGCGGCGGG ACACCGCGTC GCACCTGCGG CACGACGGCT CGCCGACGTG
CTGCGCGACC CCGGCGGGCT CGATTTCACC GTCGGGTTCG TCGACCGCGT GATCCGCCCG
GACGACCCCC GCGTCGCGGC GGCGAACCTG CGCGAGCTGG CCCGCACCGC GCCCGGGTTC
CTGCCGTGGC ATCTGCGGCT GCTCATCAGG CTCGGTGCGG CAGTATCCGT CGTCCTACCC
GGCGTCGTGA TCCCGATCGC CTCCCGCGCC CTGCGCGAGA TGGTCGGCCA TCTCCTGGCC
GACGCGACCG ACGCCCGCCT CGGCCGCTCC ATCGAGCGGA TCCGCCGTCG CGGGGTCAAC
CTCAACGTCA ACCTCCTCGG CGAGGCGGTG CTGGGGCGGC GCGAAGCGCG ACGGCGCCTG
ACGGGCACCG AGCGGCTCCT GGCCAGACCC GACGTCGACT ACGTGTCGAT CAAGGTCTCG
GCGAGCGTGC CTCCGCATTC GGTTTGGGCG TTCGACGAGG CCGTCGACCA CATCGAGCAG
AGCCTGCTGC CGCTGTTCAC CCAGGCGGCG ACCGCGGCCA CCCCCAAGTT CATCAACCTC
GACATGGAGG AGTACCGCGA TCTGGAACTG ACGATGGCGG TGTTCACCCG ACTGCTCGAC
CGGCCCGAAC TGCGCGCGCT GGAGGCCGGC ATCGTGTTGC AGGCCTATCT GCCCGACGCG
CTGGGCGCGA TGATCCGCCT GCAGGACTGG GCCCGTGACC GCAGACAGCG CGGCGGCGCC
GGCATCAAGG TCCGTCTGGT CAAGGGGGCC AACCTGCCGA TGGAGCATGT CGAGGCGTCG
CTGCACGGCT GGCCGGTCGC GACATGGCCG ACCAAGCAGG ACACCGACAC CAACTACAAG
AGAGTGCTGA ACTACGCACT GCAGCCCGAC CGGATCGCGA ACGTGCGCAT CGGCGTGGCC
GGCCACAACC TGTTCGACGT CGCCTATGCC TGGATCCTGG CGGGACGCCG CGGTGTCCGC
GACGGCGTCG AGTTCGAGAT GCTGCTGGGC ATGGCCGACG CCCAGGCCGA GGCGGTCCGG
GCGACGGTGG GCAGCCTGCT GCTGTACGTG CCGGTGGTGC ACCCCGAGGA CTTCGACGTT
GCGATCGCCT ACCTGGTGCG CCGGTTGGAG GAGGGTTCGA GCCACCAGAA CTTCATGTCT
GCGGTGTTCG AATTGCACAG TGACGACATG CTTTTCCAGC GTGAGCGGGA GCGGTTCGTC
GCGTCGCTGG CCGCCGTCGA CGACGACGTG CCGCTGCCCA ACCGCCGTCA GGACCGGCGG
GCCGACGACC CGGCGGCCGC GGTGAGGGCC GAGTTCTCGA ACGTCGCCGA CACGGATCCG
GCACTCCCGG GCAACCGGGC GTGGGGGCGG GCCGTCACCG GGCGCATCGC CGGGTCGACC
GCGGGACGCT CGGCGGTGGA CGAGCACACC GTGACGACGG TCGACGGCCT CGAGGCGGTG
ATCTCCCGCG GCGTCGACGC CGGTGCGCGG TGGGCGGCCC TGTCGGGCGC CGAGCGCGCT
GCCGTTCTGC GTCGTGCCGC AGCCACGCTC GAGGCCCGTC GTGCCACACT TCTGGAGGTG
ATGGCCGCCG AGACCGGCAA GACCCTGGAC CAGGGTGACC CGGAGGTCTC GGAGGCGATC
GACTTCGCGA ACTACTATGC GTCCCTTGCA GAGAAGCTCG ACCGGGTGGA CGGCGCCATC
GCGCGACCGG TCGGGCTCAC CGTGGTGACG CCGCCGTGGA ACTTCCCCGT CGCGATCCCC
GCCGGTTCCA CCCTGGCCGC ACTGGCCGCC GGCAGTGCCG TGGTGATCAA GCCCACCACG
CAGGCCAGAC GGTGCGGATC GGTCATGGTC GAGGCGCTGT GGGATGCCGG GATCGGCCGC
GACACACTGC AATTGGTTCA CCTCGACGAA GGCGAGCTGG GCGCCCGGCT GATCTCCGAC
GCCCGGGTCG ACCGGGTGAT CCTGACCGGA GCGTTCGACA CTGCCGCGCT GTTCCGCAGC
TTCCGTCCCG ACCTCGTTCT GCTGGCCGAG ACCAGCGGCA AGAACGCGAT CGTGGTCACC
CCGCACGCGG ACGTCGACCT GGCCGTGAAG GATCTCGTGT ACTCGGCGTT CGGGCACGCC
GGGCAGAAGT GCTCGGCGGC CTCGCTGGGG ATCCTGGTGG GTTCGGTGGC GAGATCGGCG
CGATTCCGTG ACCAGCTCGT CGACGCGGTC ACCTCGCTCA ACGTCGGGTA TCCGACCGAT
CCCACCGTCC AGATGGGTCC GGTCATCGAA CCGGCGGACG GCAAACTGCT GCGCGCACTC
ACCACGCTGG GGCCGGGGGA GAAATGGCTG GTGGCGCCCC GGCAGCTCGA CGACAGCGGC
CGGCTGTGGT CTCCCGGGGT CAGGACCGGG GTCCGGCGCG GTTCGGAGTT CCACCTCACC
GAATACTTCG GCCCGGTGCT GGGCCTGATG GAGGCCCGGG ACCTGGACGA GGCGATCGCG
ATGCAGAACC AGGTCGACTA CGGGTTGACG GCCGGTCTGC ACAGTCTCGA CCGGCACGAG
GTCGAACGCT GGATCGACCG TGTCGAGGCG GGCAACGCCT ACGTCAACCG GTCGACGGTC
GGGGCCGTCG TGGCGCGCCA ACCGTTCGGG GGCTGGAAGA AGTCGGCGGT CGGCGCCGGA
GCCAAGGCGG GCGGCCCGAA CTATCTGATC GGGCTCACCG ACTGGGAGTC CGCACCCGCG
CAGCAGGCGG GTCGCCTCAG CCCCGCGATG AGGGAATTCC TCTCTGCGGC AGGGCGCCTC
GATCTGAGTG CGGAGGATCG GGAATTCCTC GGCCGGTCGG CGAGGTCCGA CGCGCTGGCC
TGGGAACAGG AGTTCGGCAT CGCCCGCGAC GTGGCGGGGC TGACCGCGGA GAAGAACGTC
CTGCGCTACC GCCCGGTGCC GGTGACCGTG CGCGCCGAGG ACGGTCACAC CGCGTCTCTG
CTCCGGGTGG TCGCCGCCGG CCTGCTCGCC GGCGCCGCGA TCACGGTGTC GACCCCGGCG
CCGCCGGCCT CCGCGGTCGC GGACCTGCTG CACATGCGAG GAGTCACCTT GCGCGTCGAC
GATGCTGAGC ACTGGCGGAA GCTGTTGCGC GGCAACGCCC CCGCCCGCGT CCGGTTGATC
GGTGGGAGCC GCGAGGCATT CGCCGGCGCC GGTGAGGGCA GGGCGGACGT CGCGCTCTAC
GCGCAGCCCG TCGTCGAGGC CGGGCGGATC GAGCTGCTCA CGTTCCTGCG GGAACAGGCG
GTCGCGGTCA CCGCGCACCG GTTCGGTTCG CCGACCACCC TGGCCGACGG CCTCTTCGCG
TGA
 
Protein sequence
MTRIGGDEHL VSEVETLVRR WLDDAAGHRV APAARRLADV LRDPGGLDFT VGFVDRVIRP 
DDPRVAAANL RELARTAPGF LPWHLRLLIR LGAAVSVVLP GVVIPIASRA LREMVGHLLA
DATDARLGRS IERIRRRGVN LNVNLLGEAV LGRREARRRL TGTERLLARP DVDYVSIKVS
ASVPPHSVWA FDEAVDHIEQ SLLPLFTQAA TAATPKFINL DMEEYRDLEL TMAVFTRLLD
RPELRALEAG IVLQAYLPDA LGAMIRLQDW ARDRRQRGGA GIKVRLVKGA NLPMEHVEAS
LHGWPVATWP TKQDTDTNYK RVLNYALQPD RIANVRIGVA GHNLFDVAYA WILAGRRGVR
DGVEFEMLLG MADAQAEAVR ATVGSLLLYV PVVHPEDFDV AIAYLVRRLE EGSSHQNFMS
AVFELHSDDM LFQRERERFV ASLAAVDDDV PLPNRRQDRR ADDPAAAVRA EFSNVADTDP
ALPGNRAWGR AVTGRIAGST AGRSAVDEHT VTTVDGLEAV ISRGVDAGAR WAALSGAERA
AVLRRAAATL EARRATLLEV MAAETGKTLD QGDPEVSEAI DFANYYASLA EKLDRVDGAI
ARPVGLTVVT PPWNFPVAIP AGSTLAALAA GSAVVIKPTT QARRCGSVMV EALWDAGIGR
DTLQLVHLDE GELGARLISD ARVDRVILTG AFDTAALFRS FRPDLVLLAE TSGKNAIVVT
PHADVDLAVK DLVYSAFGHA GQKCSAASLG ILVGSVARSA RFRDQLVDAV TSLNVGYPTD
PTVQMGPVIE PADGKLLRAL TTLGPGEKWL VAPRQLDDSG RLWSPGVRTG VRRGSEFHLT
EYFGPVLGLM EARDLDEAIA MQNQVDYGLT AGLHSLDRHE VERWIDRVEA GNAYVNRSTV
GAVVARQPFG GWKKSAVGAG AKAGGPNYLI GLTDWESAPA QQAGRLSPAM REFLSAAGRL
DLSAEDREFL GRSARSDALA WEQEFGIARD VAGLTAEKNV LRYRPVPVTV RAEDGHTASL
LRVVAAGLLA GAAITVSTPA PPASAVADLL HMRGVTLRVD DAEHWRKLLR GNAPARVRLI
GGSREAFAGA GEGRADVALY AQPVVEAGRI ELLTFLREQA VAVTAHRFGS PTTLADGLFA