Gene Mmcs_0779 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmcs_0779 
Symbol 
ID4109624 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. MCS 
KingdomBacteria 
Replicon accessionNC_008146 
Strand
Start bp834316 
End bp837315 
Gene Length3000 bp 
Protein Length999 aa 
Translation table11 
GC content66% 
IMG OID638029905 
ProductPKD domain-containing protein 
Protein accessionYP_637955 
Protein GI108797758 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2133] Glucose/sorbosone dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAGCTA ACGACCGCAA GGTGACCTCC ACTTACGCCA GGTACATCGG GCGGGTCGGT 
GCGCTTGCGC TCGCGCTCGG TGTCGGGGGA GCCGTGGCAA CGACTCCGGG GGTTGCCCGG
GCCGAGGACT CCCCGTCGTC GGTGACCGAT TCGACCGGCG TCTCGAGTCC GTCGGGCGGT
GAGCAAGCGC GGACCACCAC GGGCGACGAC ACCACCCCGG ATGACGACCA GTCCGAGATC
GTCGACGACG GTGACGCCGA AGACCCCGCC GAGAGCGAGG ACGCCGAAGA AAGCGAAGAC
CTCGCCGAGG ACGAAGACCC CGACGAGGGC GAGGACCTCG ACGAGCCGGA CGGCATCGGT
GACCCCCTCG AGCCGGCGCA GCCACCGGCA GAGCCCGTCG AGGAACCCGA CGTTCCCGCC
GAACCCGCCG TCGAGACGCC GGTCGACACC GGTACGCCCG CGCAGGAAAC CGGAGGACCG
GCGGGCGACG ACGCCGGCCA AGCCGCCGAC CCGGCCGACA CCACCCTCAC TGACGACCTA
CCCGCGCCGG ACCCGGACGC CGCCGTGCCC GACGTGGACG CGGCGAGTGA CGAGTCGACG
AATTCCCGCT CGTACCAAAC ACTGTCGTTG ACCGAAAGCG ACGATTCGGC ACTGGTCACC
TCGGCGCGCT CGAAGCTCCC CGGTCTGCCG CCCCTGCGGC TGAACCTGCC GACGCCCGAA
CAGTTCATCG CGAGCATTCC GGCGCCGGTC GTCACCTGCC TGTGCGGGGT CATCAACACG
GTGACGAACT TCTTCGACAA CGTGCTCAAA CCGATGCTGG GCGGGGGCGC GGGAGCGGCG
GGCCCCGGCC CCGCAGCACC AGGCTCCTCG CCCGCGCTGT GGGCGGTCGC GGCCTGGGTC
CGCCGGCAGA CGACCCAGGC GATCGACGGC TTCCTCGCGT CGCCGCTGGC CACTCCGATC
CGGATGTTCG AAAGAGCCGT CTTCGACTTC GGTGGCTCTC CGCAGGGCCG CGCGCTGAGT
GCGGCCGTCG TCCAATTCAT CGGACAATGC GGGCCGAGTG CCGACCTGCC TGCGGAACTC
GACCGCACGG TGGTGGTCTC CGGCCTCACC GAGCCGACCG ATTTCAAGAT CCTCAACAAG
CACGACAAGG ACGAGGTCGA CCGGATCTTC ATCGCCGAGA AGGGCGGTTC GGTCAAGGTC
TACAACCCGG AGACCCGAAC GGTCACCACG TTGACGGTCA TCTCCACGAC CACCGGCGGA
GAACGTGGCC TCACCGGCAT CGAGGTCCAT CCTGATTTCT GGCACGAAGA CGAATTCGGA
TATCGCTCCA TCTACGTCGC CTACACAGCG GGCGACACCA ATAGGGACAC CCTCTCGCGA
CTGATCCTGT CCGACGACAT GACCAGGGTC GAGCGCTCCG AGATCCTCAT CGAATCGACC
GAGAACGCCA ACACCTTCCA CCACGGCGGC GATCTGTCCT TCGACAACGA AGGTCAGCAC
CTGTACTGGG TGGTCGGCGA CAACACCCAG GGTGTGGTGA ATTCACAGAG CCTGAGCAAC
ATTCACGGCA AGGTGCTACG GCTCAACGCC GATGGTTCGG TGCCCGAGGA CAATCCGTTC
GTCGACGACG ATCCCGACAC GGCGAGCCCG GCCGACTACA TCTACGCCTA CGGTTTCCGC
AATCCCTTCC GGCTGACCTT CACACCGGAC GGGAAGCTGC TCGTCGCCGA CGTGGGCGAG
TCCAAATGGG AAGAACTCAA CCTCGTCGTG AAGGGCGGCA ACTACGGCTG GCCGCAGGCC
GAGGGCAACT GCACCGGATG TGCGTCGATC AATCCCATTT ACGTGTACGA ACATTCGGCA
CCACCCGTCA GCGGCGGCGC GATCACGTCG GTGACGGTCT TCGACGGCGC CGGATTCCCC
GAGGAGTACC GCAACCGGGT GTTCATCGCC GATTACAGCC TGGGCTGGAT CCGCGTGCTC
GACTTCGACG ACCAGTACAC CAGCCTGATC AGTGCGAAGA CGTTCTGGGG CAACGCCGGT
GCGACGGTCA ACCTGGCCCA GGGACCCGAC GACAACCTCT ACCAGCTGAC GATCTATCCC
GGTGAGCTGT CGATGATCTC ACCGTCCGGC GGCAACCGGG CGCCCACCGC GGTCATCGAT
GCCTCGCAGA CGACGACGGC GGACAAGACG CTGGTCGTCA ACTTCTCCGG CCTGCGCTCG
TACGACCCCG ATGACGACGA CACCCTCACC TACCGCTGGG ATTTCGGCGA CGGCAACAGC
TCCACGGAGG CGACGCCGGT CAACGAGTTC AAGACCACGG GGTCCTACAG CACCTATACG
GTCACCCTGA CGGTCAGCGA CGGTGAGAAG ACGAACACGA CCACGCAGAA GATCACGGTG
GGCAGTACGC CACCGGTCGC CGAGATCGTG AGCATCCCGT CGTCATACGA CGCCGGTGAC
ACCATCACGT TCACGGGCAG GGGGACGGAC GCTCAGGACC GTCCGGACGG TACGCCGCTC
CCCGGAACCG CCTACAAGTG GACGGTCGTG TTCCACCACA ACGAACACAC GCACCCGTTC
GCCGACAACC TCGTGGGGGA GACCGCCACC ATCACCATCC CGCGGTCACG CGATCAGATC
GACGGCACCT TCTACCGCGT GCACCTGACG GTCACCGACA GCAGCGGTCT GTCGACGACG
ACGTACAAGG ACGTCAATCC CAACCTCGTC GAGCTCACGA TCGCCGCGAG CGATCCCGGC
GCGCGGTTCA GCATCGACGG GTTGCCGTAC ACGGGCTCGT ACACCGAGCG TGCGGTGGTG
GGCGTGGACT ACGTGATCAG CGCCCCGACC ACACAGACGG TCAACGGCAG GCAGCTGACC
TTCGACGGGT GGTCCGACGG TGGCGCGGCG ACGCACACGA TCCGGGTTCC GGAACAGGCG
ACGACGTACA CCGCGACCTA CACGGCCAGT TCCGGGGCGA ATCTGGCGCA GGTGGTGTGA
 
Protein sequence
MTANDRKVTS TYARYIGRVG ALALALGVGG AVATTPGVAR AEDSPSSVTD STGVSSPSGG 
EQARTTTGDD TTPDDDQSEI VDDGDAEDPA ESEDAEESED LAEDEDPDEG EDLDEPDGIG
DPLEPAQPPA EPVEEPDVPA EPAVETPVDT GTPAQETGGP AGDDAGQAAD PADTTLTDDL
PAPDPDAAVP DVDAASDEST NSRSYQTLSL TESDDSALVT SARSKLPGLP PLRLNLPTPE
QFIASIPAPV VTCLCGVINT VTNFFDNVLK PMLGGGAGAA GPGPAAPGSS PALWAVAAWV
RRQTTQAIDG FLASPLATPI RMFERAVFDF GGSPQGRALS AAVVQFIGQC GPSADLPAEL
DRTVVVSGLT EPTDFKILNK HDKDEVDRIF IAEKGGSVKV YNPETRTVTT LTVISTTTGG
ERGLTGIEVH PDFWHEDEFG YRSIYVAYTA GDTNRDTLSR LILSDDMTRV ERSEILIEST
ENANTFHHGG DLSFDNEGQH LYWVVGDNTQ GVVNSQSLSN IHGKVLRLNA DGSVPEDNPF
VDDDPDTASP ADYIYAYGFR NPFRLTFTPD GKLLVADVGE SKWEELNLVV KGGNYGWPQA
EGNCTGCASI NPIYVYEHSA PPVSGGAITS VTVFDGAGFP EEYRNRVFIA DYSLGWIRVL
DFDDQYTSLI SAKTFWGNAG ATVNLAQGPD DNLYQLTIYP GELSMISPSG GNRAPTAVID
ASQTTTADKT LVVNFSGLRS YDPDDDDTLT YRWDFGDGNS STEATPVNEF KTTGSYSTYT
VTLTVSDGEK TNTTTQKITV GSTPPVAEIV SIPSSYDAGD TITFTGRGTD AQDRPDGTPL
PGTAYKWTVV FHHNEHTHPF ADNLVGETAT ITIPRSRDQI DGTFYRVHLT VTDSSGLSTT
TYKDVNPNLV ELTIAASDPG ARFSIDGLPY TGSYTERAVV GVDYVISAPT TQTVNGRQLT
FDGWSDGGAA THTIRVPEQA TTYTATYTAS SGANLAQVV