Gene Mmcs_4938 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmcs_4938 
Symbol 
ID4113767 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. MCS 
KingdomBacteria 
Replicon accessionNC_008146 
Strand
Start bp5227751 
End bp5229352 
Gene Length1602 bp 
Protein Length533 aa 
Translation table11 
GC content67% 
IMG OID638034092 
Productglucose-methanol-choline oxidoreductase 
Protein accessionYP_642098 
Protein GI108801901 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2303] Choline dehydrogenase and related flavoproteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCAGCT ACGACTACAT CATCACCGGA GCCGGATCGG CCGGTTGCGT CCTGGCCAAT 
CGGCTGTCGG AGGATCCGAG ACTCAACGTG CTGCTGCTCG AGGCAGGCGG CGGCGATCGC
AATCTGTGGT TCCACATCCC CAAGGGTTCG GGAAAGCTCT TCGAGAGCGA AAAGCACATG
TGGCACTACG AGACCACGCC GTTCGGACCC GATCAGCACG TCGAGCAGTG GATGCGCGGG
AAGGCGCTCG GCGGGTCGAG TTCGATCAAC GGACTGCTCT ACAACCGCGG CAACCGCGCC
GACTACGACG GGCTGGAGCG GTTGGGCAAC AAGGGGTGGG GCTGGGATGA GATCCTGCCC
ATCTTCAAGG GATTCGAGAA CAACGAGTTC GGTCCGTCCG CGACGCGCGG GACCGGCGGC
CCGCTGAACA TCTCCGTCCC CCGCGACCCC GACCCGCTGT GCGAGGAGAT GATCGACGCC
GCGACCCGGA TCGGGATGTC CCGCGTCGAG GACATCAACG AATCCGACGC CGAACGCATC
GGATACGCGA CCTCGACGAT CCGCAAGGGG CGCCGCGTCA GCGCCGCGAC TGCGTTCCTC
AAGCCCGCAA TGCGGCGCCC AAACCTGACC GTGCGCACCG GCGCGCTCGT CCACCGGGTG
ATCATCGAGG GCGGGCGCGC GGCCGGGGTC GAGGTCACGA CGCCCAGCGG TGTCGAACGG
CTCCGCGCCA CCCGCGAGGT GATCGTGTCG ATGGGCAGCC TCAACTCGCC GAAGCTGCTC
CAGCTCTCCG GTATCGGGCC ACGCGAGGTG CTCTCGGCCG CCGGGGTGGA GGTCCGCCTG
GAACGCGACA ACGTCGGTCG TGGACTGCGT GAACACCGTT GCGCGACTCT GCGTTACGGG
CTCAACGAGG ATCTCGGTTA CAACAGGTAC CTCGCAACGA GTATGGGGCA GGCGCTCACC
GGGATGAAAT ACCTCGCCAC ACGCAAGGGG CCACTCGCCG CGCCGTCGTT CGACGTCGTC
GGCTTCGTGA AAACGCGGCC CGACGAAGAA CGCCCCGACG GGCAGGTGAT GATGGGCCCG
TACACGTTGC CGCCCTACAA CGTCGGCGAA CCCGTCTCCA TCCAGCGCGA GCCGGGCGTG
TCCTGCCTCG GCATGGTGTT GCGGCCGACG TCCGAGGGGT ATATCGAGAT CACCTCGGCC
GATCCCGCCG CGGCGCTGAG GATCAACCCC AACTATCTGG GCACCGACTA CGACCGCGAG
ACGACGGCCG GTCTACTCCG CAGGATGCGT GCGATCTTCG AGCAGTCACC GATCGCCGGC
CGCATCAGCC ACGAGACCTA TCCTGGTCCG GGGGTGCAGA GCGATGACCA ACTCGTCGAC
GCCGCGCTCG ACGGGGGTTA CTGCGGCTAC CACGCCGTCG GGACCTGCGC GATGGGGCCC
AGCGACCACG ACGTGGTCGA CCATCAGCTC CGGCTCAGGG GAGTGGACGG ACTGCGCGTC
GTCGACTGTT CGGTGATGCC GACGATCGTG GCCGGCAACC TCAACGGCCC GATCATGGCA
ATGGCCTGGC GGGCAGCGGA TTTCATCCTG CACGGGTGCT GA
 
Protein sequence
MASYDYIITG AGSAGCVLAN RLSEDPRLNV LLLEAGGGDR NLWFHIPKGS GKLFESEKHM 
WHYETTPFGP DQHVEQWMRG KALGGSSSIN GLLYNRGNRA DYDGLERLGN KGWGWDEILP
IFKGFENNEF GPSATRGTGG PLNISVPRDP DPLCEEMIDA ATRIGMSRVE DINESDAERI
GYATSTIRKG RRVSAATAFL KPAMRRPNLT VRTGALVHRV IIEGGRAAGV EVTTPSGVER
LRATREVIVS MGSLNSPKLL QLSGIGPREV LSAAGVEVRL ERDNVGRGLR EHRCATLRYG
LNEDLGYNRY LATSMGQALT GMKYLATRKG PLAAPSFDVV GFVKTRPDEE RPDGQVMMGP
YTLPPYNVGE PVSIQREPGV SCLGMVLRPT SEGYIEITSA DPAAALRINP NYLGTDYDRE
TTAGLLRRMR AIFEQSPIAG RISHETYPGP GVQSDDQLVD AALDGGYCGY HAVGTCAMGP
SDHDVVDHQL RLRGVDGLRV VDCSVMPTIV AGNLNGPIMA MAWRAADFIL HGC