Gene Mmcs_1624 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmcs_1624 
Symbol 
ID4110460 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. MCS 
KingdomBacteria 
Replicon accessionNC_008146 
Strand
Start bp1761035 
End bp1762090 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content65% 
IMG OID638030745 
Product2,3-dihydroxy-2,3-dihydrophenylpropionate dehydrogenase 
Protein accessionYP_638791 
Protein GI108798594 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism
[R] General function prediction only 
COG ID[COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.242088 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCGAGGAA CTGGCTACGC TGCTGGCCGG GGCGACCGGC GATGCACTGA ACGTCGCCAA 
CGAGTTCGCA GAGGTCGTCG TGCGCCGAGT GACCACTCGC AACGGAGCTC GGCTGTTGAT
TCAGGCGCCA AAATCGGCGC AGTGGGTCTC GCTGGATGCC TTGGAGATCG AGGCGCTGAC
CTGGCAGAAC CCAGCCACCT TAGCGGCCAT GGTGGGGAAC GCAGGTACTC CGTTGATTCT
GGGCGACGAG ATATGACTGG CTGGCTGGCG GGCAAACGCG CATTGATCGT CGGCGGAGGT
TCGGGTATCG GTCGGGCCAC CGTCGACGCG TTCCTGAACG AGGATGCCCG GGTGGCCGTG
CTCGAGTACG ACAGCGGTAA GTGCGCCACG CTGCGCCAGC AGCTGCCCGA CGTTCCGGTG
ATCGAGGGCG ATGGGACCAC CCGCACCGCC AACGACGAAG CAGTGCAGGT CGCTGTCGAC
GCCTTCGGCG GACTCGACAC ATTGGTGAAC TGCGTGGGGA TTTTCGATTT CTACCGCCGT
ATTCAGGACA TCCCCGCCGA GCGAATCGAC CAGGCGTTCG ACGAAATGTT TCGCATCAAC
GTCCTGAGTC ATATCCACTC AGTCAAGGCA GCAGTGCCGG CGCTGATGGG CCAGGACGGC
GCCTCGATCG TGCTCACCGA GTCCGCGTCA TCGTTCTATC CAGGACGCGG CGGCCTGCTG
TATGTGGCGT CGAAGTTCGC GGTGCGCGGC GTAGTCACTG CACTGGCTCA CGAACTCGCG
CCGAGGATCC GTGTGAACGG TGTCGCCCCC GGGGGGACGT TGAACACCGA CCTGCGCGGA
CTCGACAGCC TCGACCTTGG CGCCCGCCGT CTGGATGCTG CACCAGATCG GGCCCGCGAA
TTGGCTGCCC GCACACCGCT GGGCGTGGCC CTATCGGGTC ACGACCACGC GTGGAGTTAC
GTCTTCCTCG CTTCTCACCG CTCCCGCGGG CTCACCGGCG AAACGATCCA TCCCGACGGC
GGTTTCAGCC TCGGACCCCC ACCCCAGCGG AATTGA
 
Protein sequence
MRGTGYAAGR GDRRCTERRQ RVRRGRRAPS DHSQRSSAVD SGAKIGAVGL AGCLGDRGAD 
LAEPSHLSGH GGERRYSVDS GRRDMTGWLA GKRALIVGGG SGIGRATVDA FLNEDARVAV
LEYDSGKCAT LRQQLPDVPV IEGDGTTRTA NDEAVQVAVD AFGGLDTLVN CVGIFDFYRR
IQDIPAERID QAFDEMFRIN VLSHIHSVKA AVPALMGQDG ASIVLTESAS SFYPGRGGLL
YVASKFAVRG VVTALAHELA PRIRVNGVAP GGTLNTDLRG LDSLDLGARR LDAAPDRARE
LAARTPLGVA LSGHDHAWSY VFLASHRSRG LTGETIHPDG GFSLGPPPQR N