Gene Mmcs_4408 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmcs_4408 
Symbol 
ID4113237 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. MCS 
KingdomBacteria 
Replicon accessionNC_008146 
Strand
Start bp4688213 
End bp4689829 
Gene Length1617 bp 
Protein Length538 aa 
Translation table11 
GC content72% 
IMG OID638033554 
ProductNHL repeat-containing protein 
Protein accessionYP_641568 
Protein GI108801371 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3386] Gluconolactonase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGTCGC CCACCGAGTC TTTGCAGGCC ACCCGCTACC CCGGAGCGCA GCCGACTGTC 
ACCGACGGGT GGCGCCTCGA GCGGTTGACC GCACCGAGCC GGTTGTTCGG CGCCAACGGC
CTGCGCACCG GCCCCGACGG GCGGGTCTAC GTCGCCCAGG TGACCGGCAG CCAGATCAGC
GCCCTGGACC TGGGCAGCGG GACCCTGGAG ACGGTCAGCC CGAAGGGCGG CGACATCATC
GCCCCCGACG ACGTGGCGTT CGGACCCGAC GGCGCCCTGT TCGCCACCGA GGTGATGGAC
GGTCGGGTCA GCGTGCGCGA GGCCGACGGC CGGACCCGCG TCCTGCGCGA CGATCTGCCG
TGTGCCAACG GCATCACCGT CCACCAGGGC CGGCTGTTCA TCGGTGAGTG CCGCGAAGGT
GGCCGGCTGC TGGAACTCGA CCGCAACGGC GGCATCGTCC GCACGCTCGC CGAGAACCTG
CCGTCGCCGA ACGCGATGGA GGTCGGTCCG GACGGATTGC TGTACTACCC GCTGATGACG
GCCAACGAGA TCTGGCGGGT GGATCCGAAC GGCGGGGAGC CGCAGCGGGT CGCCGGTGAC
CTCGGGGTGC CGGATGCGGT GAAGTTCGAC GCCGACGGCT TCCTCGTCTC CACTCAGGTG
GCCAGCGGCC AGGTGCTGCG CATCGACCCG CGCAGCGGTG CGCAGACGCT GCTCGCGCAG
CTGTCGCCCG GGTTGGACAA CCTGACCTTC GTCGGCGACC GGTTGTTCGT CTCGAATTTC
ACCGGCGAGA TCACCGAGAT CCTGTCCGGC GGCGAAACCC GCACCACCCT GGCCGGCGGG
CTGAACTGGC CGCTGGACCT GGCGGTCGGC GACGACGGCC GGCTCTACGT CGCCGACGGC
ACCTACTTCT ACGTCGTGGA GGCCGACGGA TCGCTGCAGA CCGTCGGGAT GTTGTTCACC
CCCGGCTATC CCGGCTTCCT GCGCGGTCTG ACCCCCGTCG GCGGCGGCGA GTTCGTGGTC
ACCACCTCCG GCGGCCAGGT GACGCGCTAC CGGCCCGCTG CCGGTGAGAG CGAGGTCCTG
GCGGACGGTT TCGACCAGCT GTACGGCATC GCGGCCACAC CGGGCGGTGC GGCCGTCTTC
GCCGAACTCG GCACCGGCCG GGTGCACTCC GTCCACGACG GCGAGGTGCA ACTGCTGGCC
GGCGATCTCC GTGAACCCGT GGGGGTGGCG ATCGCCCCGG ACGGCAACCC ACTGGTCGCC
GAGTCCGGGG CCGGGCGGGT GGTGCGCCTG GCCGGGTCGC AGGCGGTCAC CGTGGTCGAC
GGGTTGCAGC GACCGCAGGG GCTGGTGGTC GCCGACGGCG TGCTCTACAT CGTCGACGCC
GGTGCGAAGG AAGTGGTCGC GGTCGACATG AACAGCGGCG CCCGCCAGAC GATCGCCACC
GGGTTGCCCG TCGGCCCGCC GCCGGGTGTG GAACCCAAAC CACTGCGCGG CATGCCGCCG
TTCTCCGGCC CGCAGGGACC GTTCGCGGGG ATCGCCGCGG CGCCGGACGG CACGCTGTTC
GTCTCCGCCG ACGGCGACGG CAGCGTGCTG GCGTTACGAC GCACCGCCCG CACATGA
 
Protein sequence
MTSPTESLQA TRYPGAQPTV TDGWRLERLT APSRLFGANG LRTGPDGRVY VAQVTGSQIS 
ALDLGSGTLE TVSPKGGDII APDDVAFGPD GALFATEVMD GRVSVREADG RTRVLRDDLP
CANGITVHQG RLFIGECREG GRLLELDRNG GIVRTLAENL PSPNAMEVGP DGLLYYPLMT
ANEIWRVDPN GGEPQRVAGD LGVPDAVKFD ADGFLVSTQV ASGQVLRIDP RSGAQTLLAQ
LSPGLDNLTF VGDRLFVSNF TGEITEILSG GETRTTLAGG LNWPLDLAVG DDGRLYVADG
TYFYVVEADG SLQTVGMLFT PGYPGFLRGL TPVGGGEFVV TTSGGQVTRY RPAAGESEVL
ADGFDQLYGI AATPGGAAVF AELGTGRVHS VHDGEVQLLA GDLREPVGVA IAPDGNPLVA
ESGAGRVVRL AGSQAVTVVD GLQRPQGLVV ADGVLYIVDA GAKEVVAVDM NSGARQTIAT
GLPVGPPPGV EPKPLRGMPP FSGPQGPFAG IAAAPDGTLF VSADGDGSVL ALRRTART