Gene Mmcs_5190 details

Gene Information

Locus tag: Mmcs_5190
Symbol:
ID: 4114019
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. MCS
Kingdom: Bacteria
Replicon accession: NC_008146
Strand:
Start bp: 5476778
End bp: 5478058
Gene Length: 1281 bp
Protein Length: 426 aa
Translation table: 11
GC content: 68%
IMG OID: 638034348
Product: hypothetical protein
Protein accession: YP_642350
Protein GI: 108802153
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0807] GTP cyclohydrolase II
TIGRFAM ID:
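
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a stop codon (the record lists "Is gene spliced: No"). The function names are hypothetical and not part of the database.

```python
# Values copied from the Gene Information table.
start_bp = 5476778
end_bp = 5478058
gene_length_bp = 1281
protein_length_aa = 426


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp) == gene_length_bp)              # True
print(expected_protein_length(gene_length_bp) == protein_length_aa)  # True
```

Under these assumptions, 5478058 - 5476778 + 1 gives 1281 bp, and 1281 / 3 gives 427 codons, one of which is the stop codon, leaving 426 residues.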


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.150921
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTGGCGC CTTTCCCCCT GAGGACGGGA GGCGCCATGT CGACTGGGAG CGGACACATC 
CGGTTGACGT CGCACAGCGG TGCGGTGGGC ACCCCGACGA TCCACTGGGG TGCGCCCACC
GCCGCCGAGC GCGGGCCGGT GATCGGGACG ACCGCCAACC GCTCGCACCG CAACGTCATC
GGCACCCACA GCGGGTCCTA CAGCGTCTAC CGGGCACTGG CCGTCGCCGC GGGTGCGCTC
AAACGGGAGC ACCGCGCCGA CCTCACCGAC ACCTCGCCGA CCGACACCAT CGGCCCATAC
CCGCAGTGGG GTGAACCGGC CACGATCGTC AGCATGGACC CGTGGGGCGC CAGTGTGGCC
GACGTGTTCA CCGCCGAACT CGCTGCGGGC CTCGACATCC GGCCGACCAT CGCGATCACC
AAGGCGCATG TGATCCTGCC CGAGATCACC GACGCGATCG CCAAGGGCCG CCTGGTCCCC
GACGGGCGCG TCCTGCTCGC CAGCGGCGCC GCGCTGGTCA CTAAGGCCGC CGTCGAACCC
GTCTGGTGGC TGCCCGGGGT GGCGCAGCGG TTCGGGTGCA GCGAGACTGA TCTGCGCCGG
GTGCTGTTCG AGGAGACCGG CGGGATGTAC CCGGAACTCG TCACCCGCTC CGACCTCGAG
GTGTTCCTCC CGCCGATCGG CGGGCAGACC GTCTACATCT TCGGCAGCGC AACCGATCTC
GCCGATCCGT CGGTCGAGTT GACCGCCCGC GTACACGACG AGTGCAACGG CTCGGACGTC
TTCGGGTCCG ACATCTGCAC GTGCAGGCCG TATCTCACGC ATGCGATCGA GGAGTGCATC
CGCGGCGCCC AGAACGGTGG CGTCGGCCTG GTCGCCTACT CCCGCAAGGA GGGCCGCGCG
CTGGGTGAGG TCACCAAGTT CCTGGTGTAC AACGCCCGCA AGCGTCAGGT CGGCGGCGAC
ACCGCCGATC AGTACTTCGC GCGCACCGAA TGTGTGGCCG GCGTCCAGGA CATGCGCTTC
CAGGAACTGA TGCCCGACGT GCTGCACTGG TTCGGTATCC GCAAGATCCA CCGCCTGGTG
TCGATGAGCA ACATGAAGTA CGACGCGATC ACCGGGTCGG GAATCGAAGT GGGAGAACGG
GTCAACATCC CCGAGGAACT CATCCCTCCG GATGCGCGGG TGGAGATCGA CGCCAAGATG
GCGGCCGGTT ACTTCACCCC GGGCCCGGTG CCCGACGCCG AGAAACTCAA GCAGGTCAAG
GGTCGCGGGT TGTCGCAGTG A
 
Protein sequence
MLAPFPLRTG GAMSTGSGHI RLTSHSGAVG TPTIHWGAPT AAERGPVIGT TANRSHRNVI 
GTHSGSYSVY RALAVAAGAL KREHRADLTD TSPTDTIGPY PQWGEPATIV SMDPWGASVA
DVFTAELAAG LDIRPTIAIT KAHVILPEIT DAIAKGRLVP DGRVLLASGA ALVTKAAVEP
VWWLPGVAQR FGCSETDLRR VLFEETGGMY PELVTRSDLE VFLPPIGGQT VYIFGSATDL
ADPSVELTAR VHDECNGSDV FGSDICTCRP YLTHAIEECI RGAQNGGVGL VAYSRKEGRA
LGEVTKFLVY NARKRQVGGD TADQYFARTE CVAGVQDMRF QELMPDVLHW FGIRKIHRLV
SMSNMKYDAI TGSGIEVGER VNIPEELIPP DARVEIDAKM AAGYFTPGPV PDAEKLKQVK
GRGLSQ
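
As a rough way to relate the gene sequence to the protein sequence, the sketch below strips the 10-base display spacing, computes GC content, and translates codons until the first stop. It assumes that translation table 11 assigns amino acids to codons exactly as the standard code does (the two tables differ in which start codons are allowed, which this sketch does not model). The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; upper-case the rest."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 21 bases of the gene sequence, as displayed.
first_block = "ATGCTGGCGC CTTTCCCCCT GAGGACGGGA"
last_block = "GGTCGCGGGT TGTCGCAGTG A"
print(translate(first_block))  # MLAPFPLRTG
print(translate(last_block))   # GRGLSQ
```

The two printed fragments match the start and end of the protein sequence listed above. Passing the full gene sequence to `translate` and `gc_content` would extend the same check to the whole record.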