Gene Mkms_3235 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_3235 
Symbol 
ID4611160 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp3392089 
End bp3393864 
Gene Length1776 bp 
Protein Length591 aa 
Translation table11 
GC content64% 
IMG OID639792907 
ProductUBA/THIF-type NAD/FAD binding protein 
Protein accessionYP_939219 
Protein GI119869267 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0476] Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.814108 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCAGAC GCCACCAGCC CACGCTGACG GGCTGGCAGA AGCATCTCCT CGCCGAACTC 
AAAGCGCTAG CGCAACAACG TCCCAACGAA ATCCAGGTCC GAGGAAGACC CACGCTCGAC
GCTAGCGGCG AGGCGTCGCT CCGCATCTCG CTACGCACGG CCGCCATCCC GCATCATCCG
GGTGGCCTGC AACTTCAAGA GACCGAGGAG TTCATCCTTC AGCTTCGCCC CTCTTCGTAC
TCGCTACCGG CTATCGACGT TGATCACACT CGGTTTCTCG GCTACCCCCA TGTGCTCGCG
GGTCAACGAC TATGCATCTA TTTGGATCCT TCACGGGAAT GGCAGCCGAC GCTTGGTGTG
GCCGGCCTAC TCACCCGCCT TTGGGACTGG CTCGTCGACG CGGCCGCCGG AAACTTCGAC
GCCGCCACCG CCATGTACCA CGCTGTTGGC GGAGTGCCGC ATCAGGCACA TGACACACCG
ACGATCGTTA CCCGAGAACC CGGACCGGCG AAGCGCCACC AAACGGCTCA CCTGATCGCC
CGGTCAACGC ACCGATACGA CCTGACGTAC TCGCCTGGAG CTGCCGGGCA TCGCGTACCG
GTAATTACCC TGGCCACCGC GCTGCCGTTC GGTGCCGCAT CCACATTCGC GCTACTGCTT
GCTCTCCTGG ACGACCCCTA CCTTGACCGC CTCGAAGGAC GGGCTCCCCG GATCGCACCG
CAATCGCCGG CGTTCCTCAC CGCCCTCCTG GCGAGTGCGT TACGAAATCA CCACGACGCC
GAGCAATACT TCGTCCTCGC CGTGCCGCAC CCCGCTGGAG GCCCACCCCA CCTCTTGGGC
GGACGGCTCC CCACCCCAAC GGCGAATGCG CTCCGCGAGG TCGCGCAGCA ACGGGGTGTG
GGGGTTGTTC TCGACCCCGC GAAGATCAAC ACTGAAATCC CGATTGAGTG GTGCAGGATG
TCCGACGAAC GACCCGAAGT GACAACCCGC CGCGACGACG GCCGCCCCGT GAACGGATTT
CAACGAAAGA CTGTCCACAT CTGGGGCTGC GGCGGGCTCG GATCATGGAT CGCCGAATTC
ATCGCTCGCG CAGGAGCATC GGAGATCACC GTGTGCGACC CTGGCATCGT CACCGGCGGC
TTGCTCGTCC GACAAAACTA CGTCGAAGAC GACATTGGCC GTTCCAAAGC CGAGGCACTC
GCTGGACGGC TCCGCGCGAT CCGTGATGAC CTGACGGTCA CCGTCGCAGA AGGGCACCTC
CCAGAAGACC ACACGTCATG CCTGGCAGCG GATCTCATCA TCGACGCCAC AGTGAACAAC
GGCATCACGA GCTGTCTCGA TGCGTTGGCA ACTGCGCCGA CGCGAAAGGC ATTGATCGCT
CAGGTCGCCA CAGACGCTCG CTCTGGCACG CTCGGCCTAG CCGTGCTGTG CGCCGCAAGC
GCAACAGCGA CAGTTTCCAG CATCGATCAA GACGCTGGCC GAACAATCCA GGGCGACAGC
GGACTTGAGC TCTACCACAC GCTGTGGCAA GAACCCAGCG ATGACGAACT TATACCAACC
AGGGGCTGCT CGGTCCCCAC ATTCCACGGC TCGGCAGCCG ACCTCGTAGC GGTCGCAGCC
ACACTCGTCA ACCTGATCGG AAGCCACCTC CAACAACCGG ACTCCGCGGT TTCGGGCACA
CACCTCATCG CTCTGCCGCA CGCGGCCAGC GGCCCCCGAC ACCACTTCCT CCCCGGTGTA
ACGCACCCCA TGGATCACAC AGCAGGGACA GAATGA
 
Protein sequence
MTRRHQPTLT GWQKHLLAEL KALAQQRPNE IQVRGRPTLD ASGEASLRIS LRTAAIPHHP 
GGLQLQETEE FILQLRPSSY SLPAIDVDHT RFLGYPHVLA GQRLCIYLDP SREWQPTLGV
AGLLTRLWDW LVDAAAGNFD AATAMYHAVG GVPHQAHDTP TIVTREPGPA KRHQTAHLIA
RSTHRYDLTY SPGAAGHRVP VITLATALPF GAASTFALLL ALLDDPYLDR LEGRAPRIAP
QSPAFLTALL ASALRNHHDA EQYFVLAVPH PAGGPPHLLG GRLPTPTANA LREVAQQRGV
GVVLDPAKIN TEIPIEWCRM SDERPEVTTR RDDGRPVNGF QRKTVHIWGC GGLGSWIAEF
IARAGASEIT VCDPGIVTGG LLVRQNYVED DIGRSKAEAL AGRLRAIRDD LTVTVAEGHL
PEDHTSCLAA DLIIDATVNN GITSCLDALA TAPTRKALIA QVATDARSGT LGLAVLCAAS
ATATVSSIDQ DAGRTIQGDS GLELYHTLWQ EPSDDELIPT RGCSVPTFHG SAADLVAVAA
TLVNLIGSHL QQPDSAVSGT HLIALPHAAS GPRHHFLPGV THPMDHTAGT E