Gene Mkms_1391 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_1391 
Symbol 
ID4614222 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp1494464 
End bp1495852 
Gene Length1389 bp 
Protein Length462 aa 
Translation table11 
GC content66% 
IMG OID639791066 
ProductRieske (2Fe-2S) domain-containing protein 
Protein accessionYP_937393 
Protein GI119867441 
COG category[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit 
TIGRFAM ID[TIGR03229] benzoate 1,2-dioxygenase, large subunit 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.109638 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGGCAC CGATGAGCGC ACCGACAGCC GAACCCCGCA GTCACCTCGA GACCGTCCTC 
GCCGATGCGG TGATCGACGA CCACGCGGCC GGGATCTACC GCACCAACCG GCGGATCTTC
ACCGACGAGG ACATCTTCGA GCTCGAGATG GAGCACATCT TCGAGGGCAA CTGGATCTAC
CTCGCCCACG AAAGTCAGGT CGCCGAACCG GGCGACTACT TCACCACGTA CATGGGCCGC
CAGCCCGTCG TCATCACCCG CGACAAGAAC GGGGGCCTCA ACTGCCTGGT CAATGCCTGT
GCACACCGAG GGGCGATGGT GTGCCGACGC AAGACCGACA ACCGGATGAC GCTCACCTGT
CCCTTTCACG GGTGGACCTT CCGCAACGAC GGGACCTTGC TCAAGGTCAA GGATCCCGAG
GGGGCCGGCT ACCCGGCGAC GTTCGACGTC GACGGCTCGC ACAACATGAC CAAGGTGGCC
CGGTTCGACA GCTACCGCGG ATTCCTGTTC GGCAGCCTCA ACCCGGACGT CGTCCCCCTC
CTCGAGCACC TCGGTGACAC CACCAAGGTC ATCGACATGC TCGTCGACCA GTCCCCCGAC
GGCCTGGAGG TGTTGCGCGG ATCGTCGACC TACACCTACG ACGGCAACTG GAAAGTGCAG
GCGGAGAACG GCGCCGACGG TTATCACGTC ACCGCGACGC ACTGGAACTA CGCCGCGACC
ACCTCACGGC GCAACACCGG CGAGTCCGCC AACGACACCA AGGCGCTCGA CGCCGGCAGC
TGGGGGAAGT CCGGCGGCGG CTACTGGTCC TACCCGAACG GCCACCTCTG CCTGTGGACG
TGGGCGGCCA ACCCCGAGGA CCGCCCGCTG TGGGACCGGC TCGACGACCT CAAGAGCGTC
CACGGCGCGG CCAAGGGCGA GTTCATGGTG AAGGGTTCAC GCAACCTGTG CCTGTACCCG
AATGTGTATC TGATGGACCA ATTCTCGACG CAGATCCGCC ACTTCCGGCC GATCGCGCCG
GACAAGACCG AGGTCACCAT CTACTGCATC GCCCCCAAGG GTGAGAACGC CGATGCCCGC
GCCAGGCGCA TCCGCCAGTA CGAGGACTTC TTCAACGCCT CGGGCATGGC CACCCCGGAC
GACCTCGAGG AGTTCCGCTC CTGCCAGCTG ACCTACCAGG CCACCGCCGC CCCGTGGAAC
GACATGAGCC GCGGTGCGCA GCACTGGCTG TCCGGACCCG ACGAGGTCGC CGAATCGCTG
GGGATGCACG GCGTCATCTC CGCGGGCGTG CGCAACGAGG ACGAGGGCCT CTACCCCGTC
CAGCACGGCT ACTGGCTGCA GACCATGCGT GCGGCGCTGG CCCAAAACGA GACCGGATCG
GAGAAGTGA
 
Protein sequence
MEAPMSAPTA EPRSHLETVL ADAVIDDHAA GIYRTNRRIF TDEDIFELEM EHIFEGNWIY 
LAHESQVAEP GDYFTTYMGR QPVVITRDKN GGLNCLVNAC AHRGAMVCRR KTDNRMTLTC
PFHGWTFRND GTLLKVKDPE GAGYPATFDV DGSHNMTKVA RFDSYRGFLF GSLNPDVVPL
LEHLGDTTKV IDMLVDQSPD GLEVLRGSST YTYDGNWKVQ AENGADGYHV TATHWNYAAT
TSRRNTGESA NDTKALDAGS WGKSGGGYWS YPNGHLCLWT WAANPEDRPL WDRLDDLKSV
HGAAKGEFMV KGSRNLCLYP NVYLMDQFST QIRHFRPIAP DKTEVTIYCI APKGENADAR
ARRIRQYEDF FNASGMATPD DLEEFRSCQL TYQATAAPWN DMSRGAQHWL SGPDEVAESL
GMHGVISAGV RNEDEGLYPV QHGYWLQTMR AALAQNETGS EK