Gene Mkms_3783 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_3783 
Symbol 
ID4611718 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp3997190 
End bp3998467 
Gene Length1278 bp 
Protein Length425 aa 
Translation table11 
GC content64% 
IMG OID639793463 
ProductRieske (2Fe-2S) domain-containing protein 
Protein accessionYP_939766 
Protein GI119869814 
COG category[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0573009 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.423928 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGGCGCT GGCCTAAACC GCCGGAAGGC AGTTGGACGG AGCACTATCC CGAACTCGGC 
ACCGGACCGA TCTCGTTCCG CGATTCCGTC TCGCCGGAGT TCTACGAACT CGAGCGCGAG
GCGGTGTTCA AACGGGCCTG GCTCAACGTC GGCCGGGTCG AGGAACTGCC GCGGGTCGGC
AGTTACCTCA CCAAGCAGAT CGACGTCGCA GGTGTCTCGG TGATCGTGGT GAAGGGCCGC
GACGAGCAGA TCCGCGCCTT CTACAACATC TGCCGCCACC GCGGGAACAA ACTGGTGTGG
AACGACTTTC CGGGTGAGGA GGTCAAAGGC ACCTGCCGGC AGTTCACGTG CAAATACCAC
GGCTGGCGCT ACGGCCTCGA CGGTGCGCTG AAGTTCGTCC AGCAACCCGG GGAGTTCTTC
GACCTCGACA CCGAGAAGCT GGGACTCGCA CCCGTGCAGT GCGACGTGTG GAACGGGTTC
ATCTTCATCA ACCTCGATCC CGAACCGCGG TGGAGCCTGC GCGAGTTCCT CGGCCCGATG
ATCACCGCGC TCGACGACTA CCCGTTCGAG TTGATGACCG AGCGTTACGA GTTCGAGGCG
CACAACAACA GCAACTGGAA AATCTTCGCC GACGCGTTCC AGGAGTACTA CCACGTGCCG
TCGCTGCATT CGCAGCAGGT GCCCAGCGCG GTGCGCCAAC CCAATGCCAC GTTCGAGTGC
GGGCACTTCC AGATCGACGG TCCGCACCGC CTGGTGTCCA CCGCGGGTAC CCGGCGGTGG
CTGCTCGACC CGGAATTCAT GTACCCCGTC GAACGGATCA CCCGCAGCGG GCTCGTGGGG
CCATGGCGGA CTCCGGAGAC CCTCCAGTCG GCCGGTCTGA ACCCGGGCGG TATCGAGCCA
TGGGGTATCA CCAACTTCCA GATCTTCCCG AACCTGGAGA TCCTCATCTA CCACGGCTGG
TACCTGCTGT ACCGGTACTG GCCGACCTCG CACAACACCC ACAAGTTCGA GGCGTACAAC
GCCTTCCATC CGGCCCGCAC GGTGCGCGAG CGGATCGAAC ACGAGGTCGC CTCCGTCGTG
CTCAAGGAGT TCGCGCTGCA GGACGCCGGC ATGCTCGGCG GCACGCAGGC GGCGCTGGAG
TACGGCCTGG ACGAGCCGAT AATCGACGAC TATCCGCTCA GCGATCAGGA GATTCTGGTG
CGCCATCTGC ACCACGAGGC TGTCAAGTGG GTCGACGACT ACCGGGCCGA ACGCGCACCG
GTGGGGGTGC GAGCATGA
 
Protein sequence
MGRWPKPPEG SWTEHYPELG TGPISFRDSV SPEFYELERE AVFKRAWLNV GRVEELPRVG 
SYLTKQIDVA GVSVIVVKGR DEQIRAFYNI CRHRGNKLVW NDFPGEEVKG TCRQFTCKYH
GWRYGLDGAL KFVQQPGEFF DLDTEKLGLA PVQCDVWNGF IFINLDPEPR WSLREFLGPM
ITALDDYPFE LMTERYEFEA HNNSNWKIFA DAFQEYYHVP SLHSQQVPSA VRQPNATFEC
GHFQIDGPHR LVSTAGTRRW LLDPEFMYPV ERITRSGLVG PWRTPETLQS AGLNPGGIEP
WGITNFQIFP NLEILIYHGW YLLYRYWPTS HNTHKFEAYN AFHPARTVRE RIEHEVASVV
LKEFALQDAG MLGGTQAALE YGLDEPIIDD YPLSDQEILV RHLHHEAVKW VDDYRAERAP
VGVRA