Gene Mjls_1407 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMjls_1407 
Symbol 
ID4877143 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. JLS 
KingdomBacteria 
Replicon accessionNC_009077 
Strand
Start bp1510307 
End bp1511695 
Gene Length1389 bp 
Protein Length462 aa 
Translation table11 
GC content66% 
IMG OID640138715 
ProductRieske (2Fe-2S) domain-containing protein 
Protein accessionYP_001069700 
Protein GI126434009 
COG category[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit 
TIGRFAM ID[TIGR03229] benzoate 1,2-dioxygenase, large subunit 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0632088 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGGCAC CGATGAGCGC ACCGACAGCC GAACCCCGCA GTCACCTCGA GACCGTCCTG 
GCCGATGCGG TGATCGACGA CCACGCGGCC GGGATCTACC GCACCAACCG GCGGATCTTC
ACCGACGAGG ACATCTTCGA ACTCGAGATG AAGCACATCT TCGAGGGCAA CTGGATCTAC
CTCGCCCACG AAAGTCAGGT CGCCGAACCG GTCGACTACT TCACCACGTA CATGGGCCGC
CAGCCCGTCG TCATCACCCG CGACAAGAAC GGGGGCCTCA ACTGCCTGGT CAATGCCTGC
GCACACCGCG GGGCAATGGT GTGCCGACGC AAGACCGACA ACCGGATGAC GCTCACCTGT
CCCTTTCACG GGTGGACCTT CCGCAACGAC GGGACCTTGC TCAAGGTCAA GGATCCCGAG
GGGGCCGGCT ACCCGGCGAC GTTCGACGTC GACGGCTCGC ACAACATGAC CAAGGTGGCC
CGGTTCGACA GCTACCGCGG ATTCCTGTTC GGCAGCCTCA ACCCGGACGT CGTCCCGCTC
CTCGAGCACC TCGGTGACAC CACCAAGGTC ATTGACATGC TCGTCGACCA GTCCCCCGAC
GGCCTGGAAG TGTTGCGCGG ATCGTCGACC TACACCTACG ACGGCAACTG GAAAGTACAG
GCGGAGAACG GCGCCGACGG TTATCACGTC ACCGCGACGC ACTGGAACTA CGCCGCGACC
ACCTCACGGC GCAACACGGG CGAGTCCGCC AACGACACCA AGGCGCTCGA CGCCGGCAGC
TGGGGGAAGT CCGGCGGCGG CTACTGGTCC TACCCGAACG GCCACCTCTG CCTGTGGACA
TGGGCGGCCA ACCCCGAGGA CCGCCCGCTG TGGGACCGGC TCGACGACCT CAAGAGCGTC
CACGGCGCGG CCAAGGGCGA GTTCATGGTG AAGGGTTCAC GCAACCTGTG CCTGTACCCG
AATGTGTATC TGATGGACCA ATTCTCGACG CAGATCCGCC ACTTCCGGCC GATCGCGCCG
GACAAGACCG AGGTCACCAT CTACTGCATC GCCCCCAAGG GTGAGAACGC CGATGCCCGC
GCCAGGCGCA TCCGCCAGTA CGAGGATTTC TTCAACGCCT CGGGCATGGC CACCCCGGAC
GACCTCGAGG AGTTCCGCTC CTGCCAGCTG ACCTACCAGG CCACCGCCGC CCCGTGGAAC
GACATGAGCC GCGGTGCGCA GCACTGGCTG TCCGGACCCG ACGAGGTCGC CGAATCACTG
GGGATGCACG GCGTCATCTC CGCGGGCGTG CGCAACGAGG ACGAGGGCCT CTACCCCGTC
CAGCACGGCT ACTGGCTGCA GACCATGCGT GCGGCGCTGG CCCACAACGA GACCGGATCG
GAGAAGTGA
 
Protein sequence
MEAPMSAPTA EPRSHLETVL ADAVIDDHAA GIYRTNRRIF TDEDIFELEM KHIFEGNWIY 
LAHESQVAEP VDYFTTYMGR QPVVITRDKN GGLNCLVNAC AHRGAMVCRR KTDNRMTLTC
PFHGWTFRND GTLLKVKDPE GAGYPATFDV DGSHNMTKVA RFDSYRGFLF GSLNPDVVPL
LEHLGDTTKV IDMLVDQSPD GLEVLRGSST YTYDGNWKVQ AENGADGYHV TATHWNYAAT
TSRRNTGESA NDTKALDAGS WGKSGGGYWS YPNGHLCLWT WAANPEDRPL WDRLDDLKSV
HGAAKGEFMV KGSRNLCLYP NVYLMDQFST QIRHFRPIAP DKTEVTIYCI APKGENADAR
ARRIRQYEDF FNASGMATPD DLEEFRSCQL TYQATAAPWN DMSRGAQHWL SGPDEVAESL
GMHGVISAGV RNEDEGLYPV QHGYWLQTMR AALAHNETGS EK