Gene Mmcs_1373 details

Gene Information

Locus tag: Mmcs_1373
Symbol:
ID: 4110210
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. MCS
Kingdom: Bacteria
Replicon accession: NC_008146
Strand:
Start bp: 1490156
End bp: 1491544
Gene Length: 1389 bp
Protein Length: 462 aa
Translation table: 11
GC content: 66%
IMG OID: 638030494
Product: Rieske (2Fe-2S) region
Protein accession: YP_638541
Protein GI: 108798344
COG category: [P] Inorganic ion transport and metabolism; [R] General function prediction only
COG ID: [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit
TIGRFAM ID: [TIGR03229] benzoate 1,2-dioxygenase, large subunit
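
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based and inclusive and that the gene length includes the stop codon; the function names are hypothetical and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(length_bp: int) -> int:
    """Codon count minus one, assuming the CDS ends with a single stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length_bp(1490156, 1491544)
protein_aa = expected_protein_length_aa(length_bp)
print(length_bp, protein_aa)  # 1389 462
```

Under these assumptions the computed values agree with the listed Gene Length (1389 bp) and Protein Length (462 aa).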


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAGGCAC CGATGAGCGC ACCGACAGCC GAACCCCGCA GTCACCTCGA GACCGTCCTC 
GCCGATGCGG TGATCGACGA CCACGCGGCC GGGATCTACC GCACCAACCG GCGGATCTTC
ACCGACGAGG ACATCTTCGA GCTCGAGATG GAGCACATCT TCGAGGGCAA CTGGATCTAC
CTCGCCCACG AAAGTCAGGT CGCCGAACCG GGCGACTACT TCACCACGTA CATGGGCCGC
CAGCCCGTCG TCATCACCCG CGACAAGAAC GGGGGCCTCA ACTGCCTGGT CAATGCCTGT
GCACACCGAG GGGCGATGGT GTGCCGACGC AAGACCGACA ACCGGATGAC GCTCACCTGT
CCCTTTCACG GGTGGACCTT CCGCAACGAC GGGACCTTGC TCAAGGTCAA GGATCCCGAG
GGGGCCGGCT ACCCGGCGAC GTTCGACGTC GACGGCTCGC ACAACATGAC CAAGGTGGCC
CGGTTCGACA GCTACCGCGG ATTCCTGTTC GGCAGCCTCA ACCCGGACGT CGTCCCCCTC
CTCGAGCACC TCGGTGACAC CACCAAGGTC ATCGACATGC TCGTCGACCA GTCCCCCGAC
GGCCTGGAGG TGTTGCGCGG ATCGTCGACC TACACCTACG ACGGCAACTG GAAAGTGCAG
GCGGAGAACG GCGCCGACGG TTATCACGTC ACCGCGACGC ACTGGAACTA CGCCGCGACC
ACCTCACGGC GCAACACCGG CGAGTCCGCC AACGACACCA AGGCGCTCGA CGCCGGCAGC
TGGGGGAAGT CCGGCGGCGG CTACTGGTCC TACCCGAACG GCCACCTCTG CCTGTGGACG
TGGGCGGCCA ACCCCGAGGA CCGCCCGCTG TGGGACCGGC TCGACGACCT CAAGAGCGTC
CACGGCGCGG CCAAGGGCGA GTTCATGGTG AAGGGTTCAC GCAACCTGTG CCTGTACCCG
AATGTGTATC TGATGGACCA ATTCTCGACG CAGATCCGCC ACTTCCGGCC GATCGCGCCG
GACAAGACCG AGGTCACCAT CTACTGCATC GCCCCCAAGG GTGAGAACGC CGATGCCCGC
GCCAGGCGCA TCCGCCAGTA CGAGGACTTC TTCAACGCCT CGGGCATGGC CACCCCGGAC
GACCTCGAGG AGTTCCGCTC CTGCCAGCTG ACCTACCAGG CCACCGCCGC CCCGTGGAAC
GACATGAGCC GCGGTGCGCA GCACTGGCTG TCCGGACCCG ACGAGGTCGC CGAATCGCTG
GGGATGCACG GCGTCATCTC CGCGGGCGTG CGCAACGAGG ACGAGGGCCT CTACCCCGTC
CAGCACGGCT ACTGGCTGCA GACCATGCGT GCGGCGCTGG CCCAAAACGA GACCGGATCG
GAGAAGTGA
 
Protein sequence
MEAPMSAPTA EPRSHLETVL ADAVIDDHAA GIYRTNRRIF TDEDIFELEM EHIFEGNWIY 
LAHESQVAEP GDYFTTYMGR QPVVITRDKN GGLNCLVNAC AHRGAMVCRR KTDNRMTLTC
PFHGWTFRND GTLLKVKDPE GAGYPATFDV DGSHNMTKVA RFDSYRGFLF GSLNPDVVPL
LEHLGDTTKV IDMLVDQSPD GLEVLRGSST YTYDGNWKVQ AENGADGYHV TATHWNYAAT
TSRRNTGESA NDTKALDAGS WGKSGGGYWS YPNGHLCLWT WAANPEDRPL WDRLDDLKSV
HGAAKGEFMV KGSRNLCLYP NVYLMDQFST QIRHFRPIAP DKTEVTIYCI APKGENADAR
ARRIRQYEDF FNASGMATPD DLEEFRSCQL TYQATAAPWN DMSRGAQHWL SGPDEVAESL
GMHGVISAGV RNEDEGLYPV QHGYWLQTMR AALAQNETGS EK
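
The sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such blocks might be joined, how a GC fraction could be computed, and how the gene sequence could be translated. All function names are hypothetical. The codon table is the standard one; it is used here on the understanding that translation table 11 assigns the same amino acids to codons as the standard code and differs in its permitted start codons, which this sketch does not model.

```python
from itertools import product

# Standard codon table built in TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above.
first_line = clean_sequence(
    "ATGGAGGCAC CGATGAGCGC ACCGACAGCC GAACCCCGCA GTCACCTCGA GACCGTCCTC"
)
print(translate(first_line))  # MEAPMSAPTAEPRSHLETVL
```

The translated first line matches the first twenty residues of the protein sequence above. Passing the full cleaned gene sequence to `gc_fraction` would give a value to compare with the listed GC content.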