Gene Mkms_4430 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_4430 
Symbol 
ID4612373 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp4659739 
End bp4660785 
Gene Length1047 bp 
Protein Length348 aa 
Translation table11 
GC content71% 
IMG OID639794116 
ProductAraC family transcriptional regulator 
Protein accessionYP_940411 
Protein GI119870459 
COG category[K] Transcription 
COG ID[COG2207] AraC-type DNA-binding domain-containing proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.330528 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.133415 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCACCA ATCCGGGGGT TCAGCCTCAG CCCCGCACTG CGTCACCTGC AGAGGTACCT 
CCCATAGCAA GACATCTCGA CTCCCTTGGC GTCCTGGCAC GGACCCAGGT CAAGATCATC
GATTCCGACG AGGCGGCGGC GTTCCTCGAC GACGCCTACG GCTCCCGCCT GCGGTTGTCG
CGGCTGGCGA ATCCGACCGG CGGCCCGGTG CTGACCTACA GCCGTCACGA CGCCGGCTCC
TTCACGATCG ACGACATGGC GATGGCCGGC GGGTTCACCG CGTCACCCGA CCCGCTGCAC
AAGGTGCTCG CGGTGTGGGC GAACCGGGGC CGGATCGCAG GCCGGTGCGC CGGTATCGGC
GGCCTGGCCC GCGCGGGCGA GGTCGCGCTG ATGGCCCAGC CGGACCTCCC GCACGATGCC
GAAGCCGAGG ACGTCGCGCT CACGACGGTG CTGCTCGATC CGGCGCTGGT CGCGAGCCTG
GCCACCGGTG TGCCGGAGGC CGAGGCCTCG CCGATCCGGT TCTCCCTGTT CCAGCCCGTC
GACGACTCGG CCCGACAGCT CTGGCAACAG ACCGTCCACT ACGTCAAGGA GTGTGTGCTC
GCCGACGAGG CGCTCGCCAC GCCGCTGGTG CTCGGCCATG CCGCCCGGCT CCTCGCCGCG
GTGACGCTCG CGGCCTTCCC GAGCGCCTCG ACGGTCGCGT CCACCGCACA TGACCGCGAT
GCCAAACCCG TTCTCCTGCA ACGGGCGATC GGCTTCATCG AGGAGAACCT CGCCAACGAC
ATCGCCCTCG CCGACATCGC CGCGGCCGTC CACGTCTCGC CGAGAGCGGT GCAGTACATG
TTCCGCCGCC ATCTGGAGAC GACCCCGCTG CAGTACCTCC GCCGGTCGCG CCTGCACCAC
GCGCACATGG ACCTGCTGGC CGCGGACCCG GCTCGCGAGA CCGTCACACG GATCGCCGCC
CAGTGGGGGT TCGCCCACAC CGGCAGGTTC GCGGTGATGT ACCGCGAGGC CTACGGGCAG
AGCCCGCACA CCACCCTTCG CGGGTGA
 
Protein sequence
MSTNPGVQPQ PRTASPAEVP PIARHLDSLG VLARTQVKII DSDEAAAFLD DAYGSRLRLS 
RLANPTGGPV LTYSRHDAGS FTIDDMAMAG GFTASPDPLH KVLAVWANRG RIAGRCAGIG
GLARAGEVAL MAQPDLPHDA EAEDVALTTV LLDPALVASL ATGVPEAEAS PIRFSLFQPV
DDSARQLWQQ TVHYVKECVL ADEALATPLV LGHAARLLAA VTLAAFPSAS TVASTAHDRD
AKPVLLQRAI GFIEENLAND IALADIAAAV HVSPRAVQYM FRRHLETTPL QYLRRSRLHH
AHMDLLAADP ARETVTRIAA QWGFAHTGRF AVMYREAYGQ SPHTTLRG