Gene Mkms_1738 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_1738 
Symbol 
ID4613845 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp1851259 
End bp1852368 
Gene Length1110 bp 
Protein Length369 aa 
Translation table11 
GC content61% 
IMG OID639791404 
ProductAraC family transcriptional regulator 
Protein accessionYP_937730 
Protein GI119867778 
COG category[K] Transcription 
COG ID[COG2207] AraC-type DNA-binding domain-containing proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.167182 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACGCAGG ACCCTGAGTC GGAGGCCAGA CCTTCCGCGT TGGTGTTCGA CGGAGCGACA 
GATTCTGTCG GCGAACTGCG GTTACCGCAG CGAGAAAGTG AGTTGTTACC AATGGCGAAT
GGACCGAAGA TGGCAGGATC GTACTTCAGC CGACGCGAGA CCGTATGCAC CGAACAGTTG
AGCGCGGCCG AGCACCTCGC TTCGGAAATT TTCACACCGC ACCGTCTCCG CTACTCCCGT
CGCGGCCAGC GTTTGAATGC GCGGCTCAGC GCCGCACGAA TCGGTGCTGT GACCGTTGGC
TATCTTCGGT ACGGTGCAGA TGTCGAGCTC GTCAATTCGA CTGCGCTCGA TGATTACCAC
ATCAACGTTC CCCTCACCAG CCACGCCGAT TCGTGGTGCG GCCCAGCCAC GGCCCAAGCG
ACGCCCACGC AGGCGGCGGT TTTCCTCCCA GGCCGTCCTG CCGGCATCAA GTGGGCCGCC
GATTGTGGTC AGCTGTGCGT GAAGTTCAGC CGCACGGATC TGGAATACGA ACTCGAGGGC
TTGCTTGGAC GACCTGCGGT TAGGCCGCTG AATATCGCGC ATAGCATGGA CCTGACGGCG
GACAGCAGTC GTGCGTGGCG TGCGCTGCTA TCGGTCTTAT ATCGGGAGGT CTCTCGCCCT
GAGAGCATCG TGCAACATCC GATGACCGGG CGACATTTCG AACAGTTGCT CATACATGGG
TTTCTACTCG CCCAACCTCA CTATCAGGCC ATGGCGCTCG CCGCCGACCA ACACGCAGCG
CCGCCGCCCA AAGTACTCAG GACAGCTCTC GACCTAATCG AGGAGCATCC CGAACGGAGT
TGGACGACAA CCGATCTTGC TCGCGAGGTT GGTATCAGTG TCCGCGCTCT TCAAGAGAGC
TTCAAGCGGC ACGTCGGGCT TCCTCCGCTC ACGTATCTGC GCGAAGTCCG GCTGATCCGT
GCGCATGCAG AACTGGCGGC GTCGTCTGAG AGTGCCGTGA CTGTTGCCCG CGTAGCGATG
AAGTGGGGCT TTGGTCATCT GGGCCGCTTC AGCGCGGAGT ATCGGCGCAA GTTCGGCGTC
ACTCCCCAAC ACACCTTGCG CGCTACCTGA
 
Protein sequence
MTQDPESEAR PSALVFDGAT DSVGELRLPQ RESELLPMAN GPKMAGSYFS RRETVCTEQL 
SAAEHLASEI FTPHRLRYSR RGQRLNARLS AARIGAVTVG YLRYGADVEL VNSTALDDYH
INVPLTSHAD SWCGPATAQA TPTQAAVFLP GRPAGIKWAA DCGQLCVKFS RTDLEYELEG
LLGRPAVRPL NIAHSMDLTA DSSRAWRALL SVLYREVSRP ESIVQHPMTG RHFEQLLIHG
FLLAQPHYQA MALAADQHAA PPPKVLRTAL DLIEEHPERS WTTTDLAREV GISVRALQES
FKRHVGLPPL TYLREVRLIR AHAELAASSE SAVTVARVAM KWGFGHLGRF SAEYRRKFGV
TPQHTLRAT