Gene Mmcs_4344 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmcs_4344 
Symbol 
ID4113174 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. MCS 
KingdomBacteria 
Replicon accessionNC_008146 
Strand
Start bp4620534 
End bp4621580 
Gene Length1047 bp 
Protein Length348 aa 
Translation table11 
GC content71% 
IMG OID638033490 
ProductAraC family transcriptional regulator 
Protein accessionYP_641505 
Protein GI108801308 
COG category[K] Transcription 
COG ID[COG2207] AraC-type DNA-binding domain-containing proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.309673 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGCACCA ATCCGGGGGT TCAGCCTCAG CCCCGCACTG CGTCACCTGC AGAGGTACCT 
CCCATAGCAA GACATCTCGA CTCCCTTGGC GTCCTGGCAC GGACCCAGGT CAAGATCATC
GATTCCGACG AGGCGGCGGC GTTCCTCGAC GACGCCTACG GCTCCCGCCT GCGGTTGTCG
CGGCTGGCGA ATCCGACCGG CGGCCCGGTG CTGACCTACA GCCGTCACGA CGCCGGCTCC
TTCACGATCG ACGACATGGC GATGGCCGGC GGGTTCACCG CGTCACCCGA CCCGCTGCAC
AAGGTGCTCG CGGTGTGGGC GAACCGGGGC CGGATCGCAG GCCGGTGCGC CGGTATCGGC
GGCCTGGCCC GCGCGGGCGA GGTCGCGCTG ATGGCCCAGC CGGACCTCCC GCACGATGCC
GAAGCCGAGG ACGTCGCGCT CACGACGGTG CTGCTCGATC CGGCGCTGGT CGCGAGCCTG
GCCACCGGTG TGCCGGAGGC CGAGGCCTCG CCGATCCGGT TCTCCCTGTT CCAGCCCGTC
GACGACTCGG CCCGACAGCT CTGGCAACAG ACCGTCCACT ACGTCAAGGA GTGTGTGCTC
GCCGACGAGG CGCTCGCCAC GCCGCTGGTG CTCGGCCATG CCGCCCGGCT CCTCGCCGCG
GTGACGCTCG CGGCCTTCCC GAGCGCCTCG ACGGTCGCGT CCACCGCACA TGACCGCGAT
GCCAAACCCG TTCTCCTGCA ACGGGCGATC GGCTTCATCG AGGAGAACCT CGCCAACGAC
ATCGCCCTCG CCGACATCGC CGCGGCCGTC CACGTCTCGC CGAGAGCGGT GCAGTACATG
TTCCGCCGCC ATCTGGAGAC GACCCCGCTG CAGTACCTCC GCCGGTCGCG CCTGCACCAC
GCGCACATGG ACCTGCTGGC CGCGGACCCG GCTCGCGAGA CCGTCACACG GATCGCCGCC
CAGTGGGGGT TCGCCCACAC CGGCAGGTTC GCGGTGATGT ACCGCGAGGC CTACGGGCAG
AGCCCGCACA CCACCCTTCG CGGGTGA
 
Protein sequence
MSTNPGVQPQ PRTASPAEVP PIARHLDSLG VLARTQVKII DSDEAAAFLD DAYGSRLRLS 
RLANPTGGPV LTYSRHDAGS FTIDDMAMAG GFTASPDPLH KVLAVWANRG RIAGRCAGIG
GLARAGEVAL MAQPDLPHDA EAEDVALTTV LLDPALVASL ATGVPEAEAS PIRFSLFQPV
DDSARQLWQQ TVHYVKECVL ADEALATPLV LGHAARLLAA VTLAAFPSAS TVASTAHDRD
AKPVLLQRAI GFIEENLAND IALADIAAAV HVSPRAVQYM FRRHLETTPL QYLRRSRLHH
AHMDLLAADP ARETVTRIAA QWGFAHTGRF AVMYREAYGQ SPHTTLRG