Gene Mkms_5531 details


Gene Information

Locus tag: Mkms_5531
Symbol:
ID: 4610271
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. KMS
Kingdom: Bacteria
Replicon accession: NC_008703
Strand:
Start bp: 37627
End bp: 38865
Gene Length: 1239 bp
Protein Length: 412 aa
Translation table: 11
GC content: 71%
IMG OID: 639789196
Product: hypothetical protein
Protein accession: YP_935531
Protein GI: 119854926
COG category:
COG ID:
TIGRFAM ID:
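
The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is an illustrative check only: the function names are hypothetical, and it assumes 1-based inclusive coordinates and a terminal stop codon that is not counted in the protein length.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    # The stop codon terminates translation and encodes no residue.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(37627, 38865)
print(length)                     # 1239
print(protein_length_aa(length))  # 412
```

Both results match the reported Gene Length (1239 bp) and Protein Length (412 aa).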


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.219128
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00000520747
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGCGGGGGT TACGGCGTGA AGGGGTGGGG GCTGTCGTGG TGCAGGCAGG CGAGGAGCTC 
GGCGGTGCCG GTGTGGTCGG GGCTGCCGTC GACGGCGTCG CCGGTCTGGA TGATGGCGGC
GATGTCGGCT CCGAGGTGCA GTGTGTAGCG CGGCCATTCG TCATCGCCGG CGCGGGCGAT
CAGGGCCGCG GTGGCGGCCG TGAGCCGGAT GCGTGCGTGG GCGGGGTTGT TGCCGAGACC
GATGATGGTC TGCTTGCTGC GGTGGCCGCT GCTGGTCACT GCCCAGGTGA TGGTCATGTT
CCGATGATCG CCGCCAGGGG GCGGTGCCGG GAGGGGCCGC ACTCCCGGTT GTGGATTAAG
TGCGGCCCCT GTGGATCACC GGGTCGGGCG GTAGCTGGCG TCGGTGGGGT CGAAGCGGTG
TTGGTCTCCG TCGAGGCAGA CAGCGGCGGC TGTTTTGTGG GCGAGTTGGA GGACTTCGAC
AGCGCTGCGG TAGGCGGCGA GGTCGACGAT CTGGAATGTC ACGGGCCGCA TGTAGACCTC
GACCCAGCGG ATGGTGCGTC GCTTGTCGGG GGTGAGGGCT TCGCGGCGGG TGACGGCGTA
GGGCGGGCGT CGGGTCCAGT CGACGCCGAT GGCGGGCCCG TCGTAGGGCT GTTGGTCGCG
CGGGGCGAGG TTGTCGGGGA GCCCGGCCAG CGCCTGGCGG GCGGCTCCGA AGGCTTCCAG
CACGCCCTGG GCTGCTTGGG TGTTGAAGAA GGTGAACAGG ATGCGTCCCC AGGTGAGGGT
GACGCGGGCG ATGTCGGTGT GGGCGTGATG GACGCTGATG TCGGGCTGTT GGGCGCCGCG
CAGTCGTACG TAGGCCTGGG AGATCGCTCC GGCGGGGATG CCGATGTGCG ACTCGGGGAT
GGCGATCTTG GTCATGGTTG TTCTCCGTTC GTGTACGGGT TGGGTGCGCC GGTTAGCGCT
GGGGTTGCGG GGGACCGGCG GGGTGGCGCG CGCAACCTCG AAGAGGCCGT GACGGGCGGA
GCAGGGTTTT CGGCCCGCGC GAGCCGGCGC GCAGCGCCCT GGCGAGCTGG CGTTGGGCCG
AAAAGGTCAC TCGGCCTGCG CAGTTCGGCG CGGACGTGGG TTGCGCGTGA CGGGGCCTCG
TCGGTCACAC TGGGCGACCC AGCTGAACGG TTAAAGGTGA GCAGCTGCCC AACCCACGCC
CGCCGCAGGC GGGCATCGGC GCCTCCGGGC GCCGCTTGA
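
The gene sequence above is printed in space-separated blocks of ten bases. A minimal sketch for recomputing the GC content from such a listing follows; the function name is hypothetical, and the result may differ slightly from the reported 71% depending on how the source database rounds.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = bases.count("G") + bases.count("C")
    return 100.0 * gc / len(bases)


# Works directly on the block-formatted text, e.g. the first row above:
first_row = "GTGCGGGGGT TACGGCGTGA AGGGGTGGGG GCTGTCGTGG TGCAGGCAGG CGAGGAGCTC"
print(round(gc_percent(first_row), 1))
```

Passing the full gene sequence instead of a single row gives the value to compare against the GC content field.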
 
Protein sequence
MRGLRREGVG AVVVQAGEEL GGAGVVGAAV DGVAGLDDGG DVGSEVQCVA RPFVIAGAGD 
QGRGGGREPD ACVGGVVAET DDGLLAAVAA AGHCPGDGHV PMIAARGRCR EGPHSRLWIK
CGPCGSPGRA VAGVGGVEAV LVSVEADSGG CFVGELEDFD SAAVGGEVDD LECHGPHVDL
DPADGASLVG GEGFAAGDGV GRASGPVDAD GGPVVGLLVA RGEVVGEPGQ RLAGGSEGFQ
HALGCLGVEE GEQDASPGEG DAGDVGVGVM DADVGLLGAA QSYVGLGDRS GGDADVRLGD
GDLGHGCSPF VYGLGAPVSA GVAGDRRGGA RNLEEAVTGG AGFSARASRR AAPWRAGVGP
KRSLGLRSSA RTWVARDGAS SVTLGDPAER LKVSSCPTHA RRRRASAPPG AA
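
The protein sequence can be reproduced from the gene sequence with translation table 11. The sketch below is one way to do this without external libraries. It assumes that table 11 shares the standard codon assignments and differs only in its set of permitted start codons, which are translated as M in the first position; the helper names are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order (also used by table 11).
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumed initiator codons for table 11; each is read as M at position 1.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_table11(sequence: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop."""
    dna = "".join(sequence.split()).upper()
    residues = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            residues.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence give the first block of the protein.
print(translate_table11("GTGCGGGGGT TACGGCGTGA AGGGGTGGGG"))  # MRGLRREGVG
```

The GTG at the start of the gene is read as M rather than V, which is why the protein begins with M even though the sequence does not start with ATG.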