Gene Mkms_4454 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_4454 
Symbol 
ID4612397 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp4686129 
End bp4687190 
Gene Length1062 bp 
Protein Length353 aa 
Translation table11 
GC content72% 
IMG OID639794140 
Producthypothetical protein 
Protein accessionYP_940435 
Protein GI119870483 
COG category[R] General function prediction only 
COG ID[COG4195] Phage-related replication protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.159304 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.0629243 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGCGCC CTGGTCGACA CGCCTACTTC GCGTACGGGT CCAACCTGTG CGTGCAGCAG 
ATGGCGCAGC GCTGCCCCGA CGCCGCCGAC CCGCGTCCGG CCACCCTCGC CGACCACGAC
TGGCTGATCA ACGAACGCGG TGTCGCCACG GTCGAACCGG TCGACGGCGC ACAGGTGCAC
GGCGTGCTGT GGCAGGTGTC CGATCACGAC CTCGCGACAC TCGACAGCGC CGAGGGGGTT
CCGGTGCGGT ACCGCCGCGA CCGGCTCACC GTGCAGACGG ACGACGGTCC GGCACCGGCG
TGGGTGTACA TCGACCACCG CATCGAACCG GGCCCGCCGC GACCCGGCTA CCTGGAACGC
ATCATCGACG GCGCACTGCA TCACGGGCTG CCGCACCGCT GGGTGGAGTT CCTGCGGCGG
TGGGATCCGA TGCACTGGCC GCACCGTCCG CACCATGCGG ACACCAAAGG TCCCAAATCG
CTTTCGGAGC TGCTCACCGA CCCCGCGGTC ACCGAACACA GTGTGCTGCG GTCACGCTTC
GGGTTCCTGG CGATCCACGG CGGCGGGCTG GAGCAGATGA CCGACGTCAT CGCCGAACGC
GCCGCCGAAG CCGCCGACGC GTCGGTGTAC GTGGTGCGCC ATCCCGAGCA GTACCCCCAC
CATCTGCCGT CGGCGCGGTA CCTGGCCGCG GAGTCGGCGC GACTCGCGGA GTTCCTCGAC
CACGTCGAGG TGGCCGTCTC ACTGCACGGC TACGGCCGCA TCGGGCGCAG CACCGAACTG
CTGGCCGGCG GCGGCAACCG GGCCCTGGCC GCGCACCTCG CCGCGCACGT CGAGATCCCC
GGCTACCGGA TCGTCACCGA CCTCGACGAC ATCCCGCCGG AGTTGCGGGG GCTGCACGCC
GACAACCCGG TCAACCGGGT GCGCGGCGGC GGCGCCCAAC TCGAACTGAC CTCCCGGGTG
CGCGGCCTGA GCCCGCGCAG CCCGCTGCCG GGCGACGACG GTCTCTCCCC CGCCACCTCG
GCGCTGGTGC AGGGGCTGGT CGCGGCCGCA AGATCCTGGT GA
 
Protein sequence
MLRPGRHAYF AYGSNLCVQQ MAQRCPDAAD PRPATLADHD WLINERGVAT VEPVDGAQVH 
GVLWQVSDHD LATLDSAEGV PVRYRRDRLT VQTDDGPAPA WVYIDHRIEP GPPRPGYLER
IIDGALHHGL PHRWVEFLRR WDPMHWPHRP HHADTKGPKS LSELLTDPAV TEHSVLRSRF
GFLAIHGGGL EQMTDVIAER AAEAADASVY VVRHPEQYPH HLPSARYLAA ESARLAEFLD
HVEVAVSLHG YGRIGRSTEL LAGGGNRALA AHLAAHVEIP GYRIVTDLDD IPPELRGLHA
DNPVNRVRGG GAQLELTSRV RGLSPRSPLP GDDGLSPATS ALVQGLVAAA RSW