Gene Mkms_3501 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_3501 
Symbol 
ID4611430 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp3677166 
End bp3678914 
Gene Length1749 bp 
Protein Length582 aa 
Translation table11 
GC content65% 
IMG OID639793176 
Productcytochrome-c oxidase 
Protein accessionYP_939485 
Protein GI119869533 
COG category[C] Energy production and conversion 
COG ID[COG0843] Heme/copper-type cytochrome/quinol oxidases, subunit 1 
TIGRFAM ID[TIGR02891] cytochrome c oxidase, subunit I 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.553433 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.857188 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACAACCG AATCCGTTGT GCAGCCGACG TTGACGGCCA GGCGTCCGTT TCCCGAACGG 
CTGGGCCCCA AGGGCAACCT GATCTACAAG GTCATCACGA CCACCGATCA CAAGCTGATC
GGGATCATGT ACTGCGTCGC CTGCTTCGCA TTCTTCCTGG TCGGCGGGTT GATGGCGCTG
TTCATGCGCA CCGAACTCGC GGTCCCCGGG CTGCAGTTCC TGTCGAACGA GCAGTACAAC
CAGCTGTTCA CCATGCACGG CACCGCGATG CTGCTGTTCT ACGCGACGCC CATCGTGTTC
GGGTTCGCGA ACCTGGTGCT GCCGCTGCAG ATCGGCGCCC CGGACGTGGC GTTCCCGCGC
CTCAACGCGC TGTCGTTCTG GCTGTTCGTC TTCGGTGCGC TGATCGCGCT GGCCGGGTTC
ATCACCCCCA GTGGTGCCGC GGACTTCGGC TGGACCGCCT ATACGCCGCT GTCCGATGCG
GTGCACTCAC CCGGGCCGGG TGGTGACCTG TGGATCTTCG GCTTGGGTGT GGGCGGTCTT
GGCACCATCC TCGGCGGCGT CAACATGATC ACCACCGTGG TGTGCATGCG CGCCCCCGGC
ATGACGATGT TCCGGATGCC GATCTTCACC TGGAACATCC TGGTGACCTC GATCCTGGTG
CTGCTGATCT TCCCGCTGCT GACCGCGGCG CTGTTCGCCC TCGCCGCAGA CCGCCACCTC
GGTGCGCACA TCTACGACCC GGCCAACGGC GGTGTGCTGC TGTGGCAGCA CCTGTTCTGG
TTCTTCGGCC ACCCCGAGGT GTATGTCATC GCGCTGCCGT TCTTCGGCAT CGTCAGTGAG
ATCTTCCCGG TGTTCTCCCG CAAGCCGATC TTCGGCTACA CCACCCTGAT CTACGCCACC
ATCGCGATCG CCGCGCTGTC GGTGGCGGTG TGGGCGCACC ACATGTTCGC GACCGGAGCC
GTTCTCCTGC CGTTCTTCTC GTTCATGACG TTCCTCATCG CGGTGCCGAC GGGGATCAAG
TTCTTCAACT GGATCGGCAC GATGTGGCGG GGTCAACTCA CCTTCCAGAC ACCGATGCTG
TTCTCGGTGG GCTTCCTGTT GACGTTCCTG CTCGGTGGTC TGTCGGGGGT GCTGCTGGCC
AGCCCGCCGC TGGACTTCCA CGTCACCGAC AGCTACTTCG TCATCGCGCA CTTCCACTAC
GTGCTGTTCG GCACCATCGT GTTCGCCACC TATGCCGGGA TCTACTTCTG GTTCCCGAAG
ATGACCGGCC GACTGCTCGA CGAGAAGCTG GGCAAGCTGC ACTTCTGGTT GACCTTCATC
GGCTTCCACA CCACGTTCCT GGTGCAGCAC TGGGTCGGTG ACGAGGGTAT GCCGCGCCGC
TACGCCGACT ACCTGCCGTC GGACGGGTTC ACCACGCTCA ACATCGTCTC GACCATCGGG
GCGTTCATCC TGGGCATCTC CACGCTGCCG TTCCTGTGGA ACATCTTCCG CAGCTGGCGC
TACGGCGAGC CCGTCGTGGT CGACGACCCG TGGGGGCACG GCAACTCACT GGAGTGGGCC
ACCAGCTGCC CGCCGCCGCG GCACAACTTC ACCGAACTGC CCCGCATCCG GTCCGAGCGC
CCCGCGTTCG AGCTGCACTA CCCGCACATG GTCGAGCGGA TGCGCGCCGA GGCGCACATC
GGCCGCCACG CGACCGCCGG AGAGATCCAG GAGGGCTACG GGCCACGAAC GGAGCCCGAC
CGCAGCTGA
 
Protein sequence
MTTESVVQPT LTARRPFPER LGPKGNLIYK VITTTDHKLI GIMYCVACFA FFLVGGLMAL 
FMRTELAVPG LQFLSNEQYN QLFTMHGTAM LLFYATPIVF GFANLVLPLQ IGAPDVAFPR
LNALSFWLFV FGALIALAGF ITPSGAADFG WTAYTPLSDA VHSPGPGGDL WIFGLGVGGL
GTILGGVNMI TTVVCMRAPG MTMFRMPIFT WNILVTSILV LLIFPLLTAA LFALAADRHL
GAHIYDPANG GVLLWQHLFW FFGHPEVYVI ALPFFGIVSE IFPVFSRKPI FGYTTLIYAT
IAIAALSVAV WAHHMFATGA VLLPFFSFMT FLIAVPTGIK FFNWIGTMWR GQLTFQTPML
FSVGFLLTFL LGGLSGVLLA SPPLDFHVTD SYFVIAHFHY VLFGTIVFAT YAGIYFWFPK
MTGRLLDEKL GKLHFWLTFI GFHTTFLVQH WVGDEGMPRR YADYLPSDGF TTLNIVSTIG
AFILGISTLP FLWNIFRSWR YGEPVVVDDP WGHGNSLEWA TSCPPPRHNF TELPRIRSER
PAFELHYPHM VERMRAEAHI GRHATAGEIQ EGYGPRTEPD RS