Gene Mmcs_1302 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmcs_1302 
Symbol 
ID4110139 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. MCS 
KingdomBacteria 
Replicon accessionNC_008146 
Strand
Start bp1407129 
End bp1408466 
Gene Length1338 bp 
Protein Length445 aa 
Translation table11 
GC content68% 
IMG OID638030423 
Producthypothetical protein 
Protein accessionYP_638470 
Protein GI108798273 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG1232] Protoporphyrinogen oxidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCACAC CGGTCGAGCC GGGGCCCGTC GTGGTCATCG GCGGCGGGAT ATCGGGTCTG 
ACGGCGGCCT ACCGGCTGGC CCGCGCGTCG ATTCCGGTCC GCCTCGTCGA AAGTGCCGGA
CAGCTCGGCG GATTGGGTAT AGGTGGCGAG ATCGGCGGCG TCGAGATCGA ACGCTTCTAC
CACTGTGTGA TGCCCACCGA CGAACATCTC CTTCCCCTGC TCGAGGAGCT CGGGCTCGCC
GCGGACATCG GCTGGCAGCA GACGACGATG GGTATGAACA TCGACGGCCG CCGATATCCC
TTCAACAGCG CCGTGGACCT GCTGCGGTTC ACCCCGCTGC GGTTCACCCA GCGGATCCGG
TTCGGAGCGG TCTCGCTGCT GTTGCGCCGG CTCGGGCGCG GGAAGGACCT CGACAACACC
AGAACCGAGG ACTGGCTGCG CGGCGTGTAC GGCCCCACCG TGTGGCAGCG CCTGCTCAGA
CCGTTGTTCG GCGCGAAGTT CGGTGACGCC TTCGGTGAGG TCCCCGCCCG CTACCTCTAT
CAGCGCCTCG GCCGGGAGAG CAACGTCGCC ACCCGCGGCT ATCCGCTGGG TACGTACCGC
TCGATCGTCG ACCGCTTGAG GGAGTCCATC GAATCCGACG GCGGACGTGT CGATTTGGGC
GTGGGGGTGC AGAGCGTGAC GGCCGACGGC GACGGCGCGA CCGTTCGTCT GGACACCGGC
GAGGAGATCG CCGCGCAATC GGTCGTGTCG ACGGTGCCGA TCCCGCTGCT GCGCTCGCTG
TCGGCCGAGC CGCTGCGGGC CGAGCTGCCC GAGATCCGAC TCGACTACCA GGGCGTGGTG
AACGCGGTCT TCCTGCTCGA CCGGCCACTC GACGGGCACT ATTGGGCGCC GGTGATCAAC
TGCGGCACCG ACTTCGACGG TGTGGTGGAG ATGTCGGCAC TCACCGGCAC CGAGCGCTAC
GGCGGACGGA CGCTGGTGTA CGTGATGCAC TACTGCGGAC GGGACTCGGC GTTGTTCGCC
GAACCGGACA CCGAGATCGC GCGCCGCTGG ACTGCCCAGC TGCGGGCGCT CTACCCCGAC
CGGCTCACCC ACGCCGGGGC GGTCGAACAG GTGCGCGTGT TCAAAGCGCC CTTCGTGGAG
CCGATTCCGA CACTGGGTTA CCACGAACGC ATGCCCGCGA GCCGGGTCGG TGACACGAAC
GTCTTCCTGG CCACCACGGC GCAGATCTAC CCCGAGGTCA CGAGCTGGAA TTCGTCGGTG
GGACTCGCCG GCCGCGTGGT GCGCGAGGTG ATCGACAACC GCGAGTCGGC CAGGCGCCGG
ACGGTGGCCA ATGCGTGA
 
Protein sequence
MSTPVEPGPV VVIGGGISGL TAAYRLARAS IPVRLVESAG QLGGLGIGGE IGGVEIERFY 
HCVMPTDEHL LPLLEELGLA ADIGWQQTTM GMNIDGRRYP FNSAVDLLRF TPLRFTQRIR
FGAVSLLLRR LGRGKDLDNT RTEDWLRGVY GPTVWQRLLR PLFGAKFGDA FGEVPARYLY
QRLGRESNVA TRGYPLGTYR SIVDRLRESI ESDGGRVDLG VGVQSVTADG DGATVRLDTG
EEIAAQSVVS TVPIPLLRSL SAEPLRAELP EIRLDYQGVV NAVFLLDRPL DGHYWAPVIN
CGTDFDGVVE MSALTGTERY GGRTLVYVMH YCGRDSALFA EPDTEIARRW TAQLRALYPD
RLTHAGAVEQ VRVFKAPFVE PIPTLGYHER MPASRVGDTN VFLATTAQIY PEVTSWNSSV
GLAGRVVREV IDNRESARRR TVANA