Gene Mkms_3074 details

Gene Information

Locus tag: Mkms_3074
Symbol:
ID: 4610909
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. KMS
Kingdom: Bacteria
Replicon accession: NC_008705
Strand:
Start bp: 3216203
End bp: 3217672
Gene Length: 1470 bp
Protein Length: 489 aa
Translation table: 11
GC content: 64%
IMG OID: 639792745
Product: phage terminase
Protein accession: YP_939058
Protein GI: 119869106
COG category: [R] General function prediction only
COG ID: [COG4626] Phage terminase-like protein, large subunit
TIGRFAM ID:
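
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below reproduces that arithmetic. It assumes 1-based inclusive coordinates and that the final codon is a stop codon not counted in the protein length; the record does not state either convention explicitly, but both are consistent with its numbers. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3216203, 3217672)
print(length)                  # 1470
print(protein_length(length))  # 489
```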


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.218281
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGGTCGG GTAACAAGGG AAAGTTGCTG AATCAGCATG TTCCTTTGCC ATTCCGACCT 
GTCTCGGAGG TGGAGTCGGA GCGGTTCGTG AAGTTCTGCG AGAAGTTCCT GCGGACGCCG
AAAGGGACGG GTGCGCGCGA GGTGTTCCGG CCGCGTGAGT GGCAGATGGA CATCGTGCGG
GATGTGCTCG ATTCGGGTGC CCGCACCGTC GGGCTGATGA TGCCCCGAGG CCAGGGCAAG
ACGACGTTGT CGGCGGCGAT CCTGCTGTAC ATTTTCTTCA CTCGCGGCGA GGGCGCGAAT
GTGGTGCTGT TCGCGGTAGA TGAGCGCCAG GCGTCTCTTG CATTCCGGGT GGCTGCGCGC
ATGGTGCAGT TGTCGGAGGA TTTGTCGTCG CGCTGCTACG TGTACGCCGA CAAGCTGGTG
TTGCCGTTAA CCGATTCGAC GTATCAGGTG ATGCCCGCGT CCGCTGCGGC CGCTGAGGGT
TTGGACTACG TGGCCTGTCT GTGCGACGAG GCCGGCGTCA TTAACCGCGA TGTGTTCGAG
GTGGCGCAGC TGGCGCAGGG CAAGCGAGAA CGCTCAGTGC TCATCGCCAT CGGAACACCT
GGCCCAGACC CAAACGACCA GGTCCTCGCC GACCTCCGCG CCTATGCCGC CGAGCATCCC
GACGACAAAA GCCTGGTGTG GCGTGAGTTC TCCGCAGCCG GTTTTGAGGA CCACGGTGCC
GATTGCCCGC ATTGCTGGGA ACTCGCCAAC CCTGCGCTGG ACGACTTTCT GCATCGTGAT
GCGCTGCACG CGCTGCTGCC GCCGAAGACC CGCGAGGCGA CGTTCCGGCG TGCCCGATTG
TGCCAGTTCT CGACGGATAC CGACGGCGCG TTCCTTCCTG CGGGTGTGTG GGAAGGTTTG
TCCACGAGTT CACCGGTCCC GCCAGGGGTT GATGTGGTGC TCGCGTTGGA TGGCTCGTAT
AACGGGGACA CGACCGCCCT GCTCGTCGGA ACCGTCTCCG CTGAACCACA TTTCGATGTG
GTCCAGGTGT GGGACCCCAA AGGGGACCCG GATTATCGGG TGCCCGTCGC GGAAGTCGAG
GACGTTATTC GCCGGTCGGC GAAGGAGTGG CAGGTCGTCG AAATCATCGC TGACCCTTTC
CGATTCACCC GCACCCTGCA AGCCCTGGAA GCGGAGCGGC TGCCGGTAGT GGAGTTCCCG
CATTCCCCGT CCCGGCTGAC AGCCGCCACC ACTGACCTCT ATAAGGCGTG CGTGAACGGG
CAACTGACCC ATTCCGGGCA TCCCACGCTG GCCGCTCACG TCGCGGCGGC TGTGATTCGG
GAGGACCCGC GCGGCATGCG TTTGGACAAG GCGTCGCGGT CCCGGCACGC CCGAAAAATC
GACTGCGCCG CCTGCCTGGT GATGGCGCAT TCCCGCGCCA CCTGGCGCGC AACCCACAAG
AAAAGAAAGC GAGCAGTGAG CTTTAAATGA
 
Protein sequence
MRSGNKGKLL NQHVPLPFRP VSEVESERFV KFCEKFLRTP KGTGAREVFR PREWQMDIVR 
DVLDSGARTV GLMMPRGQGK TTLSAAILLY IFFTRGEGAN VVLFAVDERQ ASLAFRVAAR
MVQLSEDLSS RCYVYADKLV LPLTDSTYQV MPASAAAAEG LDYVACLCDE AGVINRDVFE
VAQLAQGKRE RSVLIAIGTP GPDPNDQVLA DLRAYAAEHP DDKSLVWREF SAAGFEDHGA
DCPHCWELAN PALDDFLHRD ALHALLPPKT REATFRRARL CQFSTDTDGA FLPAGVWEGL
STSSPVPPGV DVVLALDGSY NGDTTALLVG TVSAEPHFDV VQVWDPKGDP DYRVPVAEVE
DVIRRSAKEW QVVEIIADPF RFTRTLQALE AERLPVVEFP HSPSRLTAAT TDLYKACVNG
QLTHSGHPTL AAHVAAAVIR EDPRGMRLDK ASRSRHARKI DCAACLVMAH SRATWRATHK
KRKRAVSFK
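
The relationship between the gene sequence and the protein sequence above can be checked with a small translation routine. This is a minimal sketch, not the database's own method: it builds the codon-to-amino-acid mapping shared by the standard code and translation table 11, strips the whitespace used to group bases in blocks of ten, and stops at the first stop codon. Table 11 also permits alternative start codons, which this sketch ignores; that does not matter here because the gene begins with ATG. The helper names (`clean`, `gc_content`, `translate`) are hypothetical.

```python
from itertools import product

# Codon table in TCAG order; table 11 assigns the same amino acids as the
# standard code and differs only in which codons may act as start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display, and upper-case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence as displayed above.
first_row = "ATGCGGTCGG GTAACAAGGG AAAGTTGCTG AATCAGCATG TTCCTTTGCC ATTCCGACCT"
print(translate(first_row))  # MRSGNKGKLLNQHVPLPFRP
```

Passing the full gene sequence to `translate` and `gc_content` should allow comparison with the protein sequence and the GC content reported in the record.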