Gene Mkms_3988 details


Gene Information

Locus tag: Mkms_3988
Symbol:
ID: 4611928
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. KMS
Kingdom: Bacteria
Replicon accession: NC_008705
Strand:
Start bp: 4202248
End bp: 4203843
Gene Length: 1596 bp
Protein Length: 531 aa
Translation table: 11
GC content: 72%
IMG OID: 639793672
Product: hypothetical protein
Protein accession: YP_939970
Protein GI: 119870018
COG category: [S] Function unknown
COG ID: [COG2966] Uncharacterized conserved protein
TIGRFAM ID:
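
The coordinate and length fields above agree with each other, which can be confirmed with a little arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the coding sequence ends with a single stop codon that is not counted in the protein length. Both are assumptions about the database's conventions, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon except the terminal stop codon encodes one residue.
    return gene_len_bp // 3 - 1


print(gene_length(4202248, 4203843))  # 1596
print(protein_length(1596))           # 531
```

Under these assumptions the computed values match the listed Gene Length (1596 bp) and Protein Length (531 aa).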


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.911951
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCAGTGG ATTCCGAACG ATCTGACGGA CCGCCCCGGC GTCGGCGCGC ACTGAATCTC 
GCGCTGCGGG GCCGACGCGA CCCGGCATCG GGTGCGGGCC AGCGCCGGCG GGTGTCGGGT
GGGCTGAGCG AGCGGCACAC CCGCAAGGTC CTCGACCTGA CCGTCCGCCT CGCCGAGGTG
ATGCTGTCCT CCGGGTCGGG TACCGCCGAC GTCGTCGCGA CGGCGCAGGA CGTCGCGCAG
GCCTACCAGC TCACCGACTG TGTCGTCGAC GTCTTCGTCA CCACCATCTT CGTATCGGCG
CTGCCGACCG CCGACAGCCC GCCGGTGACG ATCGTGCGGG CAGTCCACGC ACGCTCGACC
GACTACTCGC GGCTCGCCGA ACTCGACCTG CTCGTCCGGC GGATCACCTC CGGCGGCGTA
TCGGTCGACG AGGCCCACGA GGCGATGGAC GAGCTGACCG AGCGGCCCCA CCCGTATCCA
CGCTGGGTCG CCACCGCGGG GTGGGCCGGT TTCGCCCTGG GCATCGCGAT GCTGCTCGGC
GGAAGCTGGT TGACCTGGAT CCTGGCCGCG GTCTCCTCGG CGCTGATCGA TCGGGTCGGC
CGGGTGCTCA ACCGGTGGGG CACGCCGTTC TTCTTCCAGC AGGCGGCGGG CGCGTTCATC
GCGACGATGA TCGCCGTCGC GGCGTACCTC TACGCCGGGG TGGGGCCCAC CGCCCTGGTG
GCGACCGGGA TCGTCATGCT GCTGGCGGGT CTGACCCTGG TGGGTTCGGT GCAGGACGCG
CTGACCGGTT ACATGGTCAC CGCGGTGGCC CGCCTCGGTG ACGTCCTGTT CCTCACCGCG
GGCATCGTCG TCGGCATCCT GGCCGGGCTG CAGGTCGCCG CGCTCGCCGG GATCCAGATC
GAGCTCCACG TCGACGCCAC GGAGTCGTTC GTGATGCCGA CCCGGCCGGT GCCGATCCTG
CTCGCGGTGC TGGGTGCCGC GCTTGCTGGC GCCTGCCTGA CGGTCGCCAG CTATGCGCGG
CTGCGCTCGG TGCTCACCGC GGGTGTCGCC GCAGGGCTGG CGGAGGCGGT GCTGATCGGT
CTGGGCGCAG CCGGGGTCGG CGGGGTGGTC GCCACCGGGA CCGCGGCCGT CGGCGTCGGT
TTGCTGGCCA CCCTGATCTC GATTCGCCGG CAGGCTCCGG CCCTGGTCAC CGCCACCGCG
GGCATCACTC CGATGCTGCC GGGCCTCGCG GTGTTCCGTG CGGTGTTCTT CTTCGCCGTC
GACCGCAACA TCCCCGGTGG GATTGCCCAG GCGCTGGGTG CCGCCGCCAT CGCACTGGCC
ATCGGGGCCG GTGTCGTGAT GGGCGAGTTG CTCGGCTCCC CGCTGCGCTA CCGCGCCGGG
CGTATCGGCG ACTTCCTGCG CGTCGAAGGG CCGCCCGGGC TTCGCCGGGC GATCGGCAAT
GTGGTTGCGC TGCGGCCGTC CGCCGGCCAG CAGCAGGCAC GTACCCCGCA CCGGCGGTCA
TGGAGCGTGG CCCTCGAACC GAAGGTCAAG AATTCGGCGG CCGACGACGA CGAGGCGTCC
GCAGGCCCCT CGGATGGTGA AAACGCGGAG CGGTAA
 
Protein sequence
MAVDSERSDG PPRRRRALNL ALRGRRDPAS GAGQRRRVSG GLSERHTRKV LDLTVRLAEV 
MLSSGSGTAD VVATAQDVAQ AYQLTDCVVD VFVTTIFVSA LPTADSPPVT IVRAVHARST
DYSRLAELDL LVRRITSGGV SVDEAHEAMD ELTERPHPYP RWVATAGWAG FALGIAMLLG
GSWLTWILAA VSSALIDRVG RVLNRWGTPF FFQQAAGAFI ATMIAVAAYL YAGVGPTALV
ATGIVMLLAG LTLVGSVQDA LTGYMVTAVA RLGDVLFLTA GIVVGILAGL QVAALAGIQI
ELHVDATESF VMPTRPVPIL LAVLGAALAG ACLTVASYAR LRSVLTAGVA AGLAEAVLIG
LGAAGVGGVV ATGTAAVGVG LLATLISIRR QAPALVTATA GITPMLPGLA VFRAVFFFAV
DRNIPGGIAQ ALGAAAIALA IGAGVVMGEL LGSPLRYRAG RIGDFLRVEG PPGLRRAIGN
VVALRPSAGQ QQARTPHRRS WSVALEPKVK NSAADDDEAS AGPSDGENAE R
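
The sequences above are displayed in 10-character blocks, so spaces and line breaks have to be removed before any analysis. The sketch below is one way to do that in Python and to run two simple checks: the GC fraction, and whether the sequence has the basic shape of a coding sequence. The helper names are hypothetical, and the start codon set (ATG, GTG, TTG) is only the commonly used subset of initiators for translation table 11, chosen here as an assumption. The stop codons TAA, TAG and TGA are the standard ones for that table.

```python
def clean(seq: str) -> str:
    """Strip the spaces and line breaks used for block display; uppercase the rest."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def looks_like_cds(
    seq: str,
    starts: tuple = ("ATG", "GTG", "TTG"),  # assumed subset of table 11 initiators
    stops: tuple = ("TAA", "TAG", "TGA"),
) -> bool:
    """True if the length is a multiple of 3 and the first/last codons are start/stop."""
    s = clean(seq)
    return len(s) % 3 == 0 and s[:3] in starts and s[-3:] in stops


# First display block of the gene sequence, used only as a small illustration.
first_block = "GTGGCAGTGG"
print(round(gc_content(first_block), 2))  # 0.7
```

Applied to the full gene sequence, `gc_content` can be compared with the listed GC content of 72%. The gene begins with GTG and ends with TAA. Under translation table 11, GTG used as an initiator is translated as methionine, which is consistent with the protein sequence starting with M.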