Gene Mkms_2033 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_2033 
Symbol 
ID4613551 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp2155723 
End bp2156853 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content73% 
IMG OID639791699 
ProductDNA protecting protein DprA 
Protein accessionYP_938022 
Protein GI119868070 
COG category[L] Replication, recombination and repair
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0758] Predicted Rossmann fold nucleotide-binding protein involved in DNA uptake 
TIGRFAM ID[TIGR00732] DNA protecting protein DprA 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.677839 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACACCA TCGAAACGCG CGCGTGGGCG TATCTGTCGC GGGTCGCCGA ACCGCCGTGT 
CCGGAACTGG CGGCCCTGGT CACCGACGTC GGGCCGGTCG AGGCGGCCGA GAAGATCCGG
CGCGCCGAGG TGGGTGAGCG GTTGAGGCGC AAGACCGAGG CCCGCCGCGA ACATGGTTGT
GCGCAAGCGG ATCTCGACGT GCTGGCCCGG ATGGGCGGGC GATTGGTCAC CGCCTTCGAC
GACGAATGGC CGCTGTTGAA CTTTCTGAAA TTCACCGGCG TGGATACGCA GAAGCGGCCG
CAGGGCCACC CGCCGCTCGT GCTGTGGGCG GTCGGCCCGG TCGGTATGGA CGAGGTGGCC
GAGCGGGCGG CGGCGATCGT CGGCACCCGG GCGGCCACTG CCTATGGCGA ACACGTGGCG
GCGGATCTGG CCGCCGGGCT GGCGATGCGC GACGTCGCGG TGGTGTCCGG TGGCGCGTTC
GGTATCGACG GCGCCGCGCA CCGGGCGACG CTCGCGGGCG AGGGGGTGAC GGTCGCCGTC
GTGGCGGGCG GTATCGACGT GCCCTACCCG GCGGGCCACT CCGCGCTGCT CGCCAGGATT
CGCGCGAACG GGCTGGTGCT CAGCGAATAT CCGCCGGGGT CCCGCCCGAC GCGCAGCCGC
TTCCTCACCC GCAACCGGCT CGTCGCGGCG TTGTCGGGTG CGACGGTGGT GGTCGAGGCG
GGGGCGCGTA GCGGTGCGGC GAACACCGCG GCGTGGGCCG GCGCTCTCGG TCGCAACGTG
TGCGCCGTGC CCGGACCGGT CACGTCGGGG GCGTCGGTGG GCTGTCACCG CCTGCTACGC
GACGGCGGCG CGATCCTGGT GACCCGTGCC GAGGAAATCA TCGAGCTGGT CGGACGGATG
GGAGAGCTCC CGCCGGTCGA GGCCGGGCCG GCCACACCGC TCGACGGGCT CACCGACGCC
GAGCGACGGA TCTACGACGC CCTGCCCAAA CGCGGCGCGC GCAGTGCGGA CGAGGTCGCG
GTCGCCGCGG GTATCCCCGC CTACCAGGTC GTGGGGCCAT TGGCGATGTT AGAGGTGGCA
GGCTTGGTGG TACATGAGGG CGGCCGCTGG AGGATGGCGC GGGGACGGTA G
 
Protein sequence
MNTIETRAWA YLSRVAEPPC PELAALVTDV GPVEAAEKIR RAEVGERLRR KTEARREHGC 
AQADLDVLAR MGGRLVTAFD DEWPLLNFLK FTGVDTQKRP QGHPPLVLWA VGPVGMDEVA
ERAAAIVGTR AATAYGEHVA ADLAAGLAMR DVAVVSGGAF GIDGAAHRAT LAGEGVTVAV
VAGGIDVPYP AGHSALLARI RANGLVLSEY PPGSRPTRSR FLTRNRLVAA LSGATVVVEA
GARSGAANTA AWAGALGRNV CAVPGPVTSG ASVGCHRLLR DGGAILVTRA EEIIELVGRM
GELPPVEAGP ATPLDGLTDA ERRIYDALPK RGARSADEVA VAAGIPAYQV VGPLAMLEVA
GLVVHEGGRW RMARGR