Gene Mmcs_1987 details


Gene Information

Locus tag: Mmcs_1987
Symbol:
ID: 4110821
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. MCS
Kingdom: Bacteria
Replicon accession: NC_008146
Strand:
Start bp: 2136739
End bp: 2137869
Gene Length: 1131 bp
Protein Length: 376 aa
Translation table: 11
GC content: 73%
IMG OID: 638031109
Product: DNA processing protein DprA, putative
Protein accession: YP_639152
Protein GI: 108798955
COG category: [L] Replication, recombination and repair; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG0758] Predicted Rossmann fold nucleotide-binding protein involved in DNA uptake
TIGRFAM ID: [TIGR00732] DNA protecting protein DprA
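
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption that is consistent with the stated gene length) and that the CDS ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
# Coordinates copied from the record; assumed to be 1-based and inclusive.
START_BP = 2136739
END_BP = 2137869


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by a complete CDS, not counting the stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # prints "1131 376"
```

Both values agree with the Gene Length and Protein Length fields of the record.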


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACACCA TCGAAACGCG CGCGTGGGCG TATCTGTCGC GGGTCGCCGA ACCGCCGTGT 
CCGGAACTGG CGGCCCTGGT CACCGACGTC GGGCCGGTCG AGGCGGCCGA GAAGATCCGG
CGCGCCGAGG TGGGTGAGCG GTTGAGGCGC AAGACCGAGG CCCGCCGCGA ACATGGTTGT
GCGCAAGCGG ATCTCGACGT GCTGGCCCGG ATGGGCGGGC GATTGGTCAC CGCCTTCGAC
GACGAATGGC CGCTGTTGAA CTTTCTGAAA TTCACCGGCG TGGATACGCA GAAGCGGCCG
CAGGGCCACC CGCCGCTCGT GCTGTGGGCG GTCGGCCCGG TCGGTATGGA CGAGGTGGCC
GAGCGGGCGG CGGCGATCGT CGGCACCCGG GCGGCCACTG CCTATGGCGA ACACGTGGCG
GCGGATCTGG CCGCCGGGCT GGCGATGCGC GACGTCGCGG TGGTGTCCGG TGGCGCGTTC
GGTATCGACG GCGCCGCGCA CCGGGCGACG CTCGCGGGCG AGGGGGTGAC GGTCGCCGTC
GTGGCGGGCG GTATCGACGT GCCCTACCCG GCGGGCCACT CCGCGCTGCT CGCCAGGATT
CGCGCGAACG GGCTGGTGCT CAGCGAATAT CCGCCGGGGT CCCGCCCGAC GCGCAGCCGC
TTCCTCACCC GCAACCGGCT CGTCGCGGCG TTGTCGGGTG CGACGGTGGT GGTCGAGGCG
GGGGCGCGTA GCGGTGCGGC GAACACCGCG GCGTGGGCCG GCGCTCTCGG TCGCAACGTG
TGCGCCGTGC CCGGACCGGT CACGTCGGGG GCGTCGGTGG GCTGTCACCG CCTGCTACGC
GACGGCGGCG CGATCCTGGT GACCCGTGCC GAGGAAATCA TCGAGCTGGT CGGACGGATG
GGAGAGCTCC CGCCGGTCGA GGCCGGGCCG GCCACACCGC TCGACGGGCT CACCGACGCC
GAGCGACGGA TCTACGACGC CCTGCCCAAA CGCGGCGCGC GCAGTGCGGA CGAGGTCGCG
GTCGCCGCGG GTATCCCCGC CTACCAGGTC GTGGGGCCAT TGGCGATGTT AGAGGTGGCA
GGCTTGGTGG TACATGAGGG CGGCCGCTGG AGGATGGCGC GGGGACGGTA G
 
Protein sequence
MNTIETRAWA YLSRVAEPPC PELAALVTDV GPVEAAEKIR RAEVGERLRR KTEARREHGC 
AQADLDVLAR MGGRLVTAFD DEWPLLNFLK FTGVDTQKRP QGHPPLVLWA VGPVGMDEVA
ERAAAIVGTR AATAYGEHVA ADLAAGLAMR DVAVVSGGAF GIDGAAHRAT LAGEGVTVAV
VAGGIDVPYP AGHSALLARI RANGLVLSEY PPGSRPTRSR FLTRNRLVAA LSGATVVVEA
GARSGAANTA AWAGALGRNV CAVPGPVTSG ASVGCHRLLR DGGAILVTRA EEIIELVGRM
GELPPVEAGP ATPLDGLTDA ERRIYDALPK RGARSADEVA VAAGIPAYQV VGPLAMLEVA
GLVVHEGGRW RMARGR
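
The gene and protein sequences can be checked against each other by translating the DNA. The sketch below is an illustration only: it uses just the first 60 bases of the gene sequence, strips the 10-base grouping spaces, and translates with the standard codon assignments, which translation table 11 shares for elongation codons (table 11 differs from the standard table in its permitted start codons). The helper names `clean`, `gc_fraction` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position cycles fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# First line of the gene sequence as shown in the record (first 60 bases only).
FRAGMENT = "ATGAACACCA TCGAAACGCG CGCGTGGGCG TATCTGTCGC GGGTCGCCGA ACCGCCGTGT"


def clean(seq: str) -> str:
    """Remove the grouping whitespace and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


dna = clean(FRAGMENT)
print(translate(dna))                # compare with the start of the protein sequence
print(f"GC: {gc_fraction(dna):.0%}")  # fragment only; the record's 73% is for the full gene
```

Running the same functions on the full 1131-base sequence should reproduce the complete protein sequence; the GC figure for the 60-base fragment is not expected to match the whole-gene value.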