Gene Mmcs_2278 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmcs_2278 
Symbol 
ID4111111 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. MCS 
KingdomBacteria 
Replicon accessionNC_008146 
Strand
Start bp2416592 
End bp2418241 
Gene Length1650 bp 
Protein Length549 aa 
Translation table11 
GC content71% 
IMG OID638031403 
Productextracellular solute-binding protein 
Protein accessionYP_639442 
Protein GI108799245 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.452 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCTGCTC GGATCCGGCG GGCCGCCGCT GCGGCGGCGG CGGCGACGTT GACGGCCACC 
GCGCTGTCCT CCTGCGGCGG TGCCGCGGAA TCGGTGGACT ATGCCGTGGA CGGCGCACTG
CTCAGCTACA ACACCAACAC CGTGGCGGGT GCGGCGTCGG GCGGTCCGCA GGCGTTCGCC
CGCGTGCTGA CCGGGTTCAA CTACCACGGT CCGGAAGGTC AGATCGTCGG CGACCACGAC
TTCGGCACCA TCGCGGTGGT CGGCCGCGCA CCGCTCGTGC TCGACTACCA GATCAACGAC
AAGGCCGTCT ACTCCGACGG TAAACCGGTG ACCTGCGACG ACCTGGTCCT CGCCTGGGCC
TCCCAGTCCG GGCGCTTCCC GCAGTTCGAC GCCGCCAACC GGGGCGGTTA CAGCGACGTC
GCGTCGGTGG ACTGCGCCCC GGGCCAGAAA CGCGCGCGGG TGTCGTTCGC GCCGGACCGC
GGATTCGTCG ACTTCGGGCA GCTGTTCAGC GCGACCGCGC TGATGCCGTC CCACGTCCTC
GCCGACCAGC TCGGCGTCGA CGTCACGACC GCGCTGACGA AGGGCGACCA GCCGACCGTC
GCGCGCATCG CCGAGGCCTG GAACACCACC TGGGAGCTGA AGCCCGGACT GAACGACGAG
GACCTGAAGA AGTTCCCGTC GTCGGGCCCG TACAAGCTGG AATCGGTGAC CGACGAGGGC
GCCGTCACCC TGGTGGCCAA CGACAAGTGG TGGGGCGCCA AACCCGTGAC GAACCGGATC
ACGGTGTGGC CGCGCGGCGC CGACATCCAG GAGCGCGTGA ACGAGGGCGC CTACGACGTC
GTCGACATCG CCGCCGGATC CTCCGGCACC CTCAACATGC CCGAGGACTA CGTCCGCACC
GACAGCCCGT CCTCGGGTGT CGAACAGCTG ATCTTCGCCC CACGCGGCCC GCTCGCCGCT
CCACCGGCCC GGCGTGCGCT GGCCTTCTGC ACACCGCGCG ACGTCATCGC CCGCAACGCC
GATGTGCCGA TCGCCAACTC ACGGCTGAAC CCGGCCGACG AGGACGCCTT CGCCGCCGCG
GAGGCCACCG GCGAGGTGGG TCAGTTCACC GCCGCCAACC CGAACGCCGC CCGCGACGCG
CTGGGGAACC GTCCGCTGAC CGTGCGCATC GGCTACCAGA GCCCGAACGC CAGACTGGCC
GCGACCGTCG GCACGATCGC CAGGGCCTGC GAACCGGCGG GTATCCGGGT GGTCGACGCC
GCCGGCCCGA AGGTCGGCCC GCTTACGCTG CGCGCCAACG AGATCGACGT CCTGCTCGCC
AGCACCGGCG GCGCCCCGGG CAGCGGTTCG ACCGGTTCGT CGGCGTTGGA CGCCTACGCG
CTGCACACCG GCAACGGCAA CAACCTCAGC GGGTACTCCA ACGCGCGCGT CGACGGCATC
ATCGGTTCGC TGGCGGTGAG CAACGATCCC AAGGAGCTCG CCCGGCTGAT CGGTGAGGGC
GCGCCGATCC TGTGGGCCGA CATGCCGACC CTGCCGCTGT ACCGTCAGCA GCGCACTCTG
CTCACCTCGT CGAAGATGTA CGCGGTGATC GGCAACCCCA CCCGATGGGG CGCCGGCTGG
AACATGGACC GCTGGAGGCT GTCACGGTGA
 
Protein sequence
MPARIRRAAA AAAAATLTAT ALSSCGGAAE SVDYAVDGAL LSYNTNTVAG AASGGPQAFA 
RVLTGFNYHG PEGQIVGDHD FGTIAVVGRA PLVLDYQIND KAVYSDGKPV TCDDLVLAWA
SQSGRFPQFD AANRGGYSDV ASVDCAPGQK RARVSFAPDR GFVDFGQLFS ATALMPSHVL
ADQLGVDVTT ALTKGDQPTV ARIAEAWNTT WELKPGLNDE DLKKFPSSGP YKLESVTDEG
AVTLVANDKW WGAKPVTNRI TVWPRGADIQ ERVNEGAYDV VDIAAGSSGT LNMPEDYVRT
DSPSSGVEQL IFAPRGPLAA PPARRALAFC TPRDVIARNA DVPIANSRLN PADEDAFAAA
EATGEVGQFT AANPNAARDA LGNRPLTVRI GYQSPNARLA ATVGTIARAC EPAGIRVVDA
AGPKVGPLTL RANEIDVLLA STGGAPGSGS TGSSALDAYA LHTGNGNNLS GYSNARVDGI
IGSLAVSNDP KELARLIGEG APILWADMPT LPLYRQQRTL LTSSKMYAVI GNPTRWGAGW
NMDRWRLSR