Gene Mkms_4639 details

Gene Information

Locus tag: Mkms_4639
Symbol:
ID: 4612587
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. KMS
Kingdom: Bacteria
Replicon accession: NC_008705
Strand:
Start bp: 4866157
End bp: 4867455
Gene Length: 1299 bp
Protein Length: 432 aa
Translation table: 11
GC content: 64%
IMG OID: 639794330
Product: sodium:dicarboxylate symporter
Protein accession: YP_940620
Protein GI: 119870668
COG category: [C] Energy production and conversion
COG ID: [COG1301] Na+/H+-dicarboxylate symporters
TIGRFAM ID:
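
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon. The function names are hypothetical and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(4866157, 4867455)
print(length, protein_length(length))  # prints: 1299 432
```

Under these assumptions the computed values agree with the listed gene length (1299 bp) and protein length (432 aa).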


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGGTCT TGGCCCACCC CGCAGTCCAG ATCGGCATCG CCGCAATCGC GGGACTGGCG 
TTCGGGCTGA GCGTCGGCGA GTGGGCGGCG AACCTGAAGT TCATCGGGGA CATGTTCATC
CGACTCATCC AGATGTCGAT CGTGCCGCTG GTGATGGCGT CGGTGATCGT CGCGACCGGC
TCGATGAACG GCGCAGGAAC AGGCCGGATC GCACTGCGCA CGTTCAAGTG GATGCTCGGT
TTCTCCGCCG TCGCAGCGGT CCTGGCCTGG CTGCTCGGCG AACTGATCCG GCCCGGCGCC
GGCATGGTCT TCGACGGGGA ACTGGACTCC GAGCTCGCGG GGTCGGCCGA CGAGGCGCTG
GGCTGGCAGG AGACGCTACT CAACTTCGTC TCGACGAACA TCTTCGACGC GATGTCGACC
GCGTCGATGG TGCCGATCAT CGTGTTCTCG CTGCTGTTCG GACTCGCCCT GCGCACCCAG
ATCAACAAGA CCGGCGACAC CAGGGTCCTC ACCCTCATCG ATCAGATTCA GCAGGTCGTG
CTCACCATGA TCCGGCTGGT CATGTACATC GCGCCGATCG GCGTGTTCTG TCTGCTGGCG
GCGCTGGCCG GCGACGTCGG CTTCGCGGTC GTGACGTCGG CGCTGAAGTA CCTGGGCGCG
ACCCTGCTCG GCGTGCTCAT CCTCTTCGCC TTGTTCGTCG TGGTCGTCAC CCTGCGCACG
CGGCTGAACC CGGCGAAGCT GCCCGGCAAG CTCGCCGAGC AGACCGCGAT CGCGATCACC
ACGACGAGTT CGGCGGTGAC CTTCCCGACG GTGCTGAAGA ACACCGTCGA AAAATTCGGG
GTCAGCCAGA AGATCGCGAA CTTCACGCTG TCGATCGGGT TGACGATGGG GTCGTACGGA
GCCGTGCTCA ACTACGTGAT CGTCGTCCTG TTCCTCGCCC AGGCGGGCGG CGTCGACTTG
AGTTTCGGTC AGATCGCGTT CGGCATGGGT CTCGCGATCC TGCTCAACAT GGGAACGATC
ACTGTTCCGG GCGGCTTCCC GGTTGTCGCC ATGTTCCTCG CGACGTCACT TGATCTGCCG
TTCGAGGCCG TCGGGTTGCT GATCGCCGTC GACTGGTTCG CCGGCATCTT CCGCACGTTC
CTCAACGTGA ACGGCGACAC CTTCGTCGCG ATGCTGGTCG CCAACGCCGA TGACGAAATC
GATCGCGACG TCTACAACGG GACGAAGACG ATGATCGCCG ATGAGATCGA TCTCGAAGAG
CTCGAAGGCG CGATGGCCCG TGCCGACGAT GCCGACTGA
 
Protein sequence
MKVLAHPAVQ IGIAAIAGLA FGLSVGEWAA NLKFIGDMFI RLIQMSIVPL VMASVIVATG 
SMNGAGTGRI ALRTFKWMLG FSAVAAVLAW LLGELIRPGA GMVFDGELDS ELAGSADEAL
GWQETLLNFV STNIFDAMST ASMVPIIVFS LLFGLALRTQ INKTGDTRVL TLIDQIQQVV
LTMIRLVMYI APIGVFCLLA ALAGDVGFAV VTSALKYLGA TLLGVLILFA LFVVVVTLRT
RLNPAKLPGK LAEQTAIAIT TTSSAVTFPT VLKNTVEKFG VSQKIANFTL SIGLTMGSYG
AVLNYVIVVL FLAQAGGVDL SFGQIAFGMG LAILLNMGTI TVPGGFPVVA MFLATSLDLP
FEAVGLLIAV DWFAGIFRTF LNVNGDTFVA MLVANADDEI DRDVYNGTKT MIADEIDLEE
LEGAMARADD AD
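
The relationship between the gene sequence and the protein sequence can be reproduced with a short script. This is a minimal sketch, not the pipeline that produced this record: it uses the standard codon assignments (translation table 11 uses the same amino-acid assignments as the standard code and differs in which start codons are permitted), and the helper names `clean`, `gc_fraction`, and `translate` are hypothetical. Only the first line of the gene sequence is embedded here to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence from this record (60 bases).
first_line = "ATGAAGGTCT TGGCCCACCC CGCAGTCCAG ATCGGCATCG CCGCAATCGC GGGACTGGCG"
print(translate(first_line))  # prints: MKVLAHPAVQIGIAAIAGLA
```

The printed residues match the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the listed GC content.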