Gene Mkms_4032 details

Gene Information

Locus tag: Mkms_4032
Symbol:
ID: 4611972
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. KMS
Kingdom: Bacteria
Replicon accession: NC_008705
Strand:
Start bp: 4251456
End bp: 4252712
Gene Length: 1257 bp
Protein Length: 418 aa
Translation table: 11
GC content: 71%
IMG OID: 639793716
Product: H+ antiporter protein
Protein accession: YP_940014
Protein GI: 119870062
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID: [TIGR00900] H+ Antiporter protein
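
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a stop codon that is not counted in the protein length; neither assumption is stated in the record itself. The function names `span_length` and `expected_protein_length` are hypothetical helpers introduced only for illustration.

```python
# Values copied from the record above.
start_bp = 4251456
end_bp = 4252712
gene_length_bp = 1257
protein_length_aa = 418


def span_length(start, end):
    """Length of a closed interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Amino acids encoded by a CDS, assuming the last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons should hold if the record is internally consistent.
coords_consistent = span_length(start_bp, end_bp) == gene_length_bp
protein_consistent = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_consistent, protein_consistent)
```

Under these assumptions both checks agree with the listed values: 4252712 - 4251456 + 1 = 1257 bp, and 1257 / 3 - 1 = 418 aa.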


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCGCCA GCAAGCGCGT CCCGCTGTAC CTCATCTACT TCTCGGCCTT GACCGCCGGA 
GCGGGTAACG GCATCTCGCT GGTCGCCTTC CCCTGGCTGG TGTTGCAGCG CAACGGCTCA
GCGGTCGACG CGTCGATCGT CGCGATGGCC GGAACGCTGC CACTGCTGGT CGCCACGCTG
ATCGCCGGCG CCGCGGTCGA CTACCTGGGC CGGCGCCGGG TGTCGATGAT CGCTGATGGT
TTGTCTGCCC TGTCCGTCGC GGCGGTGCCC GTGCTGGCGT GGACGTTCGG CGTGCAGGCG
GTCAATGTGG CGGTGCTGGC CGGACTCGCG GCCCTGGGTG CGTTCTTCGA TCCCGCCGGG
ATGACCGCCC GCGAGACGAT GCTGCCCGAG GCCGCGCACC GGGCCGGGTG GACGCTCGAC
CACGCCAACA GCGTGTACGA GGCCGTGTTC AACCTGGCCT ACATCGTCGG GCCGGGCATC
GGTGGTCTGC TGATCGCGAC GATGGGCGGC GTCAACACCA TGTGGGTCAC GGCCGCGGCG
TTCGTCCTGT CGATCGGCGC CATCGCCGCG CTGCAGCTCG AGGGTGCGGG ACGACCCGAC
CGCGACCAGC TGCCCGAGCG GGTGTGGTCG GGCATCGTGG AGGGTCTCGG ATTCGTCTGG
CGCACACCGG TGTTGCGCAC CCTGGCGGTG GTGGACCTGG TCGCCACCGG GCTGTACATG
CCGATGGAGA GCGTGCTGTT CCCGAAGTAC TTCACCGACC GCGACGAACC CGCGCAGCTG
GGCTGGGTCC TGATGGCGCT GTCGATCGGG GGGCTGGTGG GCGCCCTCGG CTATGCCGCG
CTGTCGCGAT ACGTCAACCG CCGCACGGTG ATGCTGGCCG CCGTGCTCAC CCTCGGCGTC
GCGATGACCG TCATCGCGTT CCTTCCGCCG CTGCCGGTGA TCCTGGTCCT CTCGGCGCTC
GTCGGACTGG TCTACGGGCC GATCGCGCCG ATCTACAACT ACGTCATGCA GACGCGCGCG
CCGGCCCACC TGCGCGGCCG CGTGGTCGGG GTGATGGGGT CGCTGGCGTA TGCAGCGGGC
CCGCTCGGCC TGGTGGTGGC GGGTCCACTG GCCGACGGCG CCGGCCTGTC CGTGACGTTC
CTCGCGCTCG CACTGCCGAT GCTCGCCCTC GGTGTGGCGG CCGTCTTCCT GCCCGCCCTG
CGCGAACTCG ACCGGGAACC GGCCCGACCC CTCAGAGGCG AACCCGGGGC GGGGTGA
 
Protein sequence
MIASKRVPLY LIYFSALTAG AGNGISLVAF PWLVLQRNGS AVDASIVAMA GTLPLLVATL 
IAGAAVDYLG RRRVSMIADG LSALSVAAVP VLAWTFGVQA VNVAVLAGLA ALGAFFDPAG
MTARETMLPE AAHRAGWTLD HANSVYEAVF NLAYIVGPGI GGLLIATMGG VNTMWVTAAA
FVLSIGAIAA LQLEGAGRPD RDQLPERVWS GIVEGLGFVW RTPVLRTLAV VDLVATGLYM
PMESVLFPKY FTDRDEPAQL GWVLMALSIG GLVGALGYAA LSRYVNRRTV MLAAVLTLGV
AMTVIAFLPP LPVILVLSAL VGLVYGPIAP IYNYVMQTRA PAHLRGRVVG VMGSLAYAAG
PLGLVVAGPL ADGAGLSVTF LALALPMLAL GVAAVFLPAL RELDREPARP LRGEPGAG
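
The sequences above are printed in blocks of ten residues separated by spaces, so they need to be joined before any length or composition check. The following is a minimal sketch of that step, not the method used to produce the record's GC content value; `clean_sequence` and `gc_percent` are hypothetical helper names, and the sample is only the first line of the gene sequence.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, used only as a short sample.
sample = "ATGATCGCCA GCAAGCGCGT CCCGCTGTAC CTCATCTACT TCTCGGCCTT GACCGCCGGA"
seq = clean_sequence(sample)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full gene sequence through the same two functions would give a length and GC percentage to compare with the Gene Length and GC content fields.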