Gene Mkms_2338 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMkms_2338 
Symbol 
ID4613340 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. KMS 
KingdomBacteria 
Replicon accessionNC_008705 
Strand
Start bp2450201 
End bp2451466 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content65% 
IMG OID639792007 
Productextracellular ligand-binding receptor 
Protein accessionYP_938326 
Protein GI119868374 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID[TIGR03407] urea ABC transporter, urea binding protein 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.267805 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGACGAC CAGGACGCTC CTCGTTCCAC AGATCGGCTC TTGCTGCGGG AAGTCTGGTC 
GCCGTGACCA GCATGTTGCT GGCGGGCTGC GGCAGCAAGG CCAGCGACAC CGACGCCGCC
AGCGCCGAGT CCTGTGTCGA CACCTCCGGC CCGAACATCA AGGTGGGCTC GCTGAACTCG
CTGTCCGGCA CGATGGCGAT CTCGGAGGTC ACCGTCCGCG ACGCCATCAA GCTGGCGGTC
GACGAGATCA ACGGCGCCGG CGGTGTCCTG GGCAAGCAGA TCCAGCTGGT CGGCGAGGAC
GGGGCGTCGG AGCCCACGGT CTTCGCCGAG AAGGCGGAGA AGCTCATCAG CAGCGACTGC
GTCGCCGCGG TCTTCGGTGG ATGGACCTCG TCGAGCCGCA AGGCCATGCT GCCGGTCTTC
GAGAGCGCGA ACTCGCTGCT CTACTACCCC GTCCAGTACG AGGGCCTGGA GTCCTCCCCC
AACATCTTCT ACACCGGCGC CACCACCAAC CAGCAGATCG TGCCCGCCCT CGACTACCTC
AAGGAGAAGG GCGTCAAGTC CCTCTACCTG GTCGGCAGCG ACTACGTCTT CCCGCAGACC
GCCAACCGCA TCATCAAGGC CTACGCCGAG GCCAACGGCA TCGAGATCAA GGGTGAGGAC
TACACCCCGC TGGGTTCGAC CGACTTCTCC ACGATCATCA ACAAGGTGCG TTCGGCCGAT
GCCGATGCGG TGTTCAACAC CCTCAACGGC GACTCCAACG TCGCGTTCTT CCGCGAGTAC
CGCAACGTCG GCCTGACCCC GCAGGACATG CCGGTGGTCT CGGTCTCCAT CGCCGAGGAG
GAGGTCGGCG GTATCGGTGT CCAGAACATC GACGGCCAGC TGACCGCGTG GGACTACTAC
CAGACCATCG ACACCCCGGT GAACAACGAG TTCGTCAAGG CCTACAAGGC CAAGTTCGGC
GCCGACAAGC CGACCTCGGA CCCGATGGAA GCCGCCTACG TGTCGGTGTA CCTGTGGAAG
AACACCGTCG AGAAGGCACA GTCGTTCGAC GTCAAGGCGA TCCAGGACAA CGCGGGCGGG
GTCACGTTCG ACGCACCCGA GGGCAAGGTC ACGATCGACG GCGAGAACCA CCACATCACC
AAGACCGCCC GCATCGGTGA GATCCGCCCG GACGGCCTGA TCTACACGAT CTGGGAATCC
CCCGGGCCGA TCGAGCCGGA CCCGTACCTG AAGTCCTACC CGTGGGCCGC CGGCCTCTCC
AGCTGA
 
Protein sequence
MRRPGRSSFH RSALAAGSLV AVTSMLLAGC GSKASDTDAA SAESCVDTSG PNIKVGSLNS 
LSGTMAISEV TVRDAIKLAV DEINGAGGVL GKQIQLVGED GASEPTVFAE KAEKLISSDC
VAAVFGGWTS SSRKAMLPVF ESANSLLYYP VQYEGLESSP NIFYTGATTN QQIVPALDYL
KEKGVKSLYL VGSDYVFPQT ANRIIKAYAE ANGIEIKGED YTPLGSTDFS TIINKVRSAD
ADAVFNTLNG DSNVAFFREY RNVGLTPQDM PVVSVSIAEE EVGGIGVQNI DGQLTAWDYY
QTIDTPVNNE FVKAYKAKFG ADKPTSDPME AAYVSVYLWK NTVEKAQSFD VKAIQDNAGG
VTFDAPEGKV TIDGENHHIT KTARIGEIRP DGLIYTIWES PGPIEPDPYL KSYPWAAGLS
S