Gene Mjls_1828 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMjls_1828 
Symbol 
ID4877550 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. JLS 
KingdomBacteria 
Replicon accessionNC_009077 
Strand
Start bp1930212 
End bp1931792 
Gene Length1581 bp 
Protein Length526 aa 
Translation table11 
GC content67% 
IMG OID640139125 
Productextracellular solute-binding protein 
Protein accessionYP_001070107 
Protein GI126434416 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCCGCC GCCTGAAGAC CGTCGCCGCA CTCGCGGCGG CCGCCGCACT GACCCTGAGC 
GCCTGTGGCG GTTCGGATTC CGGTGGCGGC GCCCCCAGTG CCGCACCCAC CGACAAGGTG
CTGCACCTGT CGTTCCTGCA GGACCCGGGG CAGCCACCGG ACCCGGACGT CTTCTACGCC
GGACAGGGTC TACTGCTCAC CACCAACGTG TACGAAGGCC TGATGCAGTA CAAGGGTGGC
ACCGAGAAGG CGGAGATCGA ACCGCTGCTG GCCACGGAGT GGACCGAATC ACCGGACCAC
CGCGTGTTCA CCTTCAAGCT GCGGGAGGGG GTGACGTTCC ACGACGGGAC ACCGTTCACC
GCCGAGGCGG TCAAGGCGTC CTTCGACCGG AGGCTGGCCG TCGACCAGGG CCCCGCGTAC
ATGGTCGCCG ACGTGGAATC GATCACCACC CAGGGCGACC ATGCCGTCAC GATCACCCTC
AAGGCACCGA ACGCGGCGTT CCTGGACTAC CTCGCCTGCC CGTACGGTCC GCGCATGCTC
AGCCCGAAGG GGTTGGCCGA CAACGCCGGT GACGACCACG CCCAGAACTA CCTGACCACC
CACGATCTCG GCACCGGACC GTACACGCTG ACCGCGGCCG AAGTGGGATC GCGCTACGCA
CTGGCCGCCT ACCCCGGATA CTGGGGCGAG AAGCCGTATT TCGAGCAGGT GGAGATCCCG
GTCATCACCG ACGTGTCCGC CCAGCAGCTT CAGTTCAACA ACGGTCAGAT CGCCGCGATC
CTGCACGATC TGCCGTCGTC GGCGGTCGAG TCGTATCTCA ACAACGACAA GTACGCCCAC
TTCTCGCTGC CGACGATGAT GTCGAACTAC CTCTACCTCA ACCCGCGCCG CGGCATGCTC
ACCGACCCGA AGAACCGCGC CGCCGTGCTC GCCGCCATCG ACGTCGACGC GCTGGTCAAA
CAGACCTACT TCGGACGCGG CAAGAAGGCA GAACAGCTCT ACCCGCCGAA CATGATCGCC
CCGGAGTTGG CCAAGCAGAA CGTCACCCAC GACCCCTCGC TGCTCACCGA GATCGCGGCC
GGACTGCCCG CCGACCAGAA GGCCGTCACC ATCGGATACG ACTCCTCCAA CCCCGACAAC
CAGCTGATCA ACAACCTGAT CCAGACTCAG CTGGCCGCAG CCGGGCTCAA CGCCAAGGTG
CAGAGCTACC CGACCTCGGA GATCTACGGC TGGATCGGCA ACGACGCCCC CAACGCGCCG
GACATCCTGA CCGGTACGGC GTGGCCGGAT GCGCCGTCGC CCTACACCTG GGGTCATATC
TCCTGGGACG CCGACGGCGG GTTGAACTAC CTGGGCTGCT CGGCGCCCCC GGTGACCAGC
GCACTGGCTC GTGGTCTGGA AACCGGTGAC CCGCAGGTGT TCTCGGAGGC CGCCAAGGCC
GCCGCCGACA CCGGCTGCTG GCTCAACATC GCCGACGTCG ACGACTTCGT AGTCGCCCAG
CCGTGGCTCG CAGGGGTCGA GGAGGCGCAC GTGGTGACCA ACCCGAACTC GCTTCGGCTC
TTCGAACTCT CGGTCGCCTG A
 
Protein sequence
MIRRLKTVAA LAAAAALTLS ACGGSDSGGG APSAAPTDKV LHLSFLQDPG QPPDPDVFYA 
GQGLLLTTNV YEGLMQYKGG TEKAEIEPLL ATEWTESPDH RVFTFKLREG VTFHDGTPFT
AEAVKASFDR RLAVDQGPAY MVADVESITT QGDHAVTITL KAPNAAFLDY LACPYGPRML
SPKGLADNAG DDHAQNYLTT HDLGTGPYTL TAAEVGSRYA LAAYPGYWGE KPYFEQVEIP
VITDVSAQQL QFNNGQIAAI LHDLPSSAVE SYLNNDKYAH FSLPTMMSNY LYLNPRRGML
TDPKNRAAVL AAIDVDALVK QTYFGRGKKA EQLYPPNMIA PELAKQNVTH DPSLLTEIAA
GLPADQKAVT IGYDSSNPDN QLINNLIQTQ LAAAGLNAKV QSYPTSEIYG WIGNDAPNAP
DILTGTAWPD APSPYTWGHI SWDADGGLNY LGCSAPPVTS ALARGLETGD PQVFSEAAKA
AADTGCWLNI ADVDDFVVAQ PWLAGVEEAH VVTNPNSLRL FELSVA