Gene Mjls_4867 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMjls_4867 
Symbol 
ID4880566 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. JLS 
KingdomBacteria 
Replicon accessionNC_009077 
Strand
Start bp5109910 
End bp5112204 
Gene Length2295 bp 
Protein Length764 aa 
Translation table11 
GC content73% 
IMG OID640142173 
Producthypothetical protein 
Protein accessionYP_001073123 
Protein GI126437432 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.0900282 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAGCAC ACACTCCGCC CACGGGCAGG AGCGAAGCGA CTCGGGGGAC CACCGGTGTC 
CCTCTGGGCG CCTGGTTGGC CCAACTACCC GACGAGCGGC TGATCCGGCT GCTCGAGTTG
CGCCCTGACC TCACCCAGCC GCCGCCGGGC ACCATCGCCG CGCTGGCCGC ACGGGCGACG
TCACGACAGT CGGTCAAGGC CGCCACCGAC GGCCTCGATT TCCTGCGGCT GGCCGTGCTC
GATGCGCTGC TGGTGCTGCA CGCCGACACC ACCGCGGTGC CGTTGACCAA GTTGTTCGAG
CTGATCGGGG CGCGCGCGGA CGAGGGTGCG ATCGTCGTCG CGGTCGACGA TCTGCGCGCG
CGGGCGCTGG TGTGGGGTGA CGACGACCAG GTGCGTGTTG CGGCCGAGGC GGCGTCCGGG
CTGCCGTGGT ATCCGGGTCA GGCGGTCGTG GAGACGGCCG AGCACGGCGC CGACGACATC
GCGCAGAAGC TGGCCGGACT CGACACCGCA CAGCGTGAGC TGCTCGAGCG GCTCCTCGAG
GGGTCCCCGG TCGGCCGCAC GCGGGATGCG GCGCCCGGAA CGCCGGCCGA CCGTCCGGTG
CAGCGCCTGC TGGCGGCGGG GCTGCTGCGC CAGGTCGACG ACGACACCGT GATCCTGCCC
CGTCTCGTCG GCCAGGTGCT GCGCGGCGAG GCGCCCGGAC CGACGGAGTT GAACCCACCC
GATCCCGTCA CGACCTCGAC GAAACCGTCC GACGTCGACG CCGTCGCCGC CGGCGCGGCG
ATCGACGCGC TGCGTGAGGT CGATGTGGTG CTCGAGGCGC TCTCGGCGGC CCCGGTGCCG
GAACTGCGCA GCGGCGGCCT CGGCGTGCGC GACCTCAAAC GCCTCGTGAA AGCCACGGGG
ATCGACGAGC GTCGGCTGGG GCTGATCCTG GAGGTGGCGT TGGGGGCGGG CCTCATCGCG
GCCGGGATGC CCGAACCGGA TCCGGGCGAC GGGACCAGCA CGTTCTGGGC GCCGACGGTG
GCGGCCGATC GGTTCATCGA GTCACCGACC GCGGTGCGCT GGCACCTTCT GGCCTCGACG
TGGCTCGACC TGCCCGCCAG GCCGGGGCTC ACCGGCAGCC GGGGACCCGA CGGCAAACCG
TATGCGTCGC TGTCGGATGC GCTGTACTCG ACGGCGGCTC CGCTGGACCG CCGGCTGCTG
CTGGCGGTGC TGGCCGACCT GCCCGCAGGT TCGGGGGTCG ACGCGGCGTC GGCCTCTCGG
GCGATGATCT GGCGCAGGCC GCGCTGGGCG GTCCGGCTGC AGCCCGAACC GGTCGGCGGT
CTGCTCACCG AGGCGCACGC ACTCGGCATG GTGGGCCGCG GCGCGATCGC GACACCCACC
CGCAGGCTGC TCGCCGGTGA ACCGCCCGAG GACGTCGTGG CGGCCAAGGC CAAGGTGCTG
CCCGCCCCCA TCGACCATTT CCTGGTTCAG GCCGACCTGA CCGTCGTCGT CCCCGGCCCG
CTCGAACGCG ACCTCGCCGA GCAGCTGGCG GCCGTCGCGG CGGTGGAGTC CGCGGGCGCG
GCGATGGTGT ACCGGGTCAG TGAGGCGTCG ATCCGCCGTG CGCTCGACAC CGGCAAGACC
GCCAGCGAAT TGCACTCGTT CTTCGGGCGG CATTCGAAAA CCCCTGTGCC GCAGGGGTTG
ACGTATCTGA TCGACGACGT CGCGCGTCGT CACGGCCAGC TCCGGGTCGG TATGGCGGCG
TCGTTCGTGC GGTGCGAGGA TCCGGCGCTG CTGGCCCAGG CCGTCGCCGC ACCGGCCACC
GGCGCGGTGG AACTGCGGTT GTTGGCGCCG ACGGTGGCGG TGTCGCAGGC GCCGATCGCC
GACGTGCTCG CCGCGCTGCG CAACGCCGGG CTCGCCCCGG CGGCCGAGGA CTCGTCCGGC
GCGATCGTCG ACATCCGCTC CCGCGGTGCC CGGGTGCCGG CACCGGGCCG GCGACGGGTC
TTCCGCCCCG CGCCCACCCC GACCGGCCAG ACGCTCGGTG CGATCGTCGC GGTGCTGCGC
AAGGTCGCCG CCGCGCCGTC CGGGAACATG CGGCTCGATC CGGGCGTTGC GATAACGCAG
CTGCAGGAAG CGGCGCTACA GCAGACTTCG GTGGTGATCG GCTACGTGGA CCCGGCCGGG
GTGGCCACGC AGCGGGTGGT GGCCCCCGTC AACGTCCGCG GCGGCCAGTT GACCGCCTAC
GATCCGGCAT CCGGGCGCGT GCGCGAATTC GCGATTCACC GCGTTACCTC GGTGGTGTCG
GCCGAGAACG AATAA
 
Protein sequence
MTAHTPPTGR SEATRGTTGV PLGAWLAQLP DERLIRLLEL RPDLTQPPPG TIAALAARAT 
SRQSVKAATD GLDFLRLAVL DALLVLHADT TAVPLTKLFE LIGARADEGA IVVAVDDLRA
RALVWGDDDQ VRVAAEAASG LPWYPGQAVV ETAEHGADDI AQKLAGLDTA QRELLERLLE
GSPVGRTRDA APGTPADRPV QRLLAAGLLR QVDDDTVILP RLVGQVLRGE APGPTELNPP
DPVTTSTKPS DVDAVAAGAA IDALREVDVV LEALSAAPVP ELRSGGLGVR DLKRLVKATG
IDERRLGLIL EVALGAGLIA AGMPEPDPGD GTSTFWAPTV AADRFIESPT AVRWHLLAST
WLDLPARPGL TGSRGPDGKP YASLSDALYS TAAPLDRRLL LAVLADLPAG SGVDAASASR
AMIWRRPRWA VRLQPEPVGG LLTEAHALGM VGRGAIATPT RRLLAGEPPE DVVAAKAKVL
PAPIDHFLVQ ADLTVVVPGP LERDLAEQLA AVAAVESAGA AMVYRVSEAS IRRALDTGKT
ASELHSFFGR HSKTPVPQGL TYLIDDVARR HGQLRVGMAA SFVRCEDPAL LAQAVAAPAT
GAVELRLLAP TVAVSQAPIA DVLAALRNAG LAPAAEDSSG AIVDIRSRGA RVPAPGRRRV
FRPAPTPTGQ TLGAIVAVLR KVAAAPSGNM RLDPGVAITQ LQEAALQQTS VVIGYVDPAG
VATQRVVAPV NVRGGQLTAY DPASGRVREF AIHRVTSVVS AENE