Gene Mjls_1101 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMjls_1101 
Symbol 
ID4876840 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. JLS 
KingdomBacteria 
Replicon accessionNC_009077 
Strand
Start bp1178444 
End bp1180015 
Gene Length1572 bp 
Protein Length523 aa 
Translation table11 
GC content69% 
IMG OID640138413 
Producthypothetical protein 
Protein accessionYP_001069398 
Protein GI126433707 
COG category[S] Function unknown 
COG ID[COG2187] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0319349 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCCCGT CCAGTCGGGA CCTTTCGACG GGCCGTGAGG TCCGAAGACC GCTGCGGCAA 
ACCCGTCCGC GAAGCGACCA TTGGCCCATG ACCACCGCGG AGGAGACCGC TGCCGACGAT
TCGCCGGTGA ACGACATGGC CGCCGAGGTG TACGAGACAC ACACCGGTGT GGTGCTACTG
CTCGGCGAGA AGGCCTACAA GATCAAGAAG CCGGTGACCA CGGACTTCCT CGATTTCAGC
GCGCCGGAAC AGCGGGAGCG GGTGTGCGCC CGCGAAGTGG AGTTGAACAG CCGCCTCGCG
CCGGGCAGCT ACCTCGGCGT CGCCCACATG CACGGACCCG GTCACGACGT GCCCGAACCG
GTCGTGGTGA TGCGCCGCTA CCCCGACCGG TACCGGCTGC GGTCCATGGT GATACGCGGT
GAGTCGACCG AGAATCACCT CACGATGCTC GCGAGCATGC TCGCCCGCTT CCATGCGACG
GCGGACCGGC GCGCGGACAT CGATGCCTGC GCGACAGCCG CCGCGGTCAG GGCGCGGTGG
TGCGAGAACC TCGACGAGCT GGACCATTCC GCGGGTGCAG TGGTTTCGGC GCAGACGGTC
GACGAGGTCC GCCGACTGGC GTTGCGGTAT CTCGACGGCC GGGACGCGTT GTTCGCCGGC
CGCATCGCCG ACCGCCGCAT CGTCGACGGT CACGGCGACC TGCTGGCCGA CGACGTCTTC
TGCACACCCG ACGGCCCGGT TCCGTTGGAC TGCCTGGAGT TCGACGACCG GCTGCGCTTC
GTCGACGGCG TCGATGACGC CGCATTTCTG GCGATGGACC TGGAATTCCT GGGCCGGCGT
GACCTGGCCG ACCACTTCCT CGATCAGTAC CAGGAACTCG CCGGGGACTC CGCGCCGCGC
TCGCTGGTCG ACTTCTACAT CGCCTACCGC GCCGTCGTGC GCGCCAAGGT CGACTGCATC
AAGGTCGGCC AGGGACACGA GGACGCGGCG GCCGACGCAG GGTGGCACCT GGACATCGCC
GCCAACCACC TCAAAGCGGC CACGGTCCGG CTCGTCCTCG TCGGCGGGGG CCCAGGAACC
GGTAAGACCA CACTGTCGGG CGCCCTGGGT GAATCCGTTG GCGCCCATGT CATCTCGACC
GACAATGTCC GTCGGGAACT TCAGGACTCC GGAGTCGTCC ACGGCGCCGC CGGCGCTCTC
GAGAGCGGGC TGTACTCACC CGAGAACGTG GCACTGGTGT ACGACACCGT GCTGCATCGG
GCGGCGGTGC TGCTCGCTCA CGGCGAGTCG GTGGTCCTCG ACGGCACCTG GCGGGACCCG
GGTCACCGCA GGGCGGCGCG TGACTGCGCC GACCGTTCAT CAGCCGTTCT GGTGGAACTG
GCCTGCGATA CCGAACTCTC GGCGGCCCAG ACCCGGATCA CGCACCGGAC GTCGACCACC
TCCGACGCCA CCCCGCAGAT CGCGGCCGAC ATCACGACGC CCGTCTGGCA CGGCGCGCAC
CGTGTCGACA CCGGCCGCCC GTTGGCCGAC TCCGTGGCCG AAGCGCAGCA GATCTGTTGT
CTGGCCTACT GA
 
Protein sequence
MAPSSRDLST GREVRRPLRQ TRPRSDHWPM TTAEETAADD SPVNDMAAEV YETHTGVVLL 
LGEKAYKIKK PVTTDFLDFS APEQRERVCA REVELNSRLA PGSYLGVAHM HGPGHDVPEP
VVVMRRYPDR YRLRSMVIRG ESTENHLTML ASMLARFHAT ADRRADIDAC ATAAAVRARW
CENLDELDHS AGAVVSAQTV DEVRRLALRY LDGRDALFAG RIADRRIVDG HGDLLADDVF
CTPDGPVPLD CLEFDDRLRF VDGVDDAAFL AMDLEFLGRR DLADHFLDQY QELAGDSAPR
SLVDFYIAYR AVVRAKVDCI KVGQGHEDAA ADAGWHLDIA ANHLKAATVR LVLVGGGPGT
GKTTLSGALG ESVGAHVIST DNVRRELQDS GVVHGAAGAL ESGLYSPENV ALVYDTVLHR
AAVLLAHGES VVLDGTWRDP GHRRAARDCA DRSSAVLVEL ACDTELSAAQ TRITHRTSTT
SDATPQIAAD ITTPVWHGAH RVDTGRPLAD SVAEAQQICC LAY