Gene Mjls_3334 details


Gene Information

Locus tag: Mjls_3334
Symbol:
ID: 4879046
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. JLS
Kingdom: Bacteria
Replicon accession: NC_009077
Strand:
Start bp: 3494350
End bp: 3496002
Gene Length: 1653 bp
Protein Length: 550 aa
Translation table: 11
GC content: 71%
IMG OID: 640140635
Product: stage II sporulation E family protein
Protein accession: YP_001071603
Protein GI: 126435912
COG category: [K] Transcription; [T] Signal transduction mechanisms
COG ID: [COG2208] Serine phosphatase RsbU, regulator of sigma subunit
TIGRFAM ID:
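
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end coordinates are 1-based and inclusive, and that the CDS ends with a single untranslated stop codon. The variable names are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 3494350
end_bp = 3496002
gene_length_bp = 1653
protein_length_aa = 550

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codons = span_bp // 3
expected_protein_aa = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", span_bp % 3 == 0)
print("protein length consistent:", expected_protein_aa == protein_length_aa)
```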


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.231097
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGCGCGG ACGGTCCCAT CGGCGTCGCC ATGGGTCTCG ACGCCGCGTG GTCGTCCATC 
CCCCACCCGG TGATCGTCGT GACCAGCGAC GGAGTGGTCC GCGCGGTCAG CACCTCGACG
CATTCGGTGC TGCCGTCGGC GATACCCGGC TCGGATCTGG ACGACGTCGC TCCCCCGTGG
TTGGCGCAGG CACACCGCCG GCTGGTCCAA CGACTCGGCA CACCCGACGC CGACGACGCT
GCCGCATCCG GCAGCCTGCA CGGCAAACGT TTCGAGGCCC GCCCGACCGT GCTCGACGGT
CAGATCGCCT GGTGGTTGGT CGAGGACGCC GGCCGCGAGC TGTGGGATAC GAAGCAGGCC
CTGACCCGGG AACAGGCGCG CACCTCGTTC CTCGGCGAGG CCTCGGCGGT GCTGATGGCC
ACCCTCAACG TCGACCGCTG CATGTCGGCG ACCGTCCACC TGGCGGTGCG GCACCTCGCC
GACGCGGCGT CGGTCGTCGC CCCGGTCACC GGCAACCGGT TGCCGGTGGT GTGTGGTGAC
CGCGGCGCGG TGGAGCAGCG CACGGTCGAG GCGGATCCCT CGGATGTGCC GGGGCTCAGT
GAGGCGCTGC GCGGCTTCCC ACCGGTGCCG TCACGGTGGC TCGATCCGGC CGCACTGCCC
GAGTGGCTGG TCCCGTCGAC GTTCGACGGT CCGGTGGGGT CGGTGCTGAT CACCCCGCTG
CCCGGCCTCG GCGTGCCGGC CGGGGCGCTG GTACTGCTGC GGCGGGCATC CGAACCGGTC
TTCGGAGAGG ACGACGAGCT GTCCGCGCGG CTGTTCGCCG CACGTGCCGG TGCGGCCCTG
TCCACGGCGG GGCTCTACGC CGAACAGTCG GCGATCACCC GCACACTGAT GCGCGACCTC
GTCCCGCCCC AGCTTCGCCG GCTGCACGGC TTCGAACTGG CCGGCGGGTA TCGCGCCTCG
GAGGACCATC AGATCGTCGG CGGCGACTTC TACGACGTCC ACCCCGGCGC CACCCCCGAG
GACGACACGT TGGTCGTACT CGGCGACGTA TGCGGCAAGG GTCTCGAGGC CGCGGTCCTG
ACCGGCAAGA TCCGCAACAC ACTCCAGGCG CTGGCGCCGC TGGCCCAGGA CCACGGCGGT
GTGCTCAGGT TGCTCAACAG CGCCCTGCTC TCGGCCGACC ACACACGCTT CGCCACCCTG
GTCCTGGCAT CGGTGGCGCG CCGCGACGGT CAGGTGGTGC TGCGATTGAC CAGCGCCGGG
CACTGTGCGC CGTTGATCGT GCGCAGCGAC GGGCGGGTCG AGGAGGCCGA CACCCGCGGT
CAACTGGTGG GTGTGCTGGA GCAGATCCAG GCCCGCACAT TCGAGACGGT GCTGGCGCCG
GGTGAGACGT GCGTCCTCTA CACCGATGGT GTGACCGAGG CGTGGGGCGG ACCGCTCGGT
ACCGACATGT TCGGTGAGCA GCGCCTCGCA GCCGCCCTCG AGGAGTGCGC GGGGATGCCC
GCCGAAGCCG TGGTCGAACG GATCATGATG CTCACGACGC AGTGGGTGCG TCGCCGCGAG
CACGACGACA TCGCCGTCGT CGCCATCACC GCCCCACGCC GGACGCACCT CAGCGCGGTC
GACGGCCACA CCGCCGGGAG GTACACCGCT TGA
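
The gene sequence is printed in space-separated blocks of 10 bases, so the whitespace has to be removed before any analysis. The following minimal sketch shows one way to do that and to compute GC content. For brevity it uses only the first line of the sequence, so its result describes that 60-base fragment and not the whole gene. The function names are hypothetical.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied as printed in the record.
first_line = "ATGGGCGCGG ACGGTCCCAT CGGCGTCGCC ATGGGTCTCG ACGCCGCGTG GTCGTCCATC"

fragment = clean_sequence(first_line)
print(len(fragment))
print(round(gc_percent(fragment), 1))
```

Passing the full sequence text to `clean_sequence` in the same way would give a value comparable to the GC content field in the Gene Information table.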
 
Protein sequence
MGADGPIGVA MGLDAAWSSI PHPVIVVTSD GVVRAVSTST HSVLPSAIPG SDLDDVAPPW 
LAQAHRRLVQ RLGTPDADDA AASGSLHGKR FEARPTVLDG QIAWWLVEDA GRELWDTKQA
LTREQARTSF LGEASAVLMA TLNVDRCMSA TVHLAVRHLA DAASVVAPVT GNRLPVVCGD
RGAVEQRTVE ADPSDVPGLS EALRGFPPVP SRWLDPAALP EWLVPSTFDG PVGSVLITPL
PGLGVPAGAL VLLRRASEPV FGEDDELSAR LFAARAGAAL STAGLYAEQS AITRTLMRDL
VPPQLRRLHG FELAGGYRAS EDHQIVGGDF YDVHPGATPE DDTLVVLGDV CGKGLEAAVL
TGKIRNTLQA LAPLAQDHGG VLRLLNSALL SADHTRFATL VLASVARRDG QVVLRLTSAG
HCAPLIVRSD GRVEEADTRG QLVGVLEQIQ ARTFETVLAP GETCVLYTDG VTEAWGGPLG
TDMFGEQRLA AALEECAGMP AEAVVERIMM LTTQWVRRRE HDDIAVVAIT APRRTHLSAV
DGHTAGRYTA
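
The protein sequence follows from the gene sequence through translation table 11, whose codon-to-amino-acid assignments are the same as those of the standard genetic code. The sketch below builds that codon table in plain Python and translates the first and last stretches of the gene as printed above. It is a simplified illustration, not the database's own method: it ignores alternative start codons and ambiguous bases, and the helper names are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (standard assignments,
# shared by translation table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw_dna: str) -> str:
    """Translate block-formatted DNA, stopping at the first stop codon."""
    dna = "".join(raw_dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 33 bases of the gene sequence in the record.
gene_start = "ATGGGCGCGG ACGGTCCCAT CGGCGTCGCC"
gene_end = "GACGGCCACA CCGCCGGGAG GTACACCGCT TGA"

print(translate(gene_start))
print(translate(gene_end))
```

The two results can be compared with the first block and the final line of the protein sequence above.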