Gene Mjls_2094 details


Gene Information

Locus tag: Mjls_2094
Symbol:
ID: 4877815
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. JLS
Kingdom: Bacteria
Replicon accession: NC_009077
Strand:
Start bp: 2200368
End bp: 2202611
Gene Length: 2244 bp
Protein Length: 747 aa
Translation table: 11
GC content: 67%
IMG OID: 640139392
Product: hemerythrin HHE cation binding domain-containing protein
Protein accession: YP_001070372
Protein GI: 126434681
COG category:
COG ID:
TIGRFAM ID:
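
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function name cds_lengths is hypothetical and is not part of the source database.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_length = end_bp - start_bp + 1  # inclusive range
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    protein_length = codons - 1  # the stop codon encodes no residue
    return gene_length, protein_length


# Values taken from the record above.
print(cds_lengths(2200368, 2202611))  # (2244, 747)
```

With the values listed for Mjls_2094, this gives 2244 bp and 747 aa, matching the Gene Length and Protein Length fields.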


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.105486
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAGCCA TAGCCGACAC CCAAGCCACC TTGGACTGGC CCATCATCCG TGAGCAGGCG 
GCCGCGTTCG TGACCACGGA GTACGCGTCG CTGGACCGCC GCGGCGCCCC GATCACCTGG
CCGGTCACGC CGTATCTGGG CGCCGATGGG CGCACCATCG ATGTGGCTAC CGGCCTCACC
TACCCCCTCA AGGCCGAGCG GGCTCGGCGC AATCCGAAAG TGACGCTGTC GTTTTCGCAG
CCGCTCGGCT CGGGGCTCGC CGACCCCGCG ACGTTCGTGA TCCATGGGCT TGCCACGGTC
CGCGACGCCG ACCTGCGGGC CAACTCGGCC CGCTACCTCG CCGAAGTCGC AACGCGTTTA
CCCGAAGCCT TCGACAGGAT CCCCGCTGTG GTGCTGCGTC GGATGGCTTG GTACTGGGCC
CGCATCTGGA TCGAAGTCAC CCCTGTGCGG GTGCTGTGGT GGCCGGGCGG GAATCTCGAT
CACCGCCCGC AGCTCTGGGA GCCAGAGATC CCGCCCACCG CGCCGCCGTC TGATCCTGCG
CCGGTCGGAC CCGGTGCCGG GTCTTGGAAC ACGCGCGCAC CGGAGGACTG GCGGGTGCGG
GTGCGCGGTG CGCTCGACCG GCTAGGCATG CCGGTCCTGA CCAGCGTGAC ACCCGACGGC
TGGCCGATAC CCGTCCGCGT GCGCCACGCC GAGCAGATCC CCGGCGGCTT CCGGCTGCGT
CCACCCGTCG GCTGCGAGAT CGTCGACGGG GCGGCCTGCC TGACGTTCCA CACACATGGG
CCGGCGTTTG AAAGCCAAGA AAACATCAGC GTGACGGGGC AATGCCGCAA TGTCGGTGAA
TACGTCGAGT TCACGGCTGA GCGAGCGCTC AACGACTTCG TCCTTTCTGC CAATCCTGTG
CGTCGGGCCG CGTACCTGAT GTCTGCGGGT CGGCGGCTGC GGCTGCGGCT GGACTCCGAG
GCGCAGCGAC GTGGGCAGCG GGTGCCGCGA TTCGACGAAC TCGGCTTCAA TAAGACCAAG
CGCCAGAAGG ACCGTGCTGT GACACCCGAC GCTCAGCCCG CCGACACCCG GATGATGGGC
ATTGTCCACA ACGCGTTGCG CCGCGACATC GCTCGCGCGC AATCCGCGCT GACGCGGTGG
CCTTATCCCG ATCCCAGCCA GCGCGCCGCG ATCGCGAAGC ACCTGGCGTG GATGATGGAG
TTCCTTCATC GCCACCATCA CATCGAGGAT GATGGCCTGT ATCCCCTGGT GCGGGAACGC
GTCCCTGGGG CAGCTCAGAT TCTGGACGCG ATGGAGGCCG ACCACCACGC ATTGATACCG
GCGATCGACC GGCTCACCGA AACCGCCGGT CGCTACATCC AAAATCCCTC TGCGCGAACC
GAAGTGGCCA CCGCGCTTGA CGAGCTTGCG GCGGTGATGC TGCCGCATCT GCAGCGCGAG
GAGACCGAGA TGATGCCGGT AGTGTCGGCG GCGGTGACAA GGGCGGAATG GGAGGCTATC
GAGCAGGCCT CTGCCGTCAA GCCGCTCAAG CCTGCCGAGT TGGCCTTCAC CGCGCTGTGG
TTGTTCGATG ACGCCAGCGA GGAGGACCGT GAGGTGGTCC GATCCCTGGT GCCCAAGCCC
GTTGCGTGGG CGATCGAGAC CTTCACCACC CGTAGGTATG AGCGCTGCGT ATGGCGCTGT
TGGTACCTGC CGCAGCACAC CCGACTGCAC CGGAAATTCA ATGGGCAGAT CAGCGTGGAG
ATCGCCGCCC CGATCGAAGC AGTATGGAAA CAGGTCGCCG ATCCGGTGCG TGTTCCGCGG
TGGAGCCACG AGTGTCGCCG GGTGCGATTC CTGGACGGTA CGACGTCGGC GGGGTTGGGC
CGGCGATTCC GCGGGACCAA CCGCAGCGGC CGCTATCGAT GGTCGCGCAA CTGCACGATC
TTCACCTACG ACGAGCCTCT CGAGTTCGGT TACGTCACCT CCGGCGGTCT CGGTGACGCA
ACGGCCTGGC ACTTTCGGCT TGAGCCCACC GCCACCGGCA CCCGGCTCAC GCAGGCGTTC
CAGGGTGTGT CGATGCCGCT GTGGTTATCA CGATTGGTCT CGGTGCTGAT ACCCACTCAC
GACGACCGCA CCGATGCACT ACGCGGTGAC ATGGCGCGAC TGGCCGCGCT CGCCGCCGCG
CAGCACCCAC GCGCCGACGC GCCGGCGCCG GGCACCCCAG GTGATCGGAA TCGACGATCT
TTCAACGCCG CGTTGGAAAT TTGA
 
Protein sequence
MAAIADTQAT LDWPIIREQA AAFVTTEYAS LDRRGAPITW PVTPYLGADG RTIDVATGLT 
YPLKAERARR NPKVTLSFSQ PLGSGLADPA TFVIHGLATV RDADLRANSA RYLAEVATRL
PEAFDRIPAV VLRRMAWYWA RIWIEVTPVR VLWWPGGNLD HRPQLWEPEI PPTAPPSDPA
PVGPGAGSWN TRAPEDWRVR VRGALDRLGM PVLTSVTPDG WPIPVRVRHA EQIPGGFRLR
PPVGCEIVDG AACLTFHTHG PAFESQENIS VTGQCRNVGE YVEFTAERAL NDFVLSANPV
RRAAYLMSAG RRLRLRLDSE AQRRGQRVPR FDELGFNKTK RQKDRAVTPD AQPADTRMMG
IVHNALRRDI ARAQSALTRW PYPDPSQRAA IAKHLAWMME FLHRHHHIED DGLYPLVRER
VPGAAQILDA MEADHHALIP AIDRLTETAG RYIQNPSART EVATALDELA AVMLPHLQRE
ETEMMPVVSA AVTRAEWEAI EQASAVKPLK PAELAFTALW LFDDASEEDR EVVRSLVPKP
VAWAIETFTT RRYERCVWRC WYLPQHTRLH RKFNGQISVE IAAPIEAVWK QVADPVRVPR
WSHECRRVRF LDGTTSAGLG RRFRGTNRSG RYRWSRNCTI FTYDEPLEFG YVTSGGLGDA
TAWHFRLEPT ATGTRLTQAF QGVSMPLWLS RLVSVLIPTH DDRTDALRGD MARLAALAAA
QHPRADAPAP GTPGDRNRRS FNAALEI
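
Both sequences are printed in blocks of ten characters, so the grouping whitespace has to be removed before they can be measured or compared with the length and GC content fields. The following is a minimal sketch of that cleanup. The helper names clean_sequence and gc_fraction are hypothetical, and the example uses only the first line of the gene sequence rather than the full record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks used for display grouping; uppercase the result."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (grouping whitespace is ignored)."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGGCAGCCA TAGCCGACAC CCAAGCCACC TTGGACTGGC CCATCATCCG TGAGCAGGCG"
seq = clean_sequence(first_line)
print(len(seq), seq.startswith("ATG"))  # 60 True
```

Applying clean_sequence to the full gene and protein sequences should yield strings whose lengths can be compared with the Gene Length and Protein Length fields, and gc_fraction on the full gene sequence can be compared with the GC content field.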