Gene Mjls_3879 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMjls_3879 
Symbol 
ID4879589 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. JLS 
KingdomBacteria 
Replicon accessionNC_009077 
Strand
Start bp4105191 
End bp4107182 
Gene Length1992 bp 
Protein Length663 aa 
Translation table11 
GC content69% 
IMG OID640141191 
Producttranscription termination factor Rho 
Protein accessionYP_001072146 
Protein GI126436455 
COG category[K] Transcription 
COG ID[COG1158] Transcription termination factor 
TIGRFAM ID[TIGR00767] transcription termination factor Rho 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.495548 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCGATA CGGACCTCTT CACGGCCGAC AGCGCTGAGC GAACAGAGTT GCCGAACGTC 
GTGAACACCG AAACTTCCAC GGCGTCGGAG GGCCCGGCCG TCACCACGAC GACCGCGGAG
TCCGCCCCGA CCGCCGACGT GGCGTCGGGT GACCGTGCCG CCTCGCTGAC CTCGATGGTC
CTGCCGGAAC TGCGCGCGCT GGCGCGCCAG ATCGGCGTCG AAGGCGCCTC CGGCATGCGC
AAGAGCGAAC TGATCGCCGC GATCCGCGAA CGCCGCGGCG ACGGCGGTGC CGCCAAGGCG
GAGAGCGCCG CCGCACCGCA GACGCCCCCG CGCCGTGAGC GTCGCAGCGC ATCGCGTCAG
GCCGGCGCCG CCGAGGCCAA GGCTCCAGAG GCCCTCGAGG CCAAGGCCCC CGAGGTGAAG
GCCCCCGAGG TGAAGGCCCC CGAGGTGAAG GCTGATGCAC CGGCCGAAGA GGCCGCTGCG
GAGCCCAAGA CCGAGCGGCC CAGGAACGAC CGCACCAAGG CCGAGAAGCC CGCGGCGCAG
CAGCCCGACG GCGAGCAGCC CAAGGGAGAC CAGCCGAAGG GCGAGCAGTC CAGGGGCGAT
CAGCGCCGGT CCGATCGTCC GCGGGGCGAT CAGTCCGACA CCAGGTCCGA CGACCGGTCC
GACAGTGACC AGCAGCAGGG CCAGGGCAAC CGAAACAACA GCAACAACAG CGCCGATGGC
GACGACGACG GCGACGGCCG GGGTGGCCGC CGCGGCCGCC GGTTCCGCGA CCGCCGGCGC
CGTGGTGAGC GCGGTGGCGA GGGCGGCGGC GGTGGCGATA CCGAGATCCG CGAGGACGAC
GTCGTCCAGC CCGTCGCCGG CATCCTCGAC GTCCTCGACA ACTACGCGTT CGTCCGCACC
TCGGGCTACC TGCCCGGTCC GAACGACGTC TACGTGTCGA TGAACATGGT GCGCAAGAAC
GGGCTGCGCC GCGGCGACGC CGTCACCGGT GCGGTCAAGG TGCCCAAGGA GGGCGAAGGC
GGCGGTCAGA ACCAGCGCCA GAAGTTCAAT CCGCTGGTGC GTCTCGACAG CGTCAACGGC
GGGCCGGTCG AGGATGCGAG GAAGCGTCCC GAATTCGGCA AGCTGACCCC GCTGTACCCC
AATCAGCGGC TGCGTCTGGA GACCTCTCCG GACAAGCTGA CCACCCGCGT CATCGACCTG
ATCATGCCGA TCGGCAAGGG GCAGCGCGCG CTGATCGTGT CGCCGCCCAA GGCCGGTAAG
ACCACGATCA TGCAGGACAT CGCCAACGCG ATCACCCGCA ACAACCCGGA ATGCCACCTG
ATGGTGGTGC TCGTCGACGA GCGTCCGGAA GAGGTCACCG ACATGCAGCG CTCGGTCAAG
GGTGAGGTCA TCGCCTCGAC CTTCGACCGG CCGCCGTCCG ACCACACCAC GGTCGCCGAA
CTGGCCATCG AGCGCGCCAA GCGCCTGGTG GAGCAGGGCA AGGACGTCGT GGTGCTCCTC
GACTCGATCA CCCGCCTCGG CCGGGCCTAC AACAACGCCT CGCCGGCCTC GGGCCGCATC
CTGTCCGGTG GTGTGGACTC GACCGCGTTG TACCCGCCGA AGCGCTTCCT GGGTGCCGCA
CGCAACATCG AAGAGGGCGG CTCACTGACC ATCGTCGCGA CCGCGATGGT GGAGACCGGT
TCGACCGGTG ACACGGTCAT CTTCGAGGAG TTCAAGGGCA CCGGTAACGC CGAGCTCAAG
CTCGATCGCA AGATCGCCGA ACGCCGGGTG TTCCCCGCGG TCGACGTGAA CCCGTCGGGC
ACCCGTAAGG ACGAGCTGCT GCTCGGCCCC GACGAGTTCG CGATCGTGCA CAAGCTGCGC
CGGGTGCTGT CGGGTCTCGA CAGCCATCAG GCCATCGACC TGCTGATGAG TCAGCTGCGC
AAGACCAAGA CGAACTACGA GTTCCTGGTG CAGGTCTCCA AGAACACACC GGGGTCGGTG
GACAACGACT GA
 
Protein sequence
MTDTDLFTAD SAERTELPNV VNTETSTASE GPAVTTTTAE SAPTADVASG DRAASLTSMV 
LPELRALARQ IGVEGASGMR KSELIAAIRE RRGDGGAAKA ESAAAPQTPP RRERRSASRQ
AGAAEAKAPE ALEAKAPEVK APEVKAPEVK ADAPAEEAAA EPKTERPRND RTKAEKPAAQ
QPDGEQPKGD QPKGEQSRGD QRRSDRPRGD QSDTRSDDRS DSDQQQGQGN RNNSNNSADG
DDDGDGRGGR RGRRFRDRRR RGERGGEGGG GGDTEIREDD VVQPVAGILD VLDNYAFVRT
SGYLPGPNDV YVSMNMVRKN GLRRGDAVTG AVKVPKEGEG GGQNQRQKFN PLVRLDSVNG
GPVEDARKRP EFGKLTPLYP NQRLRLETSP DKLTTRVIDL IMPIGKGQRA LIVSPPKAGK
TTIMQDIANA ITRNNPECHL MVVLVDERPE EVTDMQRSVK GEVIASTFDR PPSDHTTVAE
LAIERAKRLV EQGKDVVVLL DSITRLGRAY NNASPASGRI LSGGVDSTAL YPPKRFLGAA
RNIEEGGSLT IVATAMVETG STGDTVIFEE FKGTGNAELK LDRKIAERRV FPAVDVNPSG
TRKDELLLGP DEFAIVHKLR RVLSGLDSHQ AIDLLMSQLR KTKTNYEFLV QVSKNTPGSV
DND