Gene Mjls_5200 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMjls_5200 
Symbol 
ID4880898 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. JLS 
KingdomBacteria 
Replicon accessionNC_009077 
Strand
Start bp5449797 
End bp5450984 
Gene Length1188 bp 
Protein Length395 aa 
Translation table11 
GC content70% 
IMG OID640142511 
Productcolicin V production protein 
Protein accessionYP_001073455 
Protein GI126437764 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000777524 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACACCGT CTCAGTGGCT CGACTTCCTC GTCCTCGCCG TTGCCTTCGT CGCCGCCGTC 
TCGGGCTGGC GCTCGGGCGC GTTGGGGTCC CTCATGTCGT TCATCGGCGT GGTGCTCGGC
GCGGTCGCCG GCGTGCTGCT CGCCCCGCAC GTGGTCACCC ACATCAGCGG TCCCCGCACC
AAACTGTTCG CGGCGCTGTT CCTGATCCTC GCGCTGGTGG TGATCGGCGA GATCGCCGGC
GTGGTCCTGG GCCGGGCCGT GCGCGGTGCG ATCCGCAACC GCACGCTGCG CCTGTTCGAC
TCCGTGATCG GGGTGGGCCT GCAGATCGGG GCGGTGCTGC TCGCGTCCTG GCTGCTGGCG
ACCCCGCTGA CGTCCTCGGA CCAGCCGAGC CTGGCCGCGG CCGTCAAGGG CTCACGGGTG
CTGGCCGAGG TGGACGACGT CGCACCGCCG TGGCTGAAGT CGGTGCCCAC ACGGCTCTCC
GGTCTGCTCG ACACCTCGGG CCTACCCGAA GTCCTCGAAC CGTTCGGACG CACCCCGATC
GCGACGGTCG ACGCCCCGGA CGCGGCGCTG GCCACCGATG CCGTGGTCGG CGCGACACGT
GGCAGCGTGG TGAAGATCCG CGGTGTCGCA CCCGGCTGCC AGAAGGTGCT CGAGGGCACC
GGTTTCGTGG TGTCGCCGAA CCGGGTGATG TCCAATGCCC ACGTCGTCGC CGGGTCGGAG
AGCGTCACCG TGGAGGTCGA CGGTCAGACC TACGACGCTT TCGTGGTGTC CTACGACCCG
AACGCCGACA TCTCGATCCT CGACGTCCCG GACCTGCCCG CGGCGCCGCT GCCGTTCGTC
GACGAGTTGG CGCCCCCGGG GACCGACGCC ATCGTGATGG GCTATCCGGG CGGCGGCGAC
TTCACCGCCA CCCCGGCGCG GATCCGCGAG ACCATCGAGC TCAACGGGCC CGACATCTAT
CGCAAGACCA CGGTGACCCG CGAGGTCTAC ACCATCAGAG GGACTGTGCG TCAGGGCAAT
TCGGGTGGTC CGATGATAAA CCGCGGCGGC AAGGTGCTGG GTGTGGTGTT CGGCGCCGCG
GTCGACGACG CCGACACCGG GTTCGTGCTG ACCTCCGACG AGGTGGGGGC TCAGCTGGCC
AAGGTGGGTA ACACCGCACG GGTGCCCACC GGCGTCTGCG TGAGCTGA
 
Protein sequence
MTPSQWLDFL VLAVAFVAAV SGWRSGALGS LMSFIGVVLG AVAGVLLAPH VVTHISGPRT 
KLFAALFLIL ALVVIGEIAG VVLGRAVRGA IRNRTLRLFD SVIGVGLQIG AVLLASWLLA
TPLTSSDQPS LAAAVKGSRV LAEVDDVAPP WLKSVPTRLS GLLDTSGLPE VLEPFGRTPI
ATVDAPDAAL ATDAVVGATR GSVVKIRGVA PGCQKVLEGT GFVVSPNRVM SNAHVVAGSE
SVTVEVDGQT YDAFVVSYDP NADISILDVP DLPAAPLPFV DELAPPGTDA IVMGYPGGGD
FTATPARIRE TIELNGPDIY RKTTVTREVY TIRGTVRQGN SGGPMINRGG KVLGVVFGAA
VDDADTGFVL TSDEVGAQLA KVGNTARVPT GVCVS