Gene Mflv_0804 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMflv_0804 
Symbol 
ID4972132 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium gilvum PYR-GCK 
KingdomBacteria 
Replicon accessionNC_009338 
Strand
Start bp828560 
End bp829666 
Gene Length1107 bp 
Protein Length368 aa 
Translation table11 
GC content56% 
IMG OID640454999 
Productrestriction modification system DNA specificity subunit 
Protein accessionYP_001132076 
Protein GI145221398 
COG category[V] Defense mechanisms 
COG ID[COG0732] Restriction endonuclease S subunits 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.526121 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.427419 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGTGGC CGGAGGTCCC GATAGACAGC TTCTGCAGGC CTAAGCAGTG GCCGACCATA 
TCTCAAAGCC AGCTGACACC GACCGGATAC CCGGTTTACG GGGCCAACGG CCAAATAGGT
TGGTACTCCA CCTATAACCA TGAATCCGAG ACCGTGTTGA TCACCTGTCG CGGCGCAACT
TGCGGAACAG TAAACGTCAG CCCACCCAAG TCGTATGTAA CAGGCAACGC AATGGCGCTC
GACTCGCTCG ACGAGGCGCG TATCCATCTA CGATACTTGG TACACGTTTT GACACCCGAG
CGGCTACGAC GGTCAATCAC TGGCAGCGCT CAACCTCAGA TCACTCGCGA AAGCCTCAAA
GCTATAACAG TGCCACTCCC ACCCTTGGCC GATCAGCGCC GTATCGCCGC GATTCTCGAC
CAGGCTGATC GCCTGAGATC ACACCGCCAC GGACTGCTGC GACGTTACAG CGAACTGAAG
CGAGCGGGCT TTGCGTCAAT GTTTGCAGGG ATTTCAAGTT CGGGAAAACT CGGCGATTAC
GGAGAGGTTC AAGGCGGGCT CCAGGTTTCA CGGAAGAGGG AATCACTTCC CCTTGAGCGA
CCCTACTTGC GGGTGGCGAA CATCTATCGT GGGAAGCTCG ATCTCGGCGA GGTTAAAACG
ATTCGTGTGA CCGAGGCTGA ATCGATGCGG GTTAGGTTGG AGCCCGGTGA TCTATTGTTC
GTTGAAGGTC ACGCGAATCC GAACGAAGTC GGCCGAGTTG CTGAATGGAA TGGCTCGGTG
CCAGATTGTC TACACCAGAA TCACCTCATC CGCGTTCGGC TGGACAGGTC GGCGGTAGAG
CCGACATATG CGGAGGCGTG GTTCAACTCG CGCGATGGAT CGATGCATTT TCAGCGGGCA
GGAAAAACCA CCTCTGGACT GAACACCATC AATGCTTCGC AATTGCGGGC AGCGCCGTTA
CCGGTGCCAC CAATCAGTTT GCAGCGAGAG TACGTCACCG TGGCGAATGC GATCGACAAC
CACCTGCGTG ATCAAACCAT GCAAAGTGAG TTAGTCGACG AGCTATTCGT CTCCCTCCAA
TCTCGCGCGT TCTCCGGGCA GTTGTGA
 
Protein sequence
MKWPEVPIDS FCRPKQWPTI SQSQLTPTGY PVYGANGQIG WYSTYNHESE TVLITCRGAT 
CGTVNVSPPK SYVTGNAMAL DSLDEARIHL RYLVHVLTPE RLRRSITGSA QPQITRESLK
AITVPLPPLA DQRRIAAILD QADRLRSHRH GLLRRYSELK RAGFASMFAG ISSSGKLGDY
GEVQGGLQVS RKRESLPLER PYLRVANIYR GKLDLGEVKT IRVTEAESMR VRLEPGDLLF
VEGHANPNEV GRVAEWNGSV PDCLHQNHLI RVRLDRSAVE PTYAEAWFNS RDGSMHFQRA
GKTTSGLNTI NASQLRAAPL PVPPISLQRE YVTVANAIDN HLRDQTMQSE LVDELFVSLQ
SRAFSGQL