Gene M446_0540 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_0540 
Symbol 
ID6135238 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp641718 
End bp644888 
Gene Length3171 bp 
Protein Length1056 aa 
Translation table11 
GC content56% 
IMG OID641640861 
Productpeptidase C14 caspase catalytic subunit p20 
Protein accessionYP_001767536 
Protein GI170738881 
COG category[R] General function prediction only 
COG ID[COG4249] Uncharacterized protein containing caspase domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0423513 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0609988 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCAAAG CAACTGCCTT GGTGCTTGCC TTAGGGCTCC TCGGCCAGGC GAGTTCGGGG 
GCCCGGGCGT TGGACGCGAT TGGCCACAAT GCTCGCGGCT TCGAACTGCA AAATCGAGGC
GAGCATGAGA AAGCAATCGC CGAATTCAAT CTTGCCCTGC GGCTCAATCC CAAACTGGTG
TCAGCCTACA TCAATCGCGG TTTCGCCTTT CGAAACAAGG GTGACTACGA TCGCGCGATC
GCCGATTATG ATCATGCCTT GCAGATCGAT CCAAATTCAG TAGTCGCCTT TAACAATCGC
GGTGATGCAT TTTATCATAA GGGAGAATAC GATCGAGCGA TTGCAGACTA TAATCGCTCA
ATCAAGCTGA GCTCCGACAA GGCAGCCGTA TATAATAATC GTGGGCTCGC ATTCTTCAGC
AAGGAGGAAT ACGATCGGGC CATCGCGGAT TACAATCAAG CTTTGCGGCT CGACCCTAAA
TACCTGAGCG CCGCGCTCAA CCGCGGGGAT GCGTTTCGAA GCAAGGGCGA GTATGATCGT
GCGATCGCGG ACTATAATCA AGTCCTGCAG ATCGATCCCA GGTCCGTGGT ATCTTACAAC
AATCGCGGGC TCGCGTTCCA GGGCAAAGGC GAATACGACC GAGCTGTTGC CGATTACAAC
CAAGCGCTCA CTCTCGATCC GGGCTACACG ATCGCTCTCA TCAACCGCGG AGATGTGTTC
AGGATCAAAG GTCAGTACGA TTCTGCGATT GAGAATTATA ACCAAGCCTT GCAGTTAAAT
CCAAAATCAA AGATTGCCTA CAACAATCGA GGCTTCGTGT TCTACAACAA AGGAGAGTAC
GACCGCGCGA TTGCGGACTA TAATTCAGCG CTGCAGATTG ACCCCAGATA CGTGGTCGCG
CTCGTCAATC GTGGCGATGC CTTTGTGAGC AAGGGTGACT ACGATCGGGC GATCGGAGAC
TATGGCCATG CTCTCCAGAT CAATCCGAAT TACGCCTTTG CCTATAATGG GCGTGGAGTC
GCCCTTCAGA ACAAAGGTGA GTATGACCGC GCCATCATGG ATTACGATCA AGCCTTGCGA
CTCGATCCGA AATACGTTTT CGCCTTTGCG AACCGCGGGG ATGCCTTCCG GAGCAAAGGC
GAGCACGACG TCGCGATCGC CGATTACAAT CAGGCCCTTC GCTTAAGCCC AAACTACGCC
AAAGCCTATA ATGGTCGAGG TCTGTCCTTC CAGAATAAGG CTCAATACAA CCGGGCAATC
GAGGATTATG AACAAGTCAT TCGTCTCGAC CCAAGGTTCG TGGCCGCCTA CAACAATCGC
GGCTTTGCCC TCGTCAGCAA AGGTGAGCCG ACCCTTGCGA TCGCGGATTA CGACAAAGCC
TTGTTGCTCG ATCCGAAATC TGCGACGGTT TACGCGAACC GTGGGCGTGC CTTTCAGGAC
AAGGGTGAGT ACGATCGCGC AATCGCGGAC TATGATCAGG CCTTGCGCCT CAATCCGAAG
GACGCCATCG CCTTGAACAA TCGCGCGGAT ATTTTGCGCC TTCGGCACGA GCATGATCGG
GCAATTGCAA GCTATGATCA AGCCCTGCAG CTCAACCCGA AATACGTGGG TGCTTATAAT
AGCCGTGGGT TAGCGTTTCA GGACAAAGGT GAATACGACC GAGCTATTGC AAATTACGAT
CAAGCCTTAC AGCTGAATCC TAGGTATATT ACTGCTTACA TCAATCGCGG AGACGCATAC
CGCCGCAAAG GCGAGCACGC GCGAGCGATC TCCGACTATA ATCAAGCGCT TCAGATCGAC
CAAAATTCTG TGATTGCCTA CAATAACCGC GGATTATGTT TTCATGAACA AGGAGAGTAC
GACCGGGCAA TCATCGACTA CGACCGCGCC CTGCAAATCG ATCCGATGTA TTCAACCGGG
TTCATCAATC GTGGATTCGC CTTTCATAAG AAGGGAGAAT ATGATCGAGC AATCGCCGAT
TATGATCGCG CCTTGCAAAT TGATCCCAGG TCCGCAACAG CTTACAACAA TCGCGGCTTC
ACTTTCCAAA ACAGAGGTGA GTACGATCTG GCAATCGTCG ACTACGATAA GGCCATTCTG
ATTAAGCCCG ACTTGGCGAA TTCTTATTAT CACCGCGGGA CAGTGCTGCG GCTCAAAGGA
GACCTTGAAC GCAGTGTCGC GGATTTAACT GAAGCCATAC GCCTCAACCC CAGATATGCT
GAAGCGTACC AAGATCGGGG CCTCACCTTC CACGCGAAAG GCGAGGCGGA CCGGGCTCTC
GCGGATTTCG CCGAGGCTGC CCGGCTGAAA CCCGAGTTCG AGAATGATCC GGCCTTCCTC
GCGGCCCGCC GCGTGGCGCA GGAGGGTCGC GCGGGTCAGG CTTCGGCTGC CTCGGTTGTC
GCGGCCGTCC ATGCCCCTCC GGTCGTGGCC CCGCCCCCGG TTTCCGCCGC CGCACCGGTG
TCGGAGACGC GCGTGGCCCT GGTGATCGGC AACGGTGCGT ATGCCTCCGT GGCCTCTCTC
GAGAACCCGA CGCGAGATGC CAGGGCCATC GCGCGCTCGT TGCGTGAAGC TGGCTTCAAG
ATCGTCCATC TGGAGAATGA CCTCCGCTAT GATGACCTGC GAAGAGCTCT GAACAATTTC
TCCGCCGAAG CTGATCAGGC CGATTGGGCC GTTGTTTACT ACGCGGGTCA CGGCATAGAA
GTCGGCGGCC TGAATTACAT CGTTCCGATC GATGCGCGGC TGAAAACGGA CCGAGCCGTC
CAATTTGAGG CGGTTCCGCT TGATCAGGTT CTGAGCAGCA TCGAAGGCGC GCGAAAGCTT
CGCCTCGTGA TCTTGGATGC GTGTCGGGAT AACCCGTTCC TGCAGCAGAT GACGCGCACG
GTCGCTTGGC GTTCGGTGGG ACGAGGCTTG GCGAAGGTCG AGCCGGAGAC GAGCGGCACG
CTGGTGGCGT TCGCAGCCAA GCATGGCCAG GTTGCCCTGG ATGCGCAGGA TGGCCGTGAA
AATAGTCCCT TCGCGACCGC GCTGATCAGG AACATCGCGC GACCTGGCAT AGAAATCCGC
AAGATGTTCG GGATCGTGCA CGATGACGTC ATGTCGGGCA CGGGGCGCAA GCAGGAACCT
TTCGTGTATG GCGCGCTCGG GGGTGAGGAT TATTTCTTCA ACGTGCGCTA G
 
Protein sequence
MRKATALVLA LGLLGQASSG ARALDAIGHN ARGFELQNRG EHEKAIAEFN LALRLNPKLV 
SAYINRGFAF RNKGDYDRAI ADYDHALQID PNSVVAFNNR GDAFYHKGEY DRAIADYNRS
IKLSSDKAAV YNNRGLAFFS KEEYDRAIAD YNQALRLDPK YLSAALNRGD AFRSKGEYDR
AIADYNQVLQ IDPRSVVSYN NRGLAFQGKG EYDRAVADYN QALTLDPGYT IALINRGDVF
RIKGQYDSAI ENYNQALQLN PKSKIAYNNR GFVFYNKGEY DRAIADYNSA LQIDPRYVVA
LVNRGDAFVS KGDYDRAIGD YGHALQINPN YAFAYNGRGV ALQNKGEYDR AIMDYDQALR
LDPKYVFAFA NRGDAFRSKG EHDVAIADYN QALRLSPNYA KAYNGRGLSF QNKAQYNRAI
EDYEQVIRLD PRFVAAYNNR GFALVSKGEP TLAIADYDKA LLLDPKSATV YANRGRAFQD
KGEYDRAIAD YDQALRLNPK DAIALNNRAD ILRLRHEHDR AIASYDQALQ LNPKYVGAYN
SRGLAFQDKG EYDRAIANYD QALQLNPRYI TAYINRGDAY RRKGEHARAI SDYNQALQID
QNSVIAYNNR GLCFHEQGEY DRAIIDYDRA LQIDPMYSTG FINRGFAFHK KGEYDRAIAD
YDRALQIDPR SATAYNNRGF TFQNRGEYDL AIVDYDKAIL IKPDLANSYY HRGTVLRLKG
DLERSVADLT EAIRLNPRYA EAYQDRGLTF HAKGEADRAL ADFAEAARLK PEFENDPAFL
AARRVAQEGR AGQASAASVV AAVHAPPVVA PPPVSAAAPV SETRVALVIG NGAYASVASL
ENPTRDARAI ARSLREAGFK IVHLENDLRY DDLRRALNNF SAEADQADWA VVYYAGHGIE
VGGLNYIVPI DARLKTDRAV QFEAVPLDQV LSSIEGARKL RLVILDACRD NPFLQQMTRT
VAWRSVGRGL AKVEPETSGT LVAFAAKHGQ VALDAQDGRE NSPFATALIR NIARPGIEIR
KMFGIVHDDV MSGTGRKQEP FVYGALGGED YFFNVR