Gene M446_1617 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_1617 
Symbol 
ID6133112 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp1808461 
End bp1810857 
Gene Length2397 bp 
Protein Length798 aa 
Translation table11 
GC content76% 
IMG OID641641880 
Productpeptidase C14 caspase catalytic subunit p20 
Protein accessionYP_001768549 
Protein GI170739894 
COG category[R] General function prediction only 
COG ID[COG0790] FOG: TPR repeat, SEL1 subfamily
[COG4249] Uncharacterized protein containing caspase domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.154049 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.123391 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCGGAT CCGTCGGGCG GGCGCTCCTG ATTCTGCTCA CGCTCTGCCT CTCCGCGGCG 
GCCGCCAGGG CCGAGCGGCG CCTCGCCTTC GTGGTCGGCG TCAACGCCTA CGCGAATCTC
CCCCAGGGGA TGCAGCTCGA GCGGGCGGTG CGCGACGCCG AGACCGTGGC GGACGCGCTG
CAATCCCTCG ATTTCCGCGT CGCGCGCCTG ACCCGCGAGG CGACGCTCAC CGGCTTCCTC
ACCGCCTTCG GGGCCTTCAC CCGCGAGGTC GAGCCGGGCG ACACGGTGCT GTTCTACTTC
GCGGGCCACG GCGTCGCCCT CGACGGCGTC AACTACCTGA TCCCGGCCGA CATCCCCTCG
CTGGCGGCGG GCGGCGAGGG CGTGGTGCGC AAGCTCGCCC TCGCCGAGGC CGACATCATC
CAGGACATCC AGTCCCGCGG CGCCCGGGTC ACGATCCTGG TGATCGATGC CTGCCGGGAC
AACCCCTTCC CGAAGGCCGG GTCGCGCACG CTCGGCGCGC CCCGCGGCCT CGCGTTCAAG
GAACCGCCGC AGGGCGTGTT CTCGCTCTAC TCGGCCGGGG CGGGCCAGCA GGCCCTCGAC
CGCCTCCCCG GCAGCGACCC GAGCCCCAAC TCGGTCTTCA CCCGCGTCTT CGCCGCCGAG
CTGCGCAAGC CCGGCACCAG CCTCGTCGAT CTCGGCGAGA CGGTGCGCGA GGAGGTGGCC
GCCCTCGCCC GCCGGGCGAA CCACGACCAG ATCCCGGCCG TCTACAACCA GGTGCTCGGC
GCGCGGCGGA TCATGCTGGC CGGCGCGCGG GCCGAGCCGC CCGCGCCCGC CGCGCCCCGG
GCCGACGAGA TCGCGTGGCG CTTCCTCAAG ACCAGCACCG ACCCGGAGGC CCTGCGCAGC
TTCGTCGCGC AGTTCGGCGA CAGCCCGCTG CGGGCGGAGG CCCAGGCCCG CCTCGCCGAG
ATCGAGACGA GCCGGCGCCA GACGGTCGCC GCGCTGCCGC GCCCGACGCT GCGCCGGGTC
GGCGACGAGG CCCTCTGCGA CGAACTCGCC GCCGTCTCCG ACGGGCGCGA GCGCGCGCCG
GGCCTGCGCG GCGTGCCGCT CGACAAGATC GACGCCCGGC GCGCGGTCGA GGCCTGCCGG
GCGGCGACGC GGGCGGCTCC CGACAACCCG CGCTACGCGG TGCAACTCGG CCGGGCCCTG
CACGCCGACA AGCAGTACGG CGACGCGGCG GCCTGGTACC GCCGCGCGGC CGATCAGGGC
AGCGCGCAGG CCCTCAACAA CCTCGCCACC CTGACGCTGG AGGGGCTCGG CGGCCTGGAG
CGCAGCCCCG ACGAGGCGCT CCGCCTGTTC CACCGCGCGG CCGAGCGCGG CAGCGTCGCG
GCGATGCGCA ACCTCGGCAA CCTCTACCGG ACCGGCAGGC AGGGGGTGGC GAAGGATTCC
GCCGAGGCCG TGCGCTGGTA CAGGGCGGCG GTCGAGCGCG AGGATGCCGC CGCCATGGTG
GAGCTCGGCG TGATGACGGC CCGCGGCGAG GGCGTGCCGA AGGACGAGGC CGAGGCGGGA
CGGCTCTACG CGCGGGCCGA GCGCCTCGGC AACGCGGCGG CCCAGAACAA CCTCGGCCTG
TTCCTGCTGC ACGGGAAGGG CGGGTTCGCC AAGGACGAGG CGGAGGCCGC GCGCCTGTTC
CGCCGCGCCG CCGAGGCGGG CAATGCCCAC GCGATGGCGC ATCTCGGCTG GATGACCAAG
CTCGGCAAGG GTGGCCTGGC GAAGGACGAC GTCGAGGCCG TGCGGCTCTA CCGCAGGTCG
GCGGAGGCCG GGAACAGCCT CGGCATGGCC TACCTGGCGG CGATGTACCG GGAGGGGCGC
GGCGGCCTGC CAGAGGACGC GCGGGAGGCG CGCAGGCTCT ACGAGCGCGC CGCCGAGGAG
GGCAGCGCGG TCGGGCGGGC GGGCCTCGCC TTCCTGCACG AGCGGGGGCT CGGCGGCCTG
CCGCGCAACG AGGCCGAGGC GCTGCGCCTC TACCGGCTCG CCGCCGAGGA GAACAACGGC
GTCGCGATCG ACCATCTCGG CCAGTTCCAC CGGGACGGCA AGGGCGGGCT GCGGCCCGAT
CCGCAGGCCG CCATGGCGCA GTTCCGGCGG GCGGCCGATC TCGGCTTCGC CCCCGCCATG
GCCCATCTGG GCGCGCTCTA CGAGAAGCGC CGCAACGCCG CCGAGGCCCT GGCCTGGTAC
CGCAGGGCCG CCGACCTCGA CGACCCGCTC GGGCTCTACC TCCTGGGCCA GGCCCACGAG
ACCGGGCTCG GGATGCCCCG GAACCGGGGC GAGGCGCTGC GCCTCTACGG GCGGGCCGCC
GAACTCGGCC ATGCGGGGGC GGCGGCCGCC CTGCGGCGCC TCGGCAGGAG GGTGTGA
 
Protein sequence
MTGSVGRALL ILLTLCLSAA AARAERRLAF VVGVNAYANL PQGMQLERAV RDAETVADAL 
QSLDFRVARL TREATLTGFL TAFGAFTREV EPGDTVLFYF AGHGVALDGV NYLIPADIPS
LAAGGEGVVR KLALAEADII QDIQSRGARV TILVIDACRD NPFPKAGSRT LGAPRGLAFK
EPPQGVFSLY SAGAGQQALD RLPGSDPSPN SVFTRVFAAE LRKPGTSLVD LGETVREEVA
ALARRANHDQ IPAVYNQVLG ARRIMLAGAR AEPPAPAAPR ADEIAWRFLK TSTDPEALRS
FVAQFGDSPL RAEAQARLAE IETSRRQTVA ALPRPTLRRV GDEALCDELA AVSDGRERAP
GLRGVPLDKI DARRAVEACR AATRAAPDNP RYAVQLGRAL HADKQYGDAA AWYRRAADQG
SAQALNNLAT LTLEGLGGLE RSPDEALRLF HRAAERGSVA AMRNLGNLYR TGRQGVAKDS
AEAVRWYRAA VEREDAAAMV ELGVMTARGE GVPKDEAEAG RLYARAERLG NAAAQNNLGL
FLLHGKGGFA KDEAEAARLF RRAAEAGNAH AMAHLGWMTK LGKGGLAKDD VEAVRLYRRS
AEAGNSLGMA YLAAMYREGR GGLPEDAREA RRLYERAAEE GSAVGRAGLA FLHERGLGGL
PRNEAEALRL YRLAAEENNG VAIDHLGQFH RDGKGGLRPD PQAAMAQFRR AADLGFAPAM
AHLGALYEKR RNAAEALAWY RRAADLDDPL GLYLLGQAHE TGLGMPRNRG EALRLYGRAA
ELGHAGAAAA LRRLGRRV