Gene Mchl_3724 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMchl_3724 
Symbol 
ID7115385 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium chloromethanicum CM4 
KingdomBacteria 
Replicon accessionNC_011757 
Strand
Start bp3927019 
End bp3930369 
Gene Length3351 bp 
Protein Length1116 aa 
Translation table11 
GC content70% 
IMG OID643526459 
Producttransglutaminase domain protein 
Protein accessionYP_002422471 
Protein GI218531655 
COG category[E] Amino acid transport and metabolism
[S] Function unknown 
COG ID[COG1305] Transglutaminase-like enzymes, putative cysteine proteases
[COG4196] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.565418 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.0894015 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAACCGG CGGCCGAGCG CACCGCCCGC TCGTTCGAGA CAATCGCAGA AGCCTTCGGA 
GCCTCCGTGT CGATCCAAGC CGCCCTGCAC CACGTCACGC ATTACCGCTA CGACCGGCCG
ATCGCGCTGG GACCGCAGAC GATCCGGCTC CGCCCGGCGC CGCATGCGCG AACCCGGGTG
CCGGCCTACG CGCTGAAGGT CAGCCCGGAA AACCACTTCA TCAACTGGCA GCAGGATCCG
GCCGGCAACT GGCTCGCCCG GCTCGTCTTC CCGGAAAAGA CCACGGAACT GCGCATCGAG
GTCGATCTCA CCGCCGACCT CGCGGTCATC AACCCGTTCG ACTTCTTCGT CGAGCCCTAC
GCCGAGCGCC ATCCCTTCGA GTACGAGCCG GATCTGAAGG TCGCGCTCGC GCCCTACCTC
GTTCTCGATG ACGCGCACGG GCCGGAGATC GACGCCTTCC TCACGCGCAT TCCCGAGGAG
ACCCACACGG TCACCTTCCT CGTGGCGCTC AACGCGCTGC TTCGGGCGGA GGTGAATTAC
GGCGTGCGCA TGGAACCCGG CGTGCAGACG CCCGCCGAGA CGCTCACCCT GCAGAGCGGC
TCGTGCCGCG ATTCCGCGTG GCTGCTGGTG CAGGTGCTGC GGCGCCTCGG CTTTGCCGCC
CGCTTCGTCT CGGGCTACCT GATCCAGCTC GTGCCTGACA CCACCGCCGT GGACGGCCCG
GCCGGCACCA AGACCGACTT CACCGACCTG CACGCCTGGG CCGAGGTCTA CCTGCCCGGC
GCCGGCTGGA TCGGCTTCGA CGCCACCTCC GGCCTGCTCA CGGGCGAGGG TCACATCCCG
CTCGTGGCTA CCGCGCATTA CAACGCCGCC GCGCCGATCT CGGGGCTCGC CGAACCGGCC
AAGGTCGAGT TCGCCTACGA GATGACGATC AGCCGCGTCG CCGAGGCGCC GCGCATCACC
AAGCCGTTCT CGGACGAGGT CTGGACCTCG ATGGATGCGC TCGGCGAGCG CATCGACGCG
GATCTCGCCG AGCAGGACGT ACGCCTGACC ATGGGCGGCG AACCGACCTT CGTCTCGGTG
GACGACTTCG AATCCCCTGA ATGGAACGTC GCCGCCGTGG GGCCGACCAA GCGGGGCTTG
GCCGACCAAC TGATCCGCCG CCTGCGGGAC CGCTTCGCGC CCGGCGGCAT GCTGCATTAC
GGCCAGGGCA AGTGGTATCC CGGCGAGAGC CTGCCGCGCT GGGCCTTCGC CCTCTACTGG
CGCAAGGACG GCGTGCCGAT CTGGAAGAAC GCCGACCTGA TCGCCGTCGA GGACGGCCCG
AAGACCGCCA CCATCACGGA TGCCGAGCGG CTGATCGGCA CGCTCGCCGA GCGGCTCGAA
CTCGCCCGCT TCGTCTTCCC GGCCTACGAA GACGCCGATT ACTGGCGGAC CCGCGAGAGC
GAGCTGCCGG TCAACGTCAC GACCGCGGAA CCGCAAACCG GCAGCCCGGA GACCGATGCC
CGGTTCCGGC GCGTGTTCGG GCGCGGCCTC GACAAGCCCG TCGGCTACGT GCTGCCGCTG
GCCAACCTCG CGGCGGGGGA GGGCCGTGTC TGGATCTCGG AGACCTGGAA GTTCCGCCGC
GGCGGCGCCT ACCTGAACCC CGGTGACTCG CCTTTGGGCT TCCGCCTGCC GCTCGGCTCG
CTTCCCTATG TGCCGCCGGA TTCCTACCCC TACTACCACC CGCAGGATCC GCTCGACGCG
CGCGGCGACC TGCCCGTCGA ACCGGGCGGC CCCCGCGACG CTCCCCTGCC GAAGGGCGCC
GACCGGGCCA ACGGCGCCGG CATGGAGGGC GTGGCCGTGC GCACCGCGCT GTCGGTCGAG
CCCCGCGACG GCGTGCTCTG CGTGTTCATG CCGCCCGTGG AGCGGGCCGA CGATTACATC
GACCTCGTCG GACATCTCGA ACGCGTCGCC GAGACCATCG GCCAGCCGAT TCATATCGAG
GGCTACGAGC CGCCCTACGA TCCGCGTCTC CCGGTCATCA AGGTCACGCC CGATCCCGGC
GTGATCGAGG TCAACGTCCA CCCCGCCGCC TCCTGGCGCG AGGCGGTCGA CATCACCCGC
GGACTCTACG AGGAGGCCCG GCAGACGCGC CTGGGCGCCG AGAAGTTCAT GATCGACGGG
CGCCACACCG GCACCGGCGG GGGCAACCAC GTCGTGCTCG GCGGCGCGAC CCCGTCCGAC
TCGCCGTTCC TGCGCCGGCC GGACCTGCTG AAGAGCCTCG TGCTGTTCTG GCAGCGGCAT
CCGTGCCTGT CCTACCTGTT CGCCGGGCTC TATGTCGGCC CAACGAGCCA GGCGCCGCGC
ATGGACGAGG CGCGCCATGA CGGGCTCTAC GAATTGGAGA TCGCGCTGGC GCAGGTGCCG
GAGCCGGATG GGGCCAACAT CCCGCACTGG CTGGTGGACC GGCTGTTCCG CAACATCCTC
GCTGATGTCA CCGGCAACAC CCATCGCGCC GAGATCTGCA TCGACAAGCT GTTCTCGCCC
GACGGCGCCA CAGGCCGGCT CGGCCTGCTC GAATTCCGCT CCTTCGAGAT GCCGCCGGAC
GCGCGCATGA GCCTCGCGCA GCAGGTGCTG TTGCGGGCCA TCGTGGCGTG GCTCTGGCGC
GAGCCGCAGA CTGGCGGCTG TGTCCGCTGG GGCACGGCGC TGCACGATCG CTTCATGCTG
CCGCATTTCC TCTGGGCGGA CTTCCTCTCC GTGCTGGAAG ACCTGCGCGG CGGCGGCTAC
GACTTCGACC CTCAGGCCTT CGCGGCGCAA GCCGAGTTCC GCTTCCCCGT CTTCGGCCGG
GTGGAGCAGG GCGGCGTCGG CCTCGAACTA CGCCAAGCGC TGGAGCCGTG GCACGTGCTG
GGCGAAGAGG GGTCCGCCGG GGGAACCGTG CGCTTCGTGG ACGCATCCGT CGAACGGCTT
CAGGTGAAGG TCGAGGGCTT CGTCCCCGGC CGGCACGTCA TCGCCTGCAA TGGCCGCCGC
CTGCCGATGA CGCCGACCGG CGCCAGCGGC GAAGCGGTCG CGGGCCTGCG CTTCAAGGCG
TGGCAGCCGG CCTCCTCAAT GCACCCGACG ATCCCGCCGC ATGGGCCGCT GACCTTCGAC
ATCTTCGATG CCTGGAGCGG CCGTTCGATC GGCGGCTGCC GCTACCACGT CAGCCATCCG
GGCGGGCGCA ACTACGACAG CTTCCCCGTC AACGCCTACG AGGCGGAGGG GCGTCGTCTC
GCCCGGTTCG AGGCCATGGG CCACACTCCC GGCCGCCTCG CGATGCCGGC CGAGGAGCGC
ACGGGCGACT TCCCCCTGAC CCTCGACCTG CGTACGCCCG CACCCCGATG A
 
Protein sequence
MQPAAERTAR SFETIAEAFG ASVSIQAALH HVTHYRYDRP IALGPQTIRL RPAPHARTRV 
PAYALKVSPE NHFINWQQDP AGNWLARLVF PEKTTELRIE VDLTADLAVI NPFDFFVEPY
AERHPFEYEP DLKVALAPYL VLDDAHGPEI DAFLTRIPEE THTVTFLVAL NALLRAEVNY
GVRMEPGVQT PAETLTLQSG SCRDSAWLLV QVLRRLGFAA RFVSGYLIQL VPDTTAVDGP
AGTKTDFTDL HAWAEVYLPG AGWIGFDATS GLLTGEGHIP LVATAHYNAA APISGLAEPA
KVEFAYEMTI SRVAEAPRIT KPFSDEVWTS MDALGERIDA DLAEQDVRLT MGGEPTFVSV
DDFESPEWNV AAVGPTKRGL ADQLIRRLRD RFAPGGMLHY GQGKWYPGES LPRWAFALYW
RKDGVPIWKN ADLIAVEDGP KTATITDAER LIGTLAERLE LARFVFPAYE DADYWRTRES
ELPVNVTTAE PQTGSPETDA RFRRVFGRGL DKPVGYVLPL ANLAAGEGRV WISETWKFRR
GGAYLNPGDS PLGFRLPLGS LPYVPPDSYP YYHPQDPLDA RGDLPVEPGG PRDAPLPKGA
DRANGAGMEG VAVRTALSVE PRDGVLCVFM PPVERADDYI DLVGHLERVA ETIGQPIHIE
GYEPPYDPRL PVIKVTPDPG VIEVNVHPAA SWREAVDITR GLYEEARQTR LGAEKFMIDG
RHTGTGGGNH VVLGGATPSD SPFLRRPDLL KSLVLFWQRH PCLSYLFAGL YVGPTSQAPR
MDEARHDGLY ELEIALAQVP EPDGANIPHW LVDRLFRNIL ADVTGNTHRA EICIDKLFSP
DGATGRLGLL EFRSFEMPPD ARMSLAQQVL LRAIVAWLWR EPQTGGCVRW GTALHDRFML
PHFLWADFLS VLEDLRGGGY DFDPQAFAAQ AEFRFPVFGR VEQGGVGLEL RQALEPWHVL
GEEGSAGGTV RFVDASVERL QVKVEGFVPG RHVIACNGRR LPMTPTGASG EAVAGLRFKA
WQPASSMHPT IPPHGPLTFD IFDAWSGRSI GGCRYHVSHP GGRNYDSFPV NAYEAEGRRL
ARFEAMGHTP GRLAMPAEER TGDFPLTLDL RTPAPR