Gene Mpop_3608 details


Gene Information

Locus tag: Mpop_3608
Symbol:
ID: 6311765
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium populi BJ001
Kingdom: Bacteria
Replicon accession: NC_010725
Strand:
Start bp: 3849696
End bp: 3853010
Gene Length: 3315 bp
Protein Length: 1104 aa
Translation table: 11
GC content: 71%
IMG OID: 642652330
Product: transglutaminase domain protein
Protein accession: YP_001926292
Protein GI: 188582847
COG category: [E] Amino acid transport and metabolism
              [S] Function unknown
COG ID: [COG1305] Transglutaminase-like enzymes, putative cysteine proteases
        [COG4196] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
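
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(3849696, 3853010)
print(length)                              # 3315
print(expected_protein_length_aa(length))  # 1104
```

Both results agree with the Gene Length and Protein Length fields listed in the record.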


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.432464
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value: 0.238952
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTCGATCC AAGCCGCCCT GCACCACGTC ACGCATTACC GCTACGACCG GCCGATCGCG 
CTCGGGCCGC AGACAATCCG CCTGCGCCCG GCGCCGCATG CCCGGACGAA GGTGCCGGCC
TACGCGCTGA AGGTCAGCCC GGAGAACCAC TTCATCAACT GGCAGCAGGA TCCGGCCGGC
AACTGGCTCG CCCGCCTCGT CTTTCCGGAG AAGACGACGG AGTTCCGGGT CGAGGTCGAT
CTCACCGCCG ACCTGGCCGT CATCAACCCG TTCGACTTCT TCGTCGAGCC CTATGCCGAG
CAACGCCCCT TCGACTACGA GGCCGACCTG AAGGTCGCGC TCGCGCCCTA CCTCGTCCTC
GACGAGGCGC AGGGTCCGGA GATCGACGCC TTCCTCGCGG AAATCCCCGA GGAGCCCCAC
ACGGTCAGCT TCCTCGTGGC GCTCAACGAG AAGGTGCGTT CGCACGTCGC CTACGGGGTC
CGGATGGAGC CCGGCGTGCA AACCGCCGCC GAGACGCTGA CGCTCAAGAG CGGATCCTGC
CGCGATTCCG CGTGGCTCCT GGTGCAGGTG CTGCGTCGGC TCGGCTTCGC CGCCCGCTTC
GTCTCGGGCT ACCTGATCCA GCTCGTGCCC GACACCACCG CCGTCGACGG CCCGGCAGGC
ACCACCACCG ACTTCACCGA CCTGCACGCC TGGGCCGAGG TCTACCTGCC GGGCGCCGGC
TGGATCGGGT TCGACGCCAC CTCGGGCCTG CTCACCGGCG AGGGACACAT CCCGCTGGTG
GCCACCGCGC ACTACAACGC CGCCGCGCCG ATCTCCGGCT TGGCCGAGCC GGCCAAGGTC
GAGTTCGCCT ACGAGATGAC GATCAGCCGC GTGGCGGAAG CGCCGCGCAT CACCAAGCCG
TTCTCCGACG AGATCTGGGC GGCGATGGAT GCGCTCGGCG AGCGGATCGA CGCGGATCTG
ACCGCGCAGG ACGTGCGCCT GACCATGGGC GGCGAGCCGA CCTTCGTCTC GGTGGACGAC
TTCGAATCGC CGGAGTGGAA CGTCGCCGCG GTGGGGCCGA CGAAGCGCGG CTTGGCCGAC
ACGCTGGTGC GGCGCCTGCG CGACCGCTTC GCGGCGGGCG GCATGCTGCA TTACGGCCAG
GGCAAGTGGT ATCCCGGTGA GAGCCTGCCG CGCTGGGCCT TCGCCCTGTA CTGGCGCAAG
GACGGCGTGC CGATCTGGCG CAACGCCGAC CTCATCGCCG TCGAGGACGG TCCCAAGACC
CCAAACACCA AGACCCCAAA CACCAAGACC GCGACCATCA AGGACGCCGA GCGTCTGATC
GGCGCGTTGT CGGAGCGGCT CGAACTCGGC CGCTTCGTCT TCCCGGCCTA CGAGGACGCG
GTCTACTGGA AGGCGCGCGA GAGCGAGTTG CCGGTCAACG TCACGACCGC GGCGGCGCAG
ACCGGCAGCC TGGAGGCCGA TGCCCGGTTC AAACGCGTGT TCGGCCGCGG CCTCGACAAC
CCCGTCGGCT ACGTGCTGCC CCTCGCCAAT CTTGCGGTGG GGGAGGGCCG CGCCTGGATC
TCGGAGGCCT GGGCCTTCCG CCGCGGCGGT GCGTATCTCA GCCCCGGCGA CTCGCCGATG
GGCTTCCGCC TGCCGCTCGC CTCGCTGCCC TACGTGCCGC CGGATTCCTA CCCCTACTAC
CACCCGCAGG ACCCGATGGA GGCCCGCGGC GACCTGCCCG CCGAGCCCGG CGGGCTCAAG
GATGCCCCGC TCGCCAAGGG CAAGGATCGC GGGAACGGCG CCGGGATCGA GGGGGTGGCC
GTGCGCACGG CGCTGGCCGT GGAGCCCCGC GACGGCGTGC TCTGCGTGTT CATGCCGCCG
CTGGAGCGGG TGGACGAGTA TCTCGACCTC GTCGGGCATC TGGAGCGGGC GGCGGAAGCG
ATCGGCCAGC CGATCCATAT CGAGGGCTAC GAGCCGCCCT ACGATCCGCG CCTCTCGGTC
ATCAAGGTCA CGCCCGATCC CGGCGTGATC GAGGTCAACG TCCACCCCGC CGCCTCCTGG
CGCGAGGCGG TGGACATCAC CGCCGGCCTC TACGAGGAGG CGCGCCAGAC GCGCCTCTGC
GCCGAGAAAT TCATGATCGA CGGACGCCAC ACCGGCACCG GCGGCGGCAA CCACGTCGTG
CTCGGCGGCG CCAGCCCCTC GGACTCGCCG TTCCTGCGCC GGCCGGACCT GCTGAAGAGC
CTCGTGCTGT TCTGGCAGCG CCATCCCTGC CTGTCCTACC TCTTCGCCGG CCTCTATGTC
GGCCCGACGA GCCAGGCGCC GCGGATGGAC GAGGCGCGCC ACGACGGCCT CTACGAACTG
GAGATCGCGC TCGCCCAGGT GCCGGGGCCG GACGAGGCCA ACATCCCGCA CTGGCTGGTG
GACCGATTGT TCCGCAACAT CCTCGCGGAC GTCACCGGCA ACACCCACCG GGCCGAGATC
TGCATCGACA AGCTGTTCTC GCCGGACGGG CCGACGGGGC GGCTCGGCCT CCTTGAATTC
CGCTCCTTCG AGATGCCACC GGATGCGCGC ATGAGCCTGG CCCAGCAAGT CCTGCTGCGG
GCGATCGTGG CGTGGCTCTG GCGGGAGCCG CAGACGGGGG GCTGTGTCCG CTGGGGCACG
GCGCTGCACG ATCGCTTCAT GCTGCCGCAT TTCCTCTGGG CCGACTTCCT CTCCGTGCTG
GAAGACCTGC GCGGCGGCGG CTACGACTTC GACCCTCAGG CCTTCGCGGC GCAGGCCGCG
TTCCGCTTCC CCATCTTCGG CCGGGTGGAG CAGGGCGGCG TGAGCCTCGA ACTGCGCCAG
GCGCTGGAGC CCTGGCACGT GCTGGGCGAA GAGGGGACCG CCGGGGGAAC CGTGCGCTTC
GTGGACGCAT CCGTCGAACG GCTTCAGGTG AAGGTCGAGG GCTATGTCCC CGGCCGGCAC
GTGATCGCCT GCAACGGCCG GCGCCTGCCG ATGACGTCGA CCGGCGCCAG CGGCGAGGCG
GTCGCGGGCC TGCGCTTCAA GGCGTGGCAG CCCGCCTCCT CGCTGCACCC GACGATCCCG
CCGCACGGGC CGCTGACCTT CGACATCCTC GATGTCTGGA GCGGCCGTTC GCTCGGCGGC
TGCCGCTACC ATGTCAGCCA TCCGGGCGGG CGCAGCTACG ACAGCTTCCC GATCAACGCC
TATGAGGCGG AGGGGCGCCG TCTCGCCCGC TTCGAGGCGA TGGGCCACAC CCCCGGCCGC
ATCGCGATGC CGGCCGAGGA GCGCACGAGC GAATTCCCCC TGACCCTCGA CCTGCGCACC
CCCGCACCCC GATGA
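
A GC content figure like the one reported for this gene can be computed from a blocked listing such as the one above. This is a minimal sketch, assuming GC content is defined as the share of G and C bases over the total sequence length; the helper names are hypothetical, and only the first row of the listing is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked listing and upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases that are G or C (assumed definition of GC content)."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence listing, copied with its spacing intact.
first_row = "GTGTCGATCC AAGCCGCCCT GCACCACGTC ACGCATTACC GCTACGACCG GCCGATCGCG"
seq = clean_sequence(first_row)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Passing the full listing through `clean_sequence` and `gc_percent` gives the value for the whole gene, which can then be compared with the GC content field of the record.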
 
Protein sequence
MSIQAALHHV THYRYDRPIA LGPQTIRLRP APHARTKVPA YALKVSPENH FINWQQDPAG 
NWLARLVFPE KTTEFRVEVD LTADLAVINP FDFFVEPYAE QRPFDYEADL KVALAPYLVL
DEAQGPEIDA FLAEIPEEPH TVSFLVALNE KVRSHVAYGV RMEPGVQTAA ETLTLKSGSC
RDSAWLLVQV LRRLGFAARF VSGYLIQLVP DTTAVDGPAG TTTDFTDLHA WAEVYLPGAG
WIGFDATSGL LTGEGHIPLV ATAHYNAAAP ISGLAEPAKV EFAYEMTISR VAEAPRITKP
FSDEIWAAMD ALGERIDADL TAQDVRLTMG GEPTFVSVDD FESPEWNVAA VGPTKRGLAD
TLVRRLRDRF AAGGMLHYGQ GKWYPGESLP RWAFALYWRK DGVPIWRNAD LIAVEDGPKT
PNTKTPNTKT ATIKDAERLI GALSERLELG RFVFPAYEDA VYWKARESEL PVNVTTAAAQ
TGSLEADARF KRVFGRGLDN PVGYVLPLAN LAVGEGRAWI SEAWAFRRGG AYLSPGDSPM
GFRLPLASLP YVPPDSYPYY HPQDPMEARG DLPAEPGGLK DAPLAKGKDR GNGAGIEGVA
VRTALAVEPR DGVLCVFMPP LERVDEYLDL VGHLERAAEA IGQPIHIEGY EPPYDPRLSV
IKVTPDPGVI EVNVHPAASW REAVDITAGL YEEARQTRLC AEKFMIDGRH TGTGGGNHVV
LGGASPSDSP FLRRPDLLKS LVLFWQRHPC LSYLFAGLYV GPTSQAPRMD EARHDGLYEL
EIALAQVPGP DEANIPHWLV DRLFRNILAD VTGNTHRAEI CIDKLFSPDG PTGRLGLLEF
RSFEMPPDAR MSLAQQVLLR AIVAWLWREP QTGGCVRWGT ALHDRFMLPH FLWADFLSVL
EDLRGGGYDF DPQAFAAQAA FRFPIFGRVE QGGVSLELRQ ALEPWHVLGE EGTAGGTVRF
VDASVERLQV KVEGYVPGRH VIACNGRRLP MTSTGASGEA VAGLRFKAWQ PASSLHPTIP
PHGPLTFDIL DVWSGRSLGG CRYHVSHPGG RSYDSFPINA YEAEGRRLAR FEAMGHTPGR
IAMPAEERTS EFPLTLDLRT PAPR
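
The relationship between the two listings can be illustrated by translating the opening codons of the gene sequence. This is a minimal sketch, assuming NCBI translation table 11 (standard amino acid assignments, with alternative start codons such as GTG read as methionine at the first position); `translate_cds` is an illustrative helper, not a library function.

```python
BASES = "TCAG"
# Standard amino acid assignments in NCBI order (T, C, A, G at each position).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Initiation codons of translation table 11; each is read as M at the start.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(seq: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for pos in range(0, len(seq), 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            residue = "M"
        else:
            residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence listing.
print(translate_cds("GTGTCGATCC AAGCCGCCCT GCACCACGTC"))  # MSIQAALHHV
```

The output matches the first ten residues of the protein sequence, including the GTG start codon read as M.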