Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpop_3608 |
Symbol | |
ID | 6311765 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium populi BJ001 |
Kingdom | Bacteria |
Replicon accession | NC_010725 |
Strand | + |
Start bp | 3849696 |
End bp | 3853010 |
Gene Length | 3315 bp |
Protein Length | 1104 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 642652330 |
Product | transglutaminase domain protein |
Protein accession | YP_001926292 |
Protein GI | 188582847 |
COG category | [E] Amino acid transport and metabolism [S] Function unknown |
COG ID | [COG1305] Transglutaminase-like enzymes, putative cysteine proteases [COG4196] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.432464 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.238952 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCGATCC AAGCCGCCCT GCACCACGTC ACGCATTACC GCTACGACCG GCCGATCGCG CTCGGGCCGC AGACAATCCG CCTGCGCCCG GCGCCGCATG CCCGGACGAA GGTGCCGGCC TACGCGCTGA AGGTCAGCCC GGAGAACCAC TTCATCAACT GGCAGCAGGA TCCGGCCGGC AACTGGCTCG CCCGCCTCGT CTTTCCGGAG AAGACGACGG AGTTCCGGGT CGAGGTCGAT CTCACCGCCG ACCTGGCCGT CATCAACCCG TTCGACTTCT TCGTCGAGCC CTATGCCGAG CAACGCCCCT TCGACTACGA GGCCGACCTG AAGGTCGCGC TCGCGCCCTA CCTCGTCCTC GACGAGGCGC AGGGTCCGGA GATCGACGCC TTCCTCGCGG AAATCCCCGA GGAGCCCCAC ACGGTCAGCT TCCTCGTGGC GCTCAACGAG AAGGTGCGTT CGCACGTCGC CTACGGGGTC CGGATGGAGC CCGGCGTGCA AACCGCCGCC GAGACGCTGA CGCTCAAGAG CGGATCCTGC CGCGATTCCG CGTGGCTCCT GGTGCAGGTG CTGCGTCGGC TCGGCTTCGC CGCCCGCTTC GTCTCGGGCT ACCTGATCCA GCTCGTGCCC GACACCACCG CCGTCGACGG CCCGGCAGGC ACCACCACCG ACTTCACCGA CCTGCACGCC TGGGCCGAGG TCTACCTGCC GGGCGCCGGC TGGATCGGGT TCGACGCCAC CTCGGGCCTG CTCACCGGCG AGGGACACAT CCCGCTGGTG GCCACCGCGC ACTACAACGC CGCCGCGCCG ATCTCCGGCT TGGCCGAGCC GGCCAAGGTC GAGTTCGCCT ACGAGATGAC GATCAGCCGC GTGGCGGAAG CGCCGCGCAT CACCAAGCCG TTCTCCGACG AGATCTGGGC GGCGATGGAT GCGCTCGGCG AGCGGATCGA CGCGGATCTG ACCGCGCAGG ACGTGCGCCT GACCATGGGC GGCGAGCCGA CCTTCGTCTC GGTGGACGAC TTCGAATCGC CGGAGTGGAA CGTCGCCGCG GTGGGGCCGA CGAAGCGCGG CTTGGCCGAC ACGCTGGTGC GGCGCCTGCG CGACCGCTTC GCGGCGGGCG GCATGCTGCA TTACGGCCAG GGCAAGTGGT ATCCCGGTGA GAGCCTGCCG CGCTGGGCCT TCGCCCTGTA CTGGCGCAAG GACGGCGTGC CGATCTGGCG CAACGCCGAC CTCATCGCCG TCGAGGACGG TCCCAAGACC CCAAACACCA AGACCCCAAA CACCAAGACC GCGACCATCA AGGACGCCGA GCGTCTGATC GGCGCGTTGT CGGAGCGGCT CGAACTCGGC CGCTTCGTCT TCCCGGCCTA CGAGGACGCG GTCTACTGGA AGGCGCGCGA GAGCGAGTTG CCGGTCAACG TCACGACCGC GGCGGCGCAG ACCGGCAGCC TGGAGGCCGA TGCCCGGTTC AAACGCGTGT TCGGCCGCGG CCTCGACAAC CCCGTCGGCT ACGTGCTGCC CCTCGCCAAT CTTGCGGTGG GGGAGGGCCG CGCCTGGATC TCGGAGGCCT GGGCCTTCCG CCGCGGCGGT GCGTATCTCA GCCCCGGCGA CTCGCCGATG GGCTTCCGCC TGCCGCTCGC CTCGCTGCCC TACGTGCCGC CGGATTCCTA CCCCTACTAC CACCCGCAGG ACCCGATGGA GGCCCGCGGC GACCTGCCCG CCGAGCCCGG CGGGCTCAAG GATGCCCCGC TCGCCAAGGG CAAGGATCGC GGGAACGGCG CCGGGATCGA GGGGGTGGCC GTGCGCACGG CGCTGGCCGT GGAGCCCCGC GACGGCGTGC TCTGCGTGTT CATGCCGCCG CTGGAGCGGG TGGACGAGTA TCTCGACCTC GTCGGGCATC TGGAGCGGGC GGCGGAAGCG ATCGGCCAGC CGATCCATAT CGAGGGCTAC GAGCCGCCCT ACGATCCGCG CCTCTCGGTC ATCAAGGTCA CGCCCGATCC CGGCGTGATC GAGGTCAACG TCCACCCCGC CGCCTCCTGG CGCGAGGCGG TGGACATCAC CGCCGGCCTC TACGAGGAGG CGCGCCAGAC GCGCCTCTGC GCCGAGAAAT TCATGATCGA CGGACGCCAC ACCGGCACCG GCGGCGGCAA CCACGTCGTG CTCGGCGGCG CCAGCCCCTC GGACTCGCCG TTCCTGCGCC GGCCGGACCT GCTGAAGAGC CTCGTGCTGT TCTGGCAGCG CCATCCCTGC CTGTCCTACC TCTTCGCCGG CCTCTATGTC GGCCCGACGA GCCAGGCGCC GCGGATGGAC GAGGCGCGCC ACGACGGCCT CTACGAACTG GAGATCGCGC TCGCCCAGGT GCCGGGGCCG GACGAGGCCA ACATCCCGCA CTGGCTGGTG GACCGATTGT TCCGCAACAT CCTCGCGGAC GTCACCGGCA ACACCCACCG GGCCGAGATC TGCATCGACA AGCTGTTCTC GCCGGACGGG CCGACGGGGC GGCTCGGCCT CCTTGAATTC CGCTCCTTCG AGATGCCACC GGATGCGCGC ATGAGCCTGG CCCAGCAAGT CCTGCTGCGG GCGATCGTGG CGTGGCTCTG GCGGGAGCCG CAGACGGGGG GCTGTGTCCG CTGGGGCACG GCGCTGCACG ATCGCTTCAT GCTGCCGCAT TTCCTCTGGG CCGACTTCCT CTCCGTGCTG GAAGACCTGC GCGGCGGCGG CTACGACTTC GACCCTCAGG CCTTCGCGGC GCAGGCCGCG TTCCGCTTCC CCATCTTCGG CCGGGTGGAG CAGGGCGGCG TGAGCCTCGA ACTGCGCCAG GCGCTGGAGC CCTGGCACGT GCTGGGCGAA GAGGGGACCG CCGGGGGAAC CGTGCGCTTC GTGGACGCAT CCGTCGAACG GCTTCAGGTG AAGGTCGAGG GCTATGTCCC CGGCCGGCAC GTGATCGCCT GCAACGGCCG GCGCCTGCCG ATGACGTCGA CCGGCGCCAG CGGCGAGGCG GTCGCGGGCC TGCGCTTCAA GGCGTGGCAG CCCGCCTCCT CGCTGCACCC GACGATCCCG CCGCACGGGC CGCTGACCTT CGACATCCTC GATGTCTGGA GCGGCCGTTC GCTCGGCGGC TGCCGCTACC ATGTCAGCCA TCCGGGCGGG CGCAGCTACG ACAGCTTCCC GATCAACGCC TATGAGGCGG AGGGGCGCCG TCTCGCCCGC TTCGAGGCGA TGGGCCACAC CCCCGGCCGC ATCGCGATGC CGGCCGAGGA GCGCACGAGC GAATTCCCCC TGACCCTCGA CCTGCGCACC CCCGCACCCC GATGA
|
Protein sequence | MSIQAALHHV THYRYDRPIA LGPQTIRLRP APHARTKVPA YALKVSPENH FINWQQDPAG NWLARLVFPE KTTEFRVEVD LTADLAVINP FDFFVEPYAE QRPFDYEADL KVALAPYLVL DEAQGPEIDA FLAEIPEEPH TVSFLVALNE KVRSHVAYGV RMEPGVQTAA ETLTLKSGSC RDSAWLLVQV LRRLGFAARF VSGYLIQLVP DTTAVDGPAG TTTDFTDLHA WAEVYLPGAG WIGFDATSGL LTGEGHIPLV ATAHYNAAAP ISGLAEPAKV EFAYEMTISR VAEAPRITKP FSDEIWAAMD ALGERIDADL TAQDVRLTMG GEPTFVSVDD FESPEWNVAA VGPTKRGLAD TLVRRLRDRF AAGGMLHYGQ GKWYPGESLP RWAFALYWRK DGVPIWRNAD LIAVEDGPKT PNTKTPNTKT ATIKDAERLI GALSERLELG RFVFPAYEDA VYWKARESEL PVNVTTAAAQ TGSLEADARF KRVFGRGLDN PVGYVLPLAN LAVGEGRAWI SEAWAFRRGG AYLSPGDSPM GFRLPLASLP YVPPDSYPYY HPQDPMEARG DLPAEPGGLK DAPLAKGKDR GNGAGIEGVA VRTALAVEPR DGVLCVFMPP LERVDEYLDL VGHLERAAEA IGQPIHIEGY EPPYDPRLSV IKVTPDPGVI EVNVHPAASW REAVDITAGL YEEARQTRLC AEKFMIDGRH TGTGGGNHVV LGGASPSDSP FLRRPDLLKS LVLFWQRHPC LSYLFAGLYV GPTSQAPRMD EARHDGLYEL EIALAQVPGP DEANIPHWLV DRLFRNILAD VTGNTHRAEI CIDKLFSPDG PTGRLGLLEF RSFEMPPDAR MSLAQQVLLR AIVAWLWREP QTGGCVRWGT ALHDRFMLPH FLWADFLSVL EDLRGGGYDF DPQAFAAQAA FRFPIFGRVE QGGVSLELRQ ALEPWHVLGE EGTAGGTVRF VDASVERLQV KVEGYVPGRH VIACNGRRLP MTSTGASGEA VAGLRFKAWQ PASSLHPTIP PHGPLTFDIL DVWSGRSLGG CRYHVSHPGG RSYDSFPINA YEAEGRRLAR FEAMGHTPGR IAMPAEERTS EFPLTLDLRT PAPR
|
| |