Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | M446_5075 |
Symbol | |
ID | 6131808 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium sp. 4-46 |
Kingdom | Bacteria |
Replicon accession | NC_010511 |
Strand | + |
Start bp | 5566374 |
End bp | 5569616 |
Gene Length | 3243 bp |
Protein Length | 1080 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641645210 |
Product | transglutaminase domain-containing protein |
Protein accession | YP_001771835 |
Protein GI | 170743180 |
COG category | [E] Amino acid transport and metabolism [S] Function unknown |
COG ID | [COG1305] Transglutaminase-like enzymes, putative cysteine proteases [COG4196] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.804608 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.120829 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCGATCA AGGCTGCCCT GCACCACGTC ACCTCCTACA CCTACGACCG TCCGGTGAGC CTCGGCCCGC AGGTGATCCG CCTGCGGCCG GCGCCGCACA CGCGCACCCG CATCCTGAGC TATTCGCTCA AGGTCACGCC CGGCAACCAC TTCGTGAACT GGCAGCAGGA CCCGAGCGGC AACTGGCTCG CCCGGCTCGT CTTCCCCGAG AAGACGACGC AGTTCCGGGT CGAGGTCGAC ATCGCCGCCG ACATGGCGGT GATCAACCCG TTCGACTTCT TCGTCGACGA TTACGCCCAG ACCCTGCCCT TCGCGTATCC GGCGGAGCTG CAGGAGGAAC TCGCGCCCTA CCTGGTGCCG AACGACGGCG GGCCGCTGCT CGACGCGTTC CTGACGCGCC TGCCCGAGGA GAAGAACACC GTCCTGTTCC TGGTCGCGCT CAACGAGATG GTGCGGGACA GCGTCAATTA CGTGGTGCGC ATGGAGCCGG GCGTGCAGAC GCCCGACGAG ACGCTCGACG CCGCCAGCGG CTCCTGCCGG GATTCGGCCT GGCTCCTCGT CCAGGTGCTG CGCCGCCTCA ACCTCGCGGC CCGGTTCGTC TCCGGCTACC TGATCCAGCT CGTCCCCGAC ACCACCGCGG TCGACGGGCC GGCCGGCACC TCCAAGGACT TCACCGACCT GCACGCCTGG GCCGAGGTCT ACGTGCCGGG CGCGGGCTGG ATCGGCCTCG ACGCAACCTC CGGCCTGCTC TGCGGCGAGG GCCACATCCC GCTCGCCGCC ACGCCGCACT ACCGCTCGGC CGCGCCGATC TCCGGCCTCG CGGACCCCGC GGAGGTGGAT TTCCACTTCG AGATGAACGT GATGCGGGTC GCCGAGGCGC CCCGGGTGAC GCGGCCGTTC TCGGACGAGG CCTGGGGCGC CATGGACGCG CTCGGGGACC GGATCGACCG GGACCTCGCC GCCCAGGACG TGCGCCTGAC CATGGGCGGC GAGCCGACCT TCGTGTCGGT GGACGATTTC CAGTCGCCCG AGTGGAACAC CGCGGCGGTG GGCCCGACCA AGCGGGCGCT CGCCGACCAG CTGATCCGCC GCCTGCGCGA GCGCTTCGCG CCGGGCGGCC TCCTGCATTA CGGCCAGGGC AAGTGGTATC CGGGCGAGAG CCTGCCCCGC TGGGCCTTCG CCCTCTACTG GCGCAAGGAC GGGCAGCCGA TCTGGCGCGA CGAGGCGCTG ATCGCCCGCG TGGCCGGGCC GCAGGATACC GGCATCGAGC ACGCGGAGGC CTTCGGCAAG GCGCTCGCCA AGGCGCTCGG CCTGGGGCCC TACCTGCAGC CGACCTTCGA GGACCCGGTC TACTGGGAGC GCAAGGAGGC GGAACTCCCG ATCAACACCA CGCCGCTTCA GCCGCGGGTC GGCGACGCCG AGTTCCAGGA GCGGATGGCG CGGATCTACA GGCGCGGCCT CACGGAGCCG GTCGGCTACG TGCTGCCCCT CGCCAGCGTG CAGGCCGGGG CCGGGAGGGT CTGGGTCTCG GAGAAGTGGC AGACGCGCCG GGGAGCCCTC TACCTCGCGG CGGGCGACTC GCCGGTGGGC TTCCGGCTGC CGCTCAACTC CCTGATCTCC CTGCCGCCGG AGGAGTTTCC CTACTACGCC CCGCAGGATC CGCTGGAGGC GCGGGGTCCC CTGCCCTCCC GGCCCGCGGC GCGCAGCCGC CCCGTCCCCG GCGTCCCGGT GCGCACCGCG CTCGCGATCG AGCCGCGGGA CGGGGCGGTG TGCGTGTTCA TGCCGCCCGT CGAGCGGGCG GACGAGTACG TCGAACTCGT CGCGACCCTG GAGAAGGCCG CGGCCGAGAC CGGCATCCCG ATCCACATCG AGGGCTACGA GCCGCCCTAC GACCCGCGCC TCGGGGTGAT CAAGGTCACG CCCGATCCCG GCGTGATCGA GGTCAACGTG CACCCGGCCC GCACCTGGCG GGAGGCGGTC GAGATCACCA CCGGCCTCTA CCAGGACGCC CGCGAGATCC GGCTCGGCGC GCAGAAATTC ATGATCGACG GGCGCCACAC CGGCACCGGC GGCGGCAACC ACGTGGTGCT CGGCGGCGCG ACCCCGGCGG ATTCGCCCTT CCTGCGCCGG CCCGACCTGC TGAAGAGCCT CGTGCTCTAC TGGCAGCGCC ACCCGTCGCT GTCCTACCTG TTCTCGGGCC TCTACATCGG CCCGACGAGC CAGGCGCCGC GCATGGACGA GGCGCGCCAC GACGGGCTCT ACGAGCTGGA GATCGCCCTC GCGCAGGTGC CGCCCCCGGG CGGGCCCGAG GTGCCGCTCT GGCTCGTCGA CCGGCTCTTC CGCAACGTCC TCGTCGACGT GACCGGCAAC ACCCACCGGG CCGAGATCTG CATCGACAAG CTCTACTCGC CGGACGGGCC GACGGGGCGG CTCGGCCTGC TCGAATTCCG CTCCTTCGAG ATGCCGCCGG ACGCGCGCAT GAGCCTCGCC CAGCAATTGC TGCTGCGGGC GATCATCGCG TGGCTGTGGC GCGAGCCGCA GGAGGGCGGC TTCGTCCGCT GGGGCACCGC GCTCCACGAC CGCTTCATGC TGCCCCACTT CCTCTGGCAG GATTTCCTCG GCGTGCTCGG CGACCTGCGC GGGGCGGGCT ACGCCTTCGA CCCGGTCTGG TACCGGGCGC AGGCCGAGTT CCGCTTCCCC CTCTACGGCA CGGTCCAGCA CGGCGGCGTC ACGCTGGAAC TGCGCCAGGC CCTCGAACCC TGGCACGTGC TGGGCGAGGA GGGCTCGTCC GGCGGCACGG TGCGCTTCGT CGATTCCTCG GTCGAGCGGC TGCAGGTGCG CGTCGAGGGC TACGTGCCGA GCCGGCACGT CGTCACCTGC AACGGGCGGC GCCTGCCCCT GACCGAGACC GGGCGCTCCG GCGAGGCGGT GGCGGGCCTG CGCTTCAAGG CCTGGCAGCC GGCCTCGGCG CTGCACCCGA TGATCCCGGT CCATTCGCCG CTGACCTTCG ACATCGTCGA CGCGTGGTCG GGCCGCTCGC TGGGCGGTTG CACCTACCAC GTGTCCCATC CGGGCGGGCG CAATTACGAG ACCTTCCCGG TCAACACCTA CGAGGCGGAG GGGCGGCGCC TCGCCCGCTT CCAGGACCAC GGCCACACGC CCGGCCGGGT CACCCCCGCC CCCGAGGAGC CGCGGCGCGA ATTCCCGCTC ACCCTCGACC TGCGCGCGCC CGCGCCCCGA TGA
|
Protein sequence | MSIKAALHHV TSYTYDRPVS LGPQVIRLRP APHTRTRILS YSLKVTPGNH FVNWQQDPSG NWLARLVFPE KTTQFRVEVD IAADMAVINP FDFFVDDYAQ TLPFAYPAEL QEELAPYLVP NDGGPLLDAF LTRLPEEKNT VLFLVALNEM VRDSVNYVVR MEPGVQTPDE TLDAASGSCR DSAWLLVQVL RRLNLAARFV SGYLIQLVPD TTAVDGPAGT SKDFTDLHAW AEVYVPGAGW IGLDATSGLL CGEGHIPLAA TPHYRSAAPI SGLADPAEVD FHFEMNVMRV AEAPRVTRPF SDEAWGAMDA LGDRIDRDLA AQDVRLTMGG EPTFVSVDDF QSPEWNTAAV GPTKRALADQ LIRRLRERFA PGGLLHYGQG KWYPGESLPR WAFALYWRKD GQPIWRDEAL IARVAGPQDT GIEHAEAFGK ALAKALGLGP YLQPTFEDPV YWERKEAELP INTTPLQPRV GDAEFQERMA RIYRRGLTEP VGYVLPLASV QAGAGRVWVS EKWQTRRGAL YLAAGDSPVG FRLPLNSLIS LPPEEFPYYA PQDPLEARGP LPSRPAARSR PVPGVPVRTA LAIEPRDGAV CVFMPPVERA DEYVELVATL EKAAAETGIP IHIEGYEPPY DPRLGVIKVT PDPGVIEVNV HPARTWREAV EITTGLYQDA REIRLGAQKF MIDGRHTGTG GGNHVVLGGA TPADSPFLRR PDLLKSLVLY WQRHPSLSYL FSGLYIGPTS QAPRMDEARH DGLYELEIAL AQVPPPGGPE VPLWLVDRLF RNVLVDVTGN THRAEICIDK LYSPDGPTGR LGLLEFRSFE MPPDARMSLA QQLLLRAIIA WLWREPQEGG FVRWGTALHD RFMLPHFLWQ DFLGVLGDLR GAGYAFDPVW YRAQAEFRFP LYGTVQHGGV TLELRQALEP WHVLGEEGSS GGTVRFVDSS VERLQVRVEG YVPSRHVVTC NGRRLPLTET GRSGEAVAGL RFKAWQPASA LHPMIPVHSP LTFDIVDAWS GRSLGGCTYH VSHPGGRNYE TFPVNTYEAE GRRLARFQDH GHTPGRVTPA PEEPRREFPL TLDLRAPAPR
|
| |