Gene M446_5075 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_5075 
Symbol 
ID6131808 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp5566374 
End bp5569616 
Gene Length3243 bp 
Protein Length1080 aa 
Translation table11 
GC content72% 
IMG OID641645210 
Producttransglutaminase domain-containing protein 
Protein accessionYP_001771835 
Protein GI170743180 
COG category[E] Amino acid transport and metabolism
[S] Function unknown 
COG ID[COG1305] Transglutaminase-like enzymes, putative cysteine proteases
[COG4196] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.804608 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.120829 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCGATCA AGGCTGCCCT GCACCACGTC ACCTCCTACA CCTACGACCG TCCGGTGAGC 
CTCGGCCCGC AGGTGATCCG CCTGCGGCCG GCGCCGCACA CGCGCACCCG CATCCTGAGC
TATTCGCTCA AGGTCACGCC CGGCAACCAC TTCGTGAACT GGCAGCAGGA CCCGAGCGGC
AACTGGCTCG CCCGGCTCGT CTTCCCCGAG AAGACGACGC AGTTCCGGGT CGAGGTCGAC
ATCGCCGCCG ACATGGCGGT GATCAACCCG TTCGACTTCT TCGTCGACGA TTACGCCCAG
ACCCTGCCCT TCGCGTATCC GGCGGAGCTG CAGGAGGAAC TCGCGCCCTA CCTGGTGCCG
AACGACGGCG GGCCGCTGCT CGACGCGTTC CTGACGCGCC TGCCCGAGGA GAAGAACACC
GTCCTGTTCC TGGTCGCGCT CAACGAGATG GTGCGGGACA GCGTCAATTA CGTGGTGCGC
ATGGAGCCGG GCGTGCAGAC GCCCGACGAG ACGCTCGACG CCGCCAGCGG CTCCTGCCGG
GATTCGGCCT GGCTCCTCGT CCAGGTGCTG CGCCGCCTCA ACCTCGCGGC CCGGTTCGTC
TCCGGCTACC TGATCCAGCT CGTCCCCGAC ACCACCGCGG TCGACGGGCC GGCCGGCACC
TCCAAGGACT TCACCGACCT GCACGCCTGG GCCGAGGTCT ACGTGCCGGG CGCGGGCTGG
ATCGGCCTCG ACGCAACCTC CGGCCTGCTC TGCGGCGAGG GCCACATCCC GCTCGCCGCC
ACGCCGCACT ACCGCTCGGC CGCGCCGATC TCCGGCCTCG CGGACCCCGC GGAGGTGGAT
TTCCACTTCG AGATGAACGT GATGCGGGTC GCCGAGGCGC CCCGGGTGAC GCGGCCGTTC
TCGGACGAGG CCTGGGGCGC CATGGACGCG CTCGGGGACC GGATCGACCG GGACCTCGCC
GCCCAGGACG TGCGCCTGAC CATGGGCGGC GAGCCGACCT TCGTGTCGGT GGACGATTTC
CAGTCGCCCG AGTGGAACAC CGCGGCGGTG GGCCCGACCA AGCGGGCGCT CGCCGACCAG
CTGATCCGCC GCCTGCGCGA GCGCTTCGCG CCGGGCGGCC TCCTGCATTA CGGCCAGGGC
AAGTGGTATC CGGGCGAGAG CCTGCCCCGC TGGGCCTTCG CCCTCTACTG GCGCAAGGAC
GGGCAGCCGA TCTGGCGCGA CGAGGCGCTG ATCGCCCGCG TGGCCGGGCC GCAGGATACC
GGCATCGAGC ACGCGGAGGC CTTCGGCAAG GCGCTCGCCA AGGCGCTCGG CCTGGGGCCC
TACCTGCAGC CGACCTTCGA GGACCCGGTC TACTGGGAGC GCAAGGAGGC GGAACTCCCG
ATCAACACCA CGCCGCTTCA GCCGCGGGTC GGCGACGCCG AGTTCCAGGA GCGGATGGCG
CGGATCTACA GGCGCGGCCT CACGGAGCCG GTCGGCTACG TGCTGCCCCT CGCCAGCGTG
CAGGCCGGGG CCGGGAGGGT CTGGGTCTCG GAGAAGTGGC AGACGCGCCG GGGAGCCCTC
TACCTCGCGG CGGGCGACTC GCCGGTGGGC TTCCGGCTGC CGCTCAACTC CCTGATCTCC
CTGCCGCCGG AGGAGTTTCC CTACTACGCC CCGCAGGATC CGCTGGAGGC GCGGGGTCCC
CTGCCCTCCC GGCCCGCGGC GCGCAGCCGC CCCGTCCCCG GCGTCCCGGT GCGCACCGCG
CTCGCGATCG AGCCGCGGGA CGGGGCGGTG TGCGTGTTCA TGCCGCCCGT CGAGCGGGCG
GACGAGTACG TCGAACTCGT CGCGACCCTG GAGAAGGCCG CGGCCGAGAC CGGCATCCCG
ATCCACATCG AGGGCTACGA GCCGCCCTAC GACCCGCGCC TCGGGGTGAT CAAGGTCACG
CCCGATCCCG GCGTGATCGA GGTCAACGTG CACCCGGCCC GCACCTGGCG GGAGGCGGTC
GAGATCACCA CCGGCCTCTA CCAGGACGCC CGCGAGATCC GGCTCGGCGC GCAGAAATTC
ATGATCGACG GGCGCCACAC CGGCACCGGC GGCGGCAACC ACGTGGTGCT CGGCGGCGCG
ACCCCGGCGG ATTCGCCCTT CCTGCGCCGG CCCGACCTGC TGAAGAGCCT CGTGCTCTAC
TGGCAGCGCC ACCCGTCGCT GTCCTACCTG TTCTCGGGCC TCTACATCGG CCCGACGAGC
CAGGCGCCGC GCATGGACGA GGCGCGCCAC GACGGGCTCT ACGAGCTGGA GATCGCCCTC
GCGCAGGTGC CGCCCCCGGG CGGGCCCGAG GTGCCGCTCT GGCTCGTCGA CCGGCTCTTC
CGCAACGTCC TCGTCGACGT GACCGGCAAC ACCCACCGGG CCGAGATCTG CATCGACAAG
CTCTACTCGC CGGACGGGCC GACGGGGCGG CTCGGCCTGC TCGAATTCCG CTCCTTCGAG
ATGCCGCCGG ACGCGCGCAT GAGCCTCGCC CAGCAATTGC TGCTGCGGGC GATCATCGCG
TGGCTGTGGC GCGAGCCGCA GGAGGGCGGC TTCGTCCGCT GGGGCACCGC GCTCCACGAC
CGCTTCATGC TGCCCCACTT CCTCTGGCAG GATTTCCTCG GCGTGCTCGG CGACCTGCGC
GGGGCGGGCT ACGCCTTCGA CCCGGTCTGG TACCGGGCGC AGGCCGAGTT CCGCTTCCCC
CTCTACGGCA CGGTCCAGCA CGGCGGCGTC ACGCTGGAAC TGCGCCAGGC CCTCGAACCC
TGGCACGTGC TGGGCGAGGA GGGCTCGTCC GGCGGCACGG TGCGCTTCGT CGATTCCTCG
GTCGAGCGGC TGCAGGTGCG CGTCGAGGGC TACGTGCCGA GCCGGCACGT CGTCACCTGC
AACGGGCGGC GCCTGCCCCT GACCGAGACC GGGCGCTCCG GCGAGGCGGT GGCGGGCCTG
CGCTTCAAGG CCTGGCAGCC GGCCTCGGCG CTGCACCCGA TGATCCCGGT CCATTCGCCG
CTGACCTTCG ACATCGTCGA CGCGTGGTCG GGCCGCTCGC TGGGCGGTTG CACCTACCAC
GTGTCCCATC CGGGCGGGCG CAATTACGAG ACCTTCCCGG TCAACACCTA CGAGGCGGAG
GGGCGGCGCC TCGCCCGCTT CCAGGACCAC GGCCACACGC CCGGCCGGGT CACCCCCGCC
CCCGAGGAGC CGCGGCGCGA ATTCCCGCTC ACCCTCGACC TGCGCGCGCC CGCGCCCCGA
TGA
 
Protein sequence
MSIKAALHHV TSYTYDRPVS LGPQVIRLRP APHTRTRILS YSLKVTPGNH FVNWQQDPSG 
NWLARLVFPE KTTQFRVEVD IAADMAVINP FDFFVDDYAQ TLPFAYPAEL QEELAPYLVP
NDGGPLLDAF LTRLPEEKNT VLFLVALNEM VRDSVNYVVR MEPGVQTPDE TLDAASGSCR
DSAWLLVQVL RRLNLAARFV SGYLIQLVPD TTAVDGPAGT SKDFTDLHAW AEVYVPGAGW
IGLDATSGLL CGEGHIPLAA TPHYRSAAPI SGLADPAEVD FHFEMNVMRV AEAPRVTRPF
SDEAWGAMDA LGDRIDRDLA AQDVRLTMGG EPTFVSVDDF QSPEWNTAAV GPTKRALADQ
LIRRLRERFA PGGLLHYGQG KWYPGESLPR WAFALYWRKD GQPIWRDEAL IARVAGPQDT
GIEHAEAFGK ALAKALGLGP YLQPTFEDPV YWERKEAELP INTTPLQPRV GDAEFQERMA
RIYRRGLTEP VGYVLPLASV QAGAGRVWVS EKWQTRRGAL YLAAGDSPVG FRLPLNSLIS
LPPEEFPYYA PQDPLEARGP LPSRPAARSR PVPGVPVRTA LAIEPRDGAV CVFMPPVERA
DEYVELVATL EKAAAETGIP IHIEGYEPPY DPRLGVIKVT PDPGVIEVNV HPARTWREAV
EITTGLYQDA REIRLGAQKF MIDGRHTGTG GGNHVVLGGA TPADSPFLRR PDLLKSLVLY
WQRHPSLSYL FSGLYIGPTS QAPRMDEARH DGLYELEIAL AQVPPPGGPE VPLWLVDRLF
RNVLVDVTGN THRAEICIDK LYSPDGPTGR LGLLEFRSFE MPPDARMSLA QQLLLRAIIA
WLWREPQEGG FVRWGTALHD RFMLPHFLWQ DFLGVLGDLR GAGYAFDPVW YRAQAEFRFP
LYGTVQHGGV TLELRQALEP WHVLGEEGSS GGTVRFVDSS VERLQVRVEG YVPSRHVVTC
NGRRLPLTET GRSGEAVAGL RFKAWQPASA LHPMIPVHSP LTFDIVDAWS GRSLGGCTYH
VSHPGGRNYE TFPVNTYEAE GRRLARFQDH GHTPGRVTPA PEEPRREFPL TLDLRAPAPR