Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_3417 |
Symbol | |
ID | 5833681 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 3791183 |
End bp | 3792085 |
Gene Length | 903 bp |
Protein Length | 300 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641369216 |
Product | transglutaminase domain-containing protein |
Protein accession | YP_001640874 |
Protein GI | 163852831 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1305] Transglutaminase-like enzymes, putative cysteine proteases |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.331503 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 0.52884 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGATCTACA GCCTGCGTCA CCTGACGACC TACCGCTATG CGCGGGCCGT CCGCTTCGCG CGCTGCAACC TGCGCCTTCG CCCCCGCGAC GGCGAGGGGC AGCGGGTGCT GGAGAGCGCG CTTCACGTCA CGCCCACGCC GACCAGCCGT CTCGCGCGGC GCGACTTCTT CGGCCTCGAC ACGCTGACCC TGACCCTCGA CGAGCCCCAC CGCGAATTCA CGGTCGAGGC GGTCTCCCGC GTCGCCGTGG AGCGTCCGTC CCCGCCGCCG CCCGAGTCGG GCCTGCCCTG GGAGGCCGTG CGGGCGGCGG CCTTGGCGCT GCCCTCGCTC GGGCCGGACG GGCCGGCGCA TTTCCAGTTC CGCAGCCAGC GGGTGCCTCT CGAACCGGCG GTGACGGACT ATGCCCGCGC CAGCTTCCCG CCGGGGCGCA GCGCCTATGG CGGCGCGGTC GAGCTGATGC AGCGCATTCG CGACGACTTC CGTTTCGACG CCAAGGCGAC CACCGTCTCG ACGCCGCTGG CGGAAGCCTT CGCCTTGAGG GCGGGCGTCT GCCAGGACTT CACCCACGTC ATGATCGCCG GTCTTCGCGG GCTCGGCCTG CCGGCGGCCT ATGTCAGCGG CTACCTGCGC ACCCGGCCGC CGGCGGGGCG CCCGCGCCTG CGCGGGGCGG ATGCCAGCCA CGCCTGGGTC GCCCTCTGGT GCGGACCCGA GACGGGCGCG GGGGAGGGGG GCTGGATCGG CCTCGACCCG ACCAATGCCT GCGTCGTGCG CGACGACCAC ATCGTTGTGG CACGCGGACG CGATTACGCC GACGTCGCCC CCATCGATGG CATGGTCGCC TCCGCCGGGG AGCAGAAGCT CACCGTCGAG GTCGACGTCA TCCCCGAGGA CGAGGCGGCC TGA
|
Protein sequence | MIYSLRHLTT YRYARAVRFA RCNLRLRPRD GEGQRVLESA LHVTPTPTSR LARRDFFGLD TLTLTLDEPH REFTVEAVSR VAVERPSPPP PESGLPWEAV RAAALALPSL GPDGPAHFQF RSQRVPLEPA VTDYARASFP PGRSAYGGAV ELMQRIRDDF RFDAKATTVS TPLAEAFALR AGVCQDFTHV MIAGLRGLGL PAAYVSGYLR TRPPAGRPRL RGADASHAWV ALWCGPETGA GEGGWIGLDP TNACVVRDDH IVVARGRDYA DVAPIDGMVA SAGEQKLTVE VDVIPEDEAA
|
| |