Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_2962 |
Symbol | |
ID | 8138305 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 3444936 |
End bp | 3446834 |
Gene Length | 1899 bp |
Protein Length | 632 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 644870560 |
Product | transglutaminase domain protein |
Protein accession | YP_003022749 |
Protein GI | 253701560 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1305] Transglutaminase-like enzymes, putative cysteine proteases |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 63 |
Fosmid unclonability p-value | 0.0859741 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGAAGC TAAAGATTGA AAAGCTGCTG AACCTCCTTG CGGGCTTAAT CGCGCTCCTG GGGTACCTCC CGCTGCAGCC GTACCTGGAC CCGCTCCCCC GCTATTTCTT CCCGGTCTCG CTACTGGGCG CGTTTTACCT GCAGCGCACC GGCCGCGCCC TGCCGACGCG CCTGCTCACC CCCCTCTCCA TCGCGCTCTT TCTCTACTAC GCCGTCGGCT TCAGCGTGGA TCGCCTGGTA CCTGTGACCG GGGACCTCCT GGTGCTCTTT CTGGCGGTGA GGCTCTTGGG CGACAGAAGC GGCAGGCATT ACCTGCAGGC CTTCGCGCTG TCGCTCTTTT GCCTGGCTGC CTCGTCGCTC TACGAGATTT CCGCCGTCTT CCTGCTTTAC CTGCTGCTGC TTTTGTTCCT CATGGCCGTT TCGCTGGTGC TGCTCACCTT CCATGCCCAC GACCCCGCCA TCGCCCTTGC CCCTGACCAG GGGAAGAAGG TGCTCGCCGT CTCGGTACTC ATTCCCGTGG CGTCGCTCCC CATCCTGCTC GTGCTCTTCG TGCTCCTGCC GAGGACGCAG TACCCCCTGT GGCATTTCCT GGCAGGAACT GCAGGGAAGA AGACCGGGCT TTCCGACAGC GTCCAGCCGG GGGACGCGGC CAGCGTCACC GAGGTGAAAG GGGCGGTGCT CAGGGCCATA ACCGGCAAGC TCCCGGAGGA GAAGCTTTAT TGGCGCGGGG TCGTGCTGAA CGGATTCCGC GGCGATTCGT GGGTGAGGCT TCCGGTGCCC GAGGAACTGC CACCGGTCCA AAGGGGGGGG GCGGTGCTCC AGGAGATCTA CCCGGAGCGT TCGCAAAGCT CCTACCTACT CGCCCTGAAT ACCACCCGCA GCATCTCGGG GCTGCGCCAC GACGAGGCCA ACGACGCCGT CTTCACTTCC CGGCGGCCGC TGGACAGGAA GGTGAAGTAC GTCGCGACGT CGGTCATCGG CACCCCCCTG GAGGTGAAGG GAGGGGTAGA CCGCGGCTTT TACCTGCAGC TTCCCTCGAC GCTACCCGAG CGCATCCTGG CCAAGGGGCG CGATCTCGCC CGCGCCGGCC TCGCCCCCCC GGAGCGGATG CGGCTTTTGG AAGCGTTCTT CCGCAACCAG AGGATCACCT ACGCCAACAC CGAGCTCCCG GTCGGCCCCC AGCCGCTGGA CTCGTTCCTC TTCGGCAAAA GGCGGGGCAA TTGCGAATAC TTCGCCTCCT CTTACGCCAC CCTGCTCAGG CTGGCCGGCA TCCCGTCTCG ACTGGTCGGG GGGTATCGCG GCGGAAGCTA CAACGACATG GGGGGATATT ACCTGGTCAC CGAGGACATG GCGCACGTCT GGGTCGAGGC GTATGTCGAC GGGGTGGGAT GGCAGACGGT CGATCCCAGC GCCTGGGCGA TCGGGTCGGC AAGGCGCGCC GCCTCCGCCA GGGGGATCTC GATGTACTTC GATGCCGTCA GCTTCTACTG GGACAAGGCG GTGGTAAGTT ACGACCTCGA CAAGCAGATC GCGCTGGTGA GGCGGGCCGG GGGCAAGGCA CGCGACCTGC GTCTCCCGGC GGGCTTCGTG CGGGGCTCGC TGGCGCTGTT GCTGCTCATG CTGCCGCTGG CGGCACTTGG CTTGTGGCTC AAGAAGAGGC CGGCGAGCCG GGAGGAGAGG GTGCTGAGGA AGATGCTGCG GGCCGCTCGG AAGCGCTACC CGGGGGACAC GAGCGGGGAG GAAGGGCTCT TCGAGCTGTC GGCGCGCCTG GACGATCCCC TGATCCGCGA GTTCGCATCC ATCTACGGCG GCGCCGTCTA TCGCGACAGA CCGCTACGCA AGGAAGAACT GGCGAGATTG AAGGAAATCG TCCGGGAACT GCGTCAGCAT GGTCCTTGA
|
Protein sequence | MAKLKIEKLL NLLAGLIALL GYLPLQPYLD PLPRYFFPVS LLGAFYLQRT GRALPTRLLT PLSIALFLYY AVGFSVDRLV PVTGDLLVLF LAVRLLGDRS GRHYLQAFAL SLFCLAASSL YEISAVFLLY LLLLLFLMAV SLVLLTFHAH DPAIALAPDQ GKKVLAVSVL IPVASLPILL VLFVLLPRTQ YPLWHFLAGT AGKKTGLSDS VQPGDAASVT EVKGAVLRAI TGKLPEEKLY WRGVVLNGFR GDSWVRLPVP EELPPVQRGG AVLQEIYPER SQSSYLLALN TTRSISGLRH DEANDAVFTS RRPLDRKVKY VATSVIGTPL EVKGGVDRGF YLQLPSTLPE RILAKGRDLA RAGLAPPERM RLLEAFFRNQ RITYANTELP VGPQPLDSFL FGKRRGNCEY FASSYATLLR LAGIPSRLVG GYRGGSYNDM GGYYLVTEDM AHVWVEAYVD GVGWQTVDPS AWAIGSARRA ASARGISMYF DAVSFYWDKA VVSYDLDKQI ALVRRAGGKA RDLRLPAGFV RGSLALLLLM LPLAALGLWL KKRPASREER VLRKMLRAAR KRYPGDTSGE EGLFELSARL DDPLIREFAS IYGGAVYRDR PLRKEELARL KEIVRELRQH GP
|
| |