Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_3887 |
Symbol | |
ID | 8139261 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 4474723 |
End bp | 4476159 |
Gene Length | 1437 bp |
Protein Length | 478 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644871504 |
Product | transglutaminase domain protein |
Protein accession | YP_003023662 |
Protein GI | 253702473 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1305] Transglutaminase-like enzymes, putative cysteine proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 77 |
Fosmid unclonability p-value | 0.978849 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTCTCC CATTAAACCT ACTGCGGCTG ATTATCGCCT CGCTGCTGGT GCTGCTGCCG CTCCAGCTTT TCGCAGCTCC CCTTCCCAAG CTGACCGAGC TGCCGCTCGG GGAGCGCTGG TTCAGCATCA AGCTCGGCGA CGAGCGGGTC GGCTTCAGCC GGCTCACCAT CACCCGGACC GATGCGGGCT ACCGCATCGA TTCCGAGGGG AGCGTGAAGA TGCGGGTGAT GGGGTTCTCC AGGGAGGCCA CCTCCAAGGA GTCCTACCTG GTCGCCCGGG ACCTCGCGCT GCAATCATTC TCGGCAGAGA ACCGCATCGA CGGCAGCCCC GTCACCACTA CCGGCGAGGT CACCCCTAAG GGAGTGACAA TCGTCTCCCA GTCCGGCAGC GGGAAGAAGG AGCGGACCCT CAAGTTGAAG GGGGCGCTTT ACCCCCCCCA CGCGCTGAAC CTTTACCCGC TCATGCAGGG TGCGACCAAG GGTAAAAGCT ACAAGCTCCC GGTGCTGGAC GTGGAAGCGC TTAAGGTGAA GCAGCTCAAG GTCGAGGTGG TCGGCGAGGA GACGCTTCCG CCGGGGCGGC AGGTGATCCA TCTGAAGAAC GACGCCTACC CCATGGTCGA TAACGATATA TGGGTGGACC TCCAGGGGAA CGTGCTCAAG GAGTCGGTCC GCGACGACCT GGTGGTCACC GAGGCCGAGG ACGAGGCGAC CGCCCGTGCG GAACTTGCGC GCGACGCGCT CTGCAAGACG GACCTGGTGC TGGATTTCAG CATGATCCGG GTCACCCCCC CTATCGAGCG TCCAGCCCAA TTGCAGAAGC TGGTACTGGA GATGACAGGC ATCCCGTCGC AGCTGCCGCT TTTGCAGGGG ACAAGGCAGC AGGTGGTGCG CAGGGCCGAC GGTACGGTGC TCGTCACCAT GCCCAACCCC GCCCTAGTCG CCGAAGCGCC CCCAACCGCC GCCGATCTGG AACCGGCGGA GCGGATACCG AGCGACCACC CGGAGATAAA GGCGAAAGCC GTGGAGATAA TGGGGAGCGA GCAGGATCCG GCTCAGGTAG CGAAGCTTCT GTCCGACTGG GTGGCCCGCG AGATAAAGGG GGCAGTGACC GACAGCCAGT CCCCCCTGGA AACGCTCAAA ACCCGCATCG GCAACTGCCA GAGCCACGCG AGGCTCTACG CTTCCCTCGC ACGCGCCTCC GCAATCCCGA CCCGCTTCGT GTCCGGGATC GTCCACCAGG GGGAGGGCTT TCTCTACCAT AGCTGGGCGG AAAGCTACTT GGGCGGCGCC TGGGTCCCCA TCGATCCCAC CTTCGGCGAG ATGCCTGCCA ACCTGAGCCA CGTCAAGTTC GTCGATGGGG AAACGCTGGA CGAAATGGGG TCGCTGGCGG GGATGATCGG GAAGGTACGG GCGAAAGTCG TGGAAAAGCG GTACTGA
|
Protein sequence | MTLPLNLLRL IIASLLVLLP LQLFAAPLPK LTELPLGERW FSIKLGDERV GFSRLTITRT DAGYRIDSEG SVKMRVMGFS REATSKESYL VARDLALQSF SAENRIDGSP VTTTGEVTPK GVTIVSQSGS GKKERTLKLK GALYPPHALN LYPLMQGATK GKSYKLPVLD VEALKVKQLK VEVVGEETLP PGRQVIHLKN DAYPMVDNDI WVDLQGNVLK ESVRDDLVVT EAEDEATARA ELARDALCKT DLVLDFSMIR VTPPIERPAQ LQKLVLEMTG IPSQLPLLQG TRQQVVRRAD GTVLVTMPNP ALVAEAPPTA ADLEPAERIP SDHPEIKAKA VEIMGSEQDP AQVAKLLSDW VAREIKGAVT DSQSPLETLK TRIGNCQSHA RLYASLARAS AIPTRFVSGI VHQGEGFLYH SWAESYLGGA WVPIDPTFGE MPANLSHVKF VDGETLDEMG SLAGMIGKVR AKVVEKRY
|
| |