Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_1746 |
Symbol | |
ID | 8137077 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 2034918 |
End bp | 2035946 |
Gene Length | 1029 bp |
Protein Length | 342 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 644869358 |
Product | Agmatine deiminase |
Protein accession | YP_003021558 |
Protein GI | 253700369 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2957] Peptidylarginine deiminase and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 112 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGGAAGACA CCAGAAGATT GCCGGCAGAA TGGGAACCGC AGGACGGGGT GCTCCTCGCC TGGCCGCACG AAAACAGCGA CTGGGCGCCG TACCTGGATG CGGTGGAGCC GGTTTTCGCA GAGATAGTGA CGCAGATAAG CCGTTTCGAG ACCGCCGTCG TCGCGGCCGC CGATCCCGGC GAGGTACGCG AAAAGCTCGC CGCCCGCGGC GCGAACCTGG AAAGGGTACG GATACACCAG GTCGACGCCA ACGACACCTG GGCCCGGGAC TTCGGCCCCA TCGCCGTTGA GGAAAACGGC GCCCCCCGGC TCCTCAACTT CGGCTTCAAC GGCTGGGGCC TCAAGTTCCC TTCCGACCTG GACAACAGGA TCAACAAGCG GCTCCAGGCC CTGGGGGTTT GGGGCACGCC GCTCGACACG GTCGGGCTCA TCCTCGAAGG TGGGAGCATC GAGAGCGACG GCCAAGGCAC CATCCTCAGC ACCGAGGAGT GCCTGATGAA CGACAACCGG AACCCGCATC TCACCCGCGT CGAACTGGAG GAGGAACTGC ACGGGCTTTT CGGCAGCGAC CGTTTCCTCT GGCTCGCCAA CGGCTACCTG GCCGGCGACG ACACCGACTC GCACGTGGAC ACGCTGGCCC GGCTTTGCCC GGACGACACC ATCGCCTACG TCCGCTGCGA CGACCCGGAT GACGAGCACT ACCATGCGCT TGCGGCGATG GAGCAGGAGA TCCTCTCCTT CCGGACCCGC GACGGCCGCC CGTACCGCGC CATCCCCCTC CCCTGGCCCG GCGCCAGGTT CGACGAGGAG GGCGAGCGCC TCCCGGCCAC CTACGCCAAC TTCCTGGTGG TGAACGGCGC CGTGCTGGTG CCGACCTACC GGGACGAAAA GGACGCCGCC GCCCTTAAAG CCGTCGGGGA AGCGTTCCCC GGGCGGGAGA TCGTCGGCAT AGACTGCCTG CCGCTCATAC TGCAGCACGG CTCGCTGCAC TGCGTCACCA TGCAGCTCCC CAAGGGAAGC CTGAAATAG
|
Protein sequence | MEDTRRLPAE WEPQDGVLLA WPHENSDWAP YLDAVEPVFA EIVTQISRFE TAVVAAADPG EVREKLAARG ANLERVRIHQ VDANDTWARD FGPIAVEENG APRLLNFGFN GWGLKFPSDL DNRINKRLQA LGVWGTPLDT VGLILEGGSI ESDGQGTILS TEECLMNDNR NPHLTRVELE EELHGLFGSD RFLWLANGYL AGDDTDSHVD TLARLCPDDT IAYVRCDDPD DEHYHALAAM EQEILSFRTR DGRPYRAIPL PWPGARFDEE GERLPATYAN FLVVNGAVLV PTYRDEKDAA ALKAVGEAFP GREIVGIDCL PLILQHGSLH CVTMQLPKGS LK
|
| |