Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_0557 |
Symbol | |
ID | 8135868 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 682916 |
End bp | 683914 |
Gene Length | 999 bp |
Protein Length | 332 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644868170 |
Product | Arginase/agmatinase/formiminoglutamase |
Protein accession | YP_003020389 |
Protein GI | 253699200 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0010] Arginase/agmatinase/formimionoglutamate hydrolase, arginase family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 8.30365e-16 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAACAGCA AAGACATCCC GATGGTTCCC AACAGGAAGG CCTCGCTCCC CACCGTCTAT GGCGACACTC CCTCTTTTCT CGGAGTACCC GTTCTGGATT ACAAGAAACC TGCAGCAGGC TACGACGTGA TGGTCGCCGG GGTCCCCTGG GAAGGGACCG TCACCTGGGG CTCCTTCACA GGGTGCGAGC TCGCTCCCAG GAGCATCCGG CACGCCTCGG CGCGTTACGG CGGATTCCTC CCCGAGTACG AGATCGACCT GTTCGACCAC CTGACGCTCG GCGACATCGG GGATATACCG ATACACCCCA ATGACCCCGC CGAGACGATG CGCAACGTGC ACGCCGCCAT GCAGCGGATC TACCGCAACC AGAGCATCCC CTTCGTGCTG GGAGGCGACC ACTCCTTCAC CCCGGAGATC ATCAGGGCGC TCGCGGACGG AGACGAGGGC AAGATCGGCA TCATCCACCT GGACGCACAC CTCGACAACG CCAAGTCCTT CGGCAGCGAC CAGTTCGCCC GCTGCGGCCC GATCCACAGG ATCTCCCAGA TCCCGCAGGT CCGCAAAGAG AGCATCGTCC ATCTGGGTAT CCGCGGCCCG AGGAACTCCC CGACACAGTA CGAGTATGCC CAAAGCATGG GCGCGCGCGT CATCACCACC AGGGAAGTCA GGGGAAGGGG GATGAGCGCC GTCACCGAGG AGGCGATACG GATCGCGCAC CACGAAACCA GGCACGTCTT CGTCACCATC TGCAGCGACT GCATCGATGC CGGGTACAAC CCGGGGGGGC CGGCCGATTT CAACGGGCTG CTCCCCAGCG AGCTTTTGCC GGCGCTGCAC CAAATCGGAG CCTCGGGCAT CAGCGGCCTA GATTACGTCG AGGTTTATCC GGGGCAGGAC CCGCAGGGAT ATTCCTCGCA CCTGGCTGCC TGGGCGATGA TCTACGCGCT CTCGGGTATG GCGCAGCGAA AGCGCGACCG GCCGGGACCG GACCGGTAA
|
Protein sequence | MNSKDIPMVP NRKASLPTVY GDTPSFLGVP VLDYKKPAAG YDVMVAGVPW EGTVTWGSFT GCELAPRSIR HASARYGGFL PEYEIDLFDH LTLGDIGDIP IHPNDPAETM RNVHAAMQRI YRNQSIPFVL GGDHSFTPEI IRALADGDEG KIGIIHLDAH LDNAKSFGSD QFARCGPIHR ISQIPQVRKE SIVHLGIRGP RNSPTQYEYA QSMGARVITT REVRGRGMSA VTEEAIRIAH HETRHVFVTI CSDCIDAGYN PGGPADFNGL LPSELLPALH QIGASGISGL DYVEVYPGQD PQGYSSHLAA WAMIYALSGM AQRKRDRPGP DR
|
| |