Gene GM21_0557 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_0557 
Symbol 
ID8135868 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp682916 
End bp683914 
Gene Length999 bp 
Protein Length332 aa 
Translation table11 
GC content64% 
IMG OID644868170 
ProductArginase/agmatinase/formiminoglutamase 
Protein accessionYP_003020389 
Protein GI253699200 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0010] Arginase/agmatinase/formimionoglutamate hydrolase, arginase family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value8.30365e-16 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAACAGCA AAGACATCCC GATGGTTCCC AACAGGAAGG CCTCGCTCCC CACCGTCTAT 
GGCGACACTC CCTCTTTTCT CGGAGTACCC GTTCTGGATT ACAAGAAACC TGCAGCAGGC
TACGACGTGA TGGTCGCCGG GGTCCCCTGG GAAGGGACCG TCACCTGGGG CTCCTTCACA
GGGTGCGAGC TCGCTCCCAG GAGCATCCGG CACGCCTCGG CGCGTTACGG CGGATTCCTC
CCCGAGTACG AGATCGACCT GTTCGACCAC CTGACGCTCG GCGACATCGG GGATATACCG
ATACACCCCA ATGACCCCGC CGAGACGATG CGCAACGTGC ACGCCGCCAT GCAGCGGATC
TACCGCAACC AGAGCATCCC CTTCGTGCTG GGAGGCGACC ACTCCTTCAC CCCGGAGATC
ATCAGGGCGC TCGCGGACGG AGACGAGGGC AAGATCGGCA TCATCCACCT GGACGCACAC
CTCGACAACG CCAAGTCCTT CGGCAGCGAC CAGTTCGCCC GCTGCGGCCC GATCCACAGG
ATCTCCCAGA TCCCGCAGGT CCGCAAAGAG AGCATCGTCC ATCTGGGTAT CCGCGGCCCG
AGGAACTCCC CGACACAGTA CGAGTATGCC CAAAGCATGG GCGCGCGCGT CATCACCACC
AGGGAAGTCA GGGGAAGGGG GATGAGCGCC GTCACCGAGG AGGCGATACG GATCGCGCAC
CACGAAACCA GGCACGTCTT CGTCACCATC TGCAGCGACT GCATCGATGC CGGGTACAAC
CCGGGGGGGC CGGCCGATTT CAACGGGCTG CTCCCCAGCG AGCTTTTGCC GGCGCTGCAC
CAAATCGGAG CCTCGGGCAT CAGCGGCCTA GATTACGTCG AGGTTTATCC GGGGCAGGAC
CCGCAGGGAT ATTCCTCGCA CCTGGCTGCC TGGGCGATGA TCTACGCGCT CTCGGGTATG
GCGCAGCGAA AGCGCGACCG GCCGGGACCG GACCGGTAA
 
Protein sequence
MNSKDIPMVP NRKASLPTVY GDTPSFLGVP VLDYKKPAAG YDVMVAGVPW EGTVTWGSFT 
GCELAPRSIR HASARYGGFL PEYEIDLFDH LTLGDIGDIP IHPNDPAETM RNVHAAMQRI
YRNQSIPFVL GGDHSFTPEI IRALADGDEG KIGIIHLDAH LDNAKSFGSD QFARCGPIHR
ISQIPQVRKE SIVHLGIRGP RNSPTQYEYA QSMGARVITT REVRGRGMSA VTEEAIRIAH
HETRHVFVTI CSDCIDAGYN PGGPADFNGL LPSELLPALH QIGASGISGL DYVEVYPGQD
PQGYSSHLAA WAMIYALSGM AQRKRDRPGP DR