Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_48900 |
Symbol | |
ID | 7763749 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 4949681 |
End bp | 4950799 |
Gene Length | 1119 bp |
Protein Length | 372 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 643807730 |
Product | agmatine deiminase |
Protein accession | YP_002801965 |
Protein GI | 226946892 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2957] Peptidylarginine deiminase and related enzymes |
TIGRFAM ID | [TIGR03380] agmatine deiminase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0724155 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCGAAC CGATGCGAAC CCTGAGCGGC ACGCCGCGCG CCGATGGCTT CCGCATGCCC GCCGAGTGGG AGCCGCACGC CCAGACCTGG ATGCTCTGGC CGGAACGCCC GGACAACTGG CGGCTCGGCG GCAAGCCGGC GCAGGCCGCC TTCGCCACGG TGGCCCGCGC CATCGCCCGC TTCGAGCCGG TGACCGTCGG CGTCTCGGCG GCGCAGTACG AGAACGCCTG CGTCCGCCTC GCCGGCGCCG ACATCCGGGT GGTCGAACTG AGCAGCGACG ACGCCTGGGT ACGCGACACC GGACCGACCT TCCTGGTCGA CGACGCCGGC GAGGTGCGCG GCGTGGACTG GACCTTCAAC GCCTGGGGCG GCTTCGCCGG CGGCCTGTAC GCGCCCTGGA ACCGCGACGA CCAGGTGGCG CGCAAGATCC TCGGTATCGA GCGCTGCGCC CGCTACCGCA CCGAAGGCTT CGTGCTGGAA GGCGGCGCCA TCCACGTCGA CGGCGAGGGC ACCCTGCTCA CCACCGAGGA ATGCCTGCTC AACCCGAACC GCAACCCGCA CCTCTCGCGC GAGGAGATCG AGGCCGTGCT CGCCGGGCAC CTGGCCGTCG AACGGGTGAT CTGGCTGCCG CAGGGGCTGT ACAACGACGA GACCGACGGC CATGTCGACA ACTTCTGCTG CTTCGTGCGT CCCGGCGAGG TGCTGCTGGC CTGGACCGAT GACCCGCAGG ACCCCAACCA CCCGCGCTGC CGGGCGGCGC TGGAGGTGCT CGAGCGGGTC CGCGACGCCC GCGGCCGGGC CCTGCGCGTG CAGCGGATGC CGATACCAGG TCCGCTGCAC GCCAGCGCGG AGGAATGCGC CGGCGTCGAC CCGGCCGCCG ACAGCCAGCC GCGCGATCCG TCGATCCGCC TGGCCGCCTC CTACGTGAAT TTCCTGATCG TCAACGGCGG CATCATCGCG CCGGCCTTCG ACGATCCGCG CGACGCCGAG GCCGAGGCCC TTCTCCGGCA GTCGTTCCCC GGGCGCGAGG TGTTGATGCT ACCCGGCCGC GAGATTCTCC TCGGCGGCGG CAACATCCAC TGCATCACCC AGCAGCAGCC GGCCGCACGA CGGCGCTGA
|
Protein sequence | MPEPMRTLSG TPRADGFRMP AEWEPHAQTW MLWPERPDNW RLGGKPAQAA FATVARAIAR FEPVTVGVSA AQYENACVRL AGADIRVVEL SSDDAWVRDT GPTFLVDDAG EVRGVDWTFN AWGGFAGGLY APWNRDDQVA RKILGIERCA RYRTEGFVLE GGAIHVDGEG TLLTTEECLL NPNRNPHLSR EEIEAVLAGH LAVERVIWLP QGLYNDETDG HVDNFCCFVR PGEVLLAWTD DPQDPNHPRC RAALEVLERV RDARGRALRV QRMPIPGPLH ASAEECAGVD PAADSQPRDP SIRLAASYVN FLIVNGGIIA PAFDDPRDAE AEALLRQSFP GREVLMLPGR EILLGGGNIH CITQQQPAAR RR
|
| |