Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_39910 |
Symbol | |
ID | 7762880 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 4042724 |
End bp | 4044286 |
Gene Length | 1563 bp |
Protein Length | 520 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 643806853 |
Product | metal dependent hydrolase |
Protein accession | YP_002801105 |
Protein GI | 226946032 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.776901 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCTCGAC GCGGCTTCCT CTCCAGGCTT GCAGCGATTA CCGCGGGTTT CGCCCATGCC CGCCCCGCTT TCGCGCAGTC GCTGGCATCC GGCCCGCTCG GGGCCCGCGT CATCCCGTCG ACCGGCAGGC GCTCGCCGGC GATTGGCACG GGCGGTTCCG GGCGTCTCGA GCGGGAGCGC GGCCAGGCTT CGCCGCCGAA AGCCATGAGG CGCCTGCTGT CCCTCCTGCT GACCGCCTGC CTGCCCTGGC AGGCAGAAGC CGCGACCGAG GCCGAGGCGC TGCTCGTACA CGGCGGCTAC GTGATGACGA TGGACCCGAC ACTGGGCGAC ATCGACGGCG GCGAGGTACT GATCCGCGAC GGCCGCATCG TCGCCGTCGG CCGCGGCCTG GACGCCGGCG ACGCCCACCG CATCGACGCG CGCGGGCAGG TCGTGCTGCC GGGCTTCGTC GACACGCATT CGCACCTGTA CGTGACCACG ATGCGCGGGC AGTTCCGCAA CCGCGACGGG CAGTTCTTCC CGGTGAGTTC GCGCCTGGCC GCGGCCATGA CGCCGGAGGA CACCCGTACC GCCATGCAGC TCGGCGCCCT GGAACTGCTG CAGGGCGGCA TCACCACCAC CGCCGACTTC TTCGACAACA TCCTCACCCC GGCGCACGGC GAGGCCGGAG TGCAGGCGCT GGAGGCGTCC GGCATCCGCG CGGTGATGTA CTACGGCGGT CCGGACAAGA CCACCCGCCA TCCCATCGAC CTCGCGCAGT TGCGCGCCCT GGCCGAACGC CGGGGCAAGG ACGCGCGGGT ACGAATCGGC CTGGCCTGGC GCTTGCCGCG CGATCGCGGG GATGCGGACA ACTGGGCGAT GCGCCAGCGC GAATACGACA CCGCGCGCGG CCTCGGCCTG CCGATCCAGG TGCACGTCAG CGGCGAGCCC GCCCCGATGT TCGAGGCGCT GATCCAGCGC GATTACCTGT TTCCCGGCCT GACCGTCGTG CATGCCACCG ATGCCGGCCA GGAGCGGCTG CAGGCACTCG AACGGGCCGG CGGCGGCCTG GCGCTGACAC CGCCGAGCGA GCAGCGCGTC GGCTACGGGC TGACCCGGCT GGACCACTTC GCCACGGTGA CCCGGCAGGG CCTGGGCATC GACGGCAATT CGCTGGCCGG CAGCGCCGAC ATGTTCGCCA CGCTGCGACT GGCGGCGCTG ACCTGGAGCG GCGGCGCGCG GGACGAGCGG GCGCCCGCTC CGCGCGCGCT GCTGGAACTG GCCACCCGCC GTGGCGCCGA GGCCGTGGGC CTGGGCGACG AGGTCGGTAC GCTGGCGCCG GGCAAGCGCG CCGACCTGCA GGTCATCGAT CCGGCTGCGC TGAATCTGGG CGGCTTCGGC GGCGGCGACC CGGCCGCGCT GCTGGTCTAT TCGGCGCGCC CGGACAACGT CCGCACGGTG CTGGTCGACG GCCGCCTCGT CAAGCGGGAC GGTCAACCGG TCGGCGTGGA CGCGGCGGAC CTGCTGGAGC GCGCCCGGCG CTCCGCCCGG GACCTGCTTG ACCGCAGCCG ATCCTCTCCC TGA
|
Protein sequence | MPRRGFLSRL AAITAGFAHA RPAFAQSLAS GPLGARVIPS TGRRSPAIGT GGSGRLERER GQASPPKAMR RLLSLLLTAC LPWQAEAATE AEALLVHGGY VMTMDPTLGD IDGGEVLIRD GRIVAVGRGL DAGDAHRIDA RGQVVLPGFV DTHSHLYVTT MRGQFRNRDG QFFPVSSRLA AAMTPEDTRT AMQLGALELL QGGITTTADF FDNILTPAHG EAGVQALEAS GIRAVMYYGG PDKTTRHPID LAQLRALAER RGKDARVRIG LAWRLPRDRG DADNWAMRQR EYDTARGLGL PIQVHVSGEP APMFEALIQR DYLFPGLTVV HATDAGQERL QALERAGGGL ALTPPSEQRV GYGLTRLDHF ATVTRQGLGI DGNSLAGSAD MFATLRLAAL TWSGGARDER APAPRALLEL ATRRGAEAVG LGDEVGTLAP GKRADLQVID PAALNLGGFG GGDPAALLVY SARPDNVRTV LVDGRLVKRD GQPVGVDAAD LLERARRSAR DLLDRSRSSP
|
| |