Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_33090 |
Symbol | |
ID | 7762205 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 3382538 |
End bp | 3383599 |
Gene Length | 1062 bp |
Protein Length | 353 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 643806175 |
Product | Cobalamin biosynthesis CobC protein |
Protein accession | YP_002800439 |
Protein GI | 226945366 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase |
TIGRFAM ID | [TIGR01140] L-threonine-O-3-phosphate decarboxylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0570805 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTTGAGC ACGGCGGCCG TCTGCGCGTC GCGGCGCGGT ATTACGGCAT TCCCCAGGGC GACTGGCTCG ACCTGTCCAC CGGCATCGCC CCCGAGCCCT GGCCCGTGCC GTCCATGTCG TCCGATGCCT GGGCGCGCCT GCCGGAAGAG GACGACGGCC TGGCCGAGGC CGCCTGCGCC TGCTACGGCG CCGCCCAGGC CTTGCCGGTG GCTGGCAGCC AAGCGGCGAT CCAGGCGCTG CCGGCGCTGT TCGCGGCGGG AAGGGTTGGC GTGCTGGCGC CGAGCTACGC CGAGCACGCC CAGGCCTGGC GACGCGCCGG TCACCGGCTG CTCCATCTGG CCGCCGGAGA CATCGAGGCG CGCCTCGACG AGCTCGACAT GCTGGTGCTG GCCAACCCCA ACAACCCCAC CGGCGAGCGC TTCGAGCCGT CCCGGCTGCT CGACTGGCAG GCGCGGCTGG CCCGGCACGG CGGCTGCCTG CTGGTCGACG AGGCGTTCAT GGACTGCACG CCCGAATACA GCCTGGCGGC CCACAGCCAA CGGCCGGGGC TGGTCGTGCT GCGCTCGTTC GGCAAGTTCT TCGGCCTGGC CGGCGTTCGC CTGGGCTTCG TGCTGGCCGA GACGGGACTG CTGGCGCGGC TGCACGAGCG CCTCGGTCCC TGGACGGTCA GCGGGCCGGC GCGGGCGCTC GGCCTGCAGG CGCTGGGGCC GGCCGGCGGC GCGGCGCGCG AACGACGCGC CGGGCAATTG CGGGCGGCGG GAAAGCGGCT GGCGGCCTTG CTGGACGCGC ACGGGTTGGC GCCGGCCGGC AGCACCGCGC TGTTCCAGTG GGTGCGGATG CCCGATGCGG CGCGGTTGCA CGACTTTCTC GCCCGCCAGG GCATTCTGGT GCGCCTGTTT GAGACGCCCG CCAGCCTGCG CTTCGGCCTG CCGGCGGACG AGCGCGGCTG GCAACGGCTG GCGCAGGCGC TGGCCGACCC GGCACGACCC CGTGGGCGCG GACTCGTTCG CGAGCGGGCG GGGACGATCG CCGACGAATC CGCAAGCAGG GAGTTTCCAT GA
|
Protein sequence | MLEHGGRLRV AARYYGIPQG DWLDLSTGIA PEPWPVPSMS SDAWARLPEE DDGLAEAACA CYGAAQALPV AGSQAAIQAL PALFAAGRVG VLAPSYAEHA QAWRRAGHRL LHLAAGDIEA RLDELDMLVL ANPNNPTGER FEPSRLLDWQ ARLARHGGCL LVDEAFMDCT PEYSLAAHSQ RPGLVVLRSF GKFFGLAGVR LGFVLAETGL LARLHERLGP WTVSGPARAL GLQALGPAGG AARERRAGQL RAAGKRLAAL LDAHGLAPAG STALFQWVRM PDAARLHDFL ARQGILVRLF ETPASLRFGL PADERGWQRL AQALADPARP RGRGLVRERA GTIADESASR EFP
|
| |