Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_09300 |
Symbol | |
ID | 7759878 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 876759 |
End bp | 877949 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643803842 |
Product | diaminopimelate decarboxylase |
Protein accession | YP_002798144 |
Protein GI | 226943071 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0019] Diaminopimelate decarboxylase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.000620759 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGATGA ACCTGCCCGC CTCCGTCCAG CAGGCCATCG CCGGCCTGCG CCGTTTCCAT TCCGAGCCGT TGTGCGCCTA TGTCTACGAC CTGCCGGCGC TGGCGCGCCA CGCCCGCGCC CTGCGCGCGG CGCTGCCGGC CGGCTGCGAG CTGTTCTACG CCGCCAAGGC CAACCCCGAG GCACCGATCC TGCGCAGCCT GGCGCCCTGG GTCGACGGCT TCGAGGCAGC CTCCGGCGGC GAACTGCGCT GGCTCCACGA ATGCCATCCG GACAAGCCGC TGATCTTCGG CGGTCCCGGC AAGCTGGACA GCGAACTGGC CGCCGCCATG GACTGCGGGA TCTCCGCCTT CCACGTCGAG AGCATCGGCG AGTTGCGCCG CCTGGCGGTC GTCGCCAAGG CCCACGGCCG TCCGGCCCCG CTGCTGCTGC GACTCAACCT CAAGCTCGAC GAGGCGCCGG ACAGCCGCCT GGTGATGGGC GGCAAGCCGA CCCCCTTCGG CCTCGACGAG GAGGCCCTGG ACGAAGCCCT GGCGCTGCTC GCCGGCGAGC CCTGGCTGGA GCTTCAGGGC CTGCACTTCC ACCTGCTCTC CCATCAGTTG GACAGCGCCG CCCACCTGCA ACTGATGCGC AGCTACTTCC GCTGCTTCCG CCAGTTGAAC GCGCGCCACG GCCTGGATCT GAAACTGCTC AACGTCGGCG GCGGCATGGG CATCGATTAC CGGGACCACG CCCGCTCCTT CGACTGGCCG GGCTTCTGCG CAGGGCTGGC CGCGCTGATC GCCGAGGAAG GCATGGCCGG CACGCGCATC CGCTTCGAGA TCGGCCGCTT CCTCGCCGCC GCCTGCGGCT ACTACCTGAT GGAAGTGCTG GACATCAAGC GCAACCACGG CCAGTACTTC GCCGTGGCCC GCGGCGGCAC CCACCACTTC CGCACCCCGG CGGCCCAGGG CCACGACCAT CCCTTCGTGG TGCTGCGCGG CGAGCGCCCG GCCGAGCTGG AAAACCAGCC GGTGACCCTG GTCGGCCAGT TGTGCACGCC CAAGGACGTG CTGGCCCGTC AGCAGCCGGT GGCCGCCCTG GCGGTCGGCG ACCTGCTGGT GTTCCCCCTG GCCGGCGCCT ACGCCTGGAA CATCTCGCAC CAGCACTTTC TGATGCACGA GCCGCCGCGG ATGCTGTTCC TCGAAGCCTG A
|
Protein sequence | MPMNLPASVQ QAIAGLRRFH SEPLCAYVYD LPALARHARA LRAALPAGCE LFYAAKANPE APILRSLAPW VDGFEAASGG ELRWLHECHP DKPLIFGGPG KLDSELAAAM DCGISAFHVE SIGELRRLAV VAKAHGRPAP LLLRLNLKLD EAPDSRLVMG GKPTPFGLDE EALDEALALL AGEPWLELQG LHFHLLSHQL DSAAHLQLMR SYFRCFRQLN ARHGLDLKLL NVGGGMGIDY RDHARSFDWP GFCAGLAALI AEEGMAGTRI RFEIGRFLAA ACGYYLMEVL DIKRNHGQYF AVARGGTHHF RTPAAQGHDH PFVVLRGERP AELENQPVTL VGQLCTPKDV LARQQPVAAL AVGDLLVFPL AGAYAWNISH QHFLMHEPPR MLFLEA
|
| |