Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_25940 |
Symbol | |
ID | 7761505 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 2648544 |
End bp | 2650061 |
Gene Length | 1518 bp |
Protein Length | 505 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643805475 |
Product | sigma54-dependent activator protein |
Protein accession | YP_002799748 |
Protein GI | 226944675 |
COG category | [E] Amino acid transport and metabolism [K] Transcription |
COG ID | [COG3283] Transcriptional regulator of aromatic amino acids metabolism |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.935166 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTATTC ACGTCACCTT CATCGACCGC GTCGGTATCA CCCAGGAAGT CCTCGCTCTG CTCGGCGGGC GCAATCTCAA TCTGGACGCC GTGGAAATGG TTCCGCCCAA CGTCTACATC GACGCGCCGA CGCTCGGCCC CGAGGTCCTC GAGGAACTGC GCGACGCCCT GTTCAAGGTA CGCGGGGTGC GGGCGGTGGA GGTGGTCGAC ATTCTCCCCG GCCAGCGGCG CAGCCTGCAG CTCTACGCCC TGCTCGCCGC CATGCCCGAC CCGCTGCTCG CGGTGGACGG CAACGGCCGG GTGCTGCTGG CCAACCCGGC GCTGATCGAG CTGGCCGGCC GCGATCCCGA CGGCGAGACG CTCGGCGAAC TGTTCGCCGC CCCGGAGCTG CACGACGAAC TGCTGGCCCA CGGTTTCCGC CTGCCGATGC GCGAGGTCGT CCTCGACGGC CGGACCCTGC TGCTGGACGC CATGCCGATC GGCGAGAACG CCGGGCTGGC CGGCGCCCTG CTCAGCCTGT ACGCGCCCAA CCGCATCGGC GAGCGCCTGG CCGCCCTGCA TCACGAGCAG GCCGAAGGCT TCGATGCCCT GCTCGGCGAC TCGCCGGCGC TGCGTACGCT CAAGGCCCGC GCCCAGCGGG TGGCGGTGCT CGACGCGCCG CTGCTGATCC AGGGCGAGAC CGGCACCGGC AAGGAACTGG TGGCGCGCGC CTGCCACGCG GTCAGCGCGC GGCGCAATGC GCCTTTTCTC GCCCTCAACT GCGCGGCCCT GCCGGAGAAC CTCGCCGAGA GCGAGCTGTT CGGCTACGCC CCCGGCGCCT TCACCGGCGC CCAGCGCGGC GGCAAGCCCG GCCTGCTGGA ACTGGCTCAC CAGGGCACGG TATTCCTCGA CGAGATCGGC GAAATGTCGC CTTACCTGCA GGCCAAGCTG CTGCGCTTTC TCAGCGACGG CAGCTTCCGG CGCATCGGCG GCGAGCGCGA GATCCGGGTC GACGTGCGCA TCCTCAGCGC GACCCACCGC AACCTGGAGC AGATGGTTGG CGAGGGCCGC TTCCGCGAAG ACCTGTTCTA CCGGCTCAAC GTGCTCAACC TGGCCGTGCC GCCGCTGCGC GAGCGCAGCG AGGACATCCT GCCGCTGGCC CGCCACTTCC TCGACCAGGC CTGCGCGCAG ATCCGCCGCC CGCCCTGCCG CCTGGCCCCG GACACCTACC CGGCGCTGCT CGCCAACCGC TGGCCGGGCA ACGTGCGCCA GTTGCAGAAC GTGATCTTCC GCGCCGCGGC GATCAGCGAA GGCAATCTGG TGGAGATCGG CGACCTGGAC ATCGCCGGCA CCGCCGTGGC GCCGCAGGCC GACGAGATCG GCAGCCTCGG CGCGGCGGTC GAGAATTTCG AGCGCGGCAT CCTGGAAAGG CTCTACGCCA GCTATCCCTC CAGCCGCCTG CTGGCCGCCC GCCTGCAGAC CTCGCACACC GCCATCGCCA AGCGCCTGCG CAAGTACGGC ATTCCCGCCC GCTCCTGA
|
Protein sequence | MRIHVTFIDR VGITQEVLAL LGGRNLNLDA VEMVPPNVYI DAPTLGPEVL EELRDALFKV RGVRAVEVVD ILPGQRRSLQ LYALLAAMPD PLLAVDGNGR VLLANPALIE LAGRDPDGET LGELFAAPEL HDELLAHGFR LPMREVVLDG RTLLLDAMPI GENAGLAGAL LSLYAPNRIG ERLAALHHEQ AEGFDALLGD SPALRTLKAR AQRVAVLDAP LLIQGETGTG KELVARACHA VSARRNAPFL ALNCAALPEN LAESELFGYA PGAFTGAQRG GKPGLLELAH QGTVFLDEIG EMSPYLQAKL LRFLSDGSFR RIGGEREIRV DVRILSATHR NLEQMVGEGR FREDLFYRLN VLNLAVPPLR ERSEDILPLA RHFLDQACAQ IRRPPCRLAP DTYPALLANR WPGNVRQLQN VIFRAAAISE GNLVEIGDLD IAGTAVAPQA DEIGSLGAAV ENFERGILER LYASYPSSRL LAARLQTSHT AIAKRLRKYG IPARS
|
| |