Gene Information |
Locus tag | Avin_04940 |
Symbol | |
ID | 7759451 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 465323 |
End bp | 466357 |
Gene Length | 1035 bp |
Protein Length | 344 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 643803415 |
Product | Transcriptional regulator, AraC family |
Protein accession | YP_002797723 |
Protein GI | 226942650 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
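The length fields above follow from the coordinates. As a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a stop codon, the reported 1035 bp and 344 aa can be checked as follows. The function names are illustrative and not part of any database API.

```python
# Coordinates copied from the record above (assumed 1-based, inclusive).
START_BP = 465323
END_BP = 466357


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by a complete CDS: codon count minus the stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1035
print(protein_length(length_bp))  # 344
```
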
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.543435 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCGCCC GCGAGCCCTG GTACGAGCGC GACAGCCGCT TCATCGCCGC CCATCAGCAG CCCGCCGGGC TGCTCGACCT GGGCCTGGGC CGCGGCATCG ACAGCCACCG CCTGCTACGC GGCAGCGGCC TGTTCCACGA GGACATCCTG CGCGGCGCGA CGCTGATCAG CCCCGAGCAG TACCTGCGCC TGATCGACAA CGCCCAGCGC CTGCTGGCGG CGGACGACAC CAGCTTCCTG TTCGGCCAGC GGCTGCTGCC AGGCCACCAG GGCGCCGCCA GCCAGGCCCT GGCGCAGGCC GGCAACCTGC TGGAGGCGCT GCGCCTGCTC GGCGAACTGC GTGCCCTGCT CTGCCCGCTG CTGAGCCCGC GGCTGGTCTG CGACGAGCGG TACGCCTATT TGTACTGGCT CGACAGTTGC GGCGCCGGGC CGTCGCTGCG TTTTCTGCTC GAAGCGCACA TGGCGGCGGC GACCGCCCTG GGCCGGCGGC TGAGCGGCGA ACGCCTGCCC TGGCGCTTCC ATTTCCGGCA TGCCGAGCCG CGCTGCGTCG AGCAGTACTG GGTGCATCTG GGCGAGGACC TGCATTTCGG CAGCCCCTGC GACCTGATGC GCCTGCCGCT CGACTGCCTG AACCGCCCGC TGCCGCAGGC CGCCCCGACG GCCGGCGCGG TGAGCCGCCG ACAGGCCCGC GCGCAACTGG AGAGCGCCGG CCCGGCGGCC AGCCTGCTCG ACCGCCTGTA CGACTGGCTG CTCGCGCACG TGCGCGAGGC GCCCGGCCTG GAGCGCGCCG CCGAAGCCTT CGCGATGAGC CCGACCACCT TCAAGCGCAA GCTGCGCAAG CACGCCACCA GCTACCAGGA GCAGCACGAC CGCGCCCGTC TGCACGTGGC GCTCTGGCTC CAGCAGGTGA AGGGCTACGG CAACGAGGCG ATCGCCAGCT ACCTGCACTT CCACGACGCC AACAACTTCC GCCGCTCGTT CAAGCGCTGG ACCGGCATGC CGCCCAGTTC GCTGCGCCAG TCGCTGTGCG GTTGA
|
Protein sequence | MSAREPWYER DSRFIAAHQQ PAGLLDLGLG RGIDSHRLLR GSGLFHEDIL RGATLISPEQ YLRLIDNAQR LLAADDTSFL FGQRLLPGHQ GAASQALAQA GNLLEALRLL GELRALLCPL LSPRLVCDER YAYLYWLDSC GAGPSLRFLL EAHMAAATAL GRRLSGERLP WRFHFRHAEP RCVEQYWVHL GEDLHFGSPC DLMRLPLDCL NRPLPQAAPT AGAVSRRQAR AQLESAGPAA SLLDRLYDWL LAHVREAPGL ERAAEAFAMS PTTFKRKLRK HATSYQEQHD RARLHVALWL QQVKGYGNEA IASYLHFHDA NNFRRSFKRW TGMPPSSLRQ SLCG
|
| |
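The sequences above are printed in space-separated blocks of 10. The sketch below shows one way to strip that grouping and compute GC content; the helper names are hypothetical, and only the first three blocks of the gene sequence are used to keep the example short. Running it on the full gene sequence should give a length matching the Gene Length field and a GC fraction close to the reported 72%, if the record is internally consistent.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the whitespace that splits a sequence into blocks of 10."""
    return "".join(grouped.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence from the record, for brevity.
prefix = clean_sequence("ATGTCCGCCC GCGAGCCCTG GTACGAGCGC")
print(len(prefix))               # 30
print(prefix.startswith("ATG"))  # True
print(round(gc_fraction(prefix), 2))
```

The gene sequence begins with ATG and ends with TGA, so it appears to be given in coding orientation even though the gene lies on the minus strand of the replicon.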