Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_20280 |
Symbol | |
ID | 7760955 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 2018454 |
End bp | 2019383 |
Gene Length | 930 bp |
Protein Length | 309 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643804924 |
Product | transcriptional regulator, AraC family |
Protein accession | YP_002799207 |
Protein GI | 226944134 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0229385 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAATTCA GCGAGAAGCC CTCGGATCTC GAGGTGATCC TCAACGAGCC GCAGCACAGC TTCCGCTGGT ACGAGCACGA TTACCCCTTT CCCTTGGCGC GCTGGAACCA TCACCCCGAG TTCGAGATCC ACCTGATCCG GCGCGGCACC GGCAAGTTGC TCGCCGGGGA CTATATCGGC CATTTCGGCC CCGGCCATGT GGCGCTGATA GGCCCCGGAC TGCCCCACGA CTGGATCGGC GATCTGGCGC CGGAGCAATG CATCCGCGGG CGCGACCTGG TCCTGCAGTT CGACGGGGAA AGCCTGCTGC GGTTGCGCGA AAGGCTGCCC GAACTGGGCG AACTCCAGCC CCTGTTCGAG CGGGCCCGGC GGGGCGTCGA GTTCACGGGG CAAACGGCCC GCCATGCCGC CCGCCTGCTG GAAGCCATCG GCCGGGCGGC CGGCCTCAGG CGCCTGCTGC TGTTTCTGTC GCTCCTGGAA ACCCTGGCCC GCGCCGGCGC CGCCGAGAGC CGCACGCTGG CCAGCCCCCA CTACTCGCCC CGCTTGGACA GCCTGACGAC CAGACGGATC AACCAAGTGT TCGATTACAT CCTGGACGAT CTGGCCGGCG AGATCCGCCT GTCGGTCATC GCGCGGCGCC TGCGGATGAG CGAGCCGGCC TTCTCGCGCT TCTTCAAACG CACGACCGGG CACACCTTCG TCGATCTGAT CCGCAAGCTG CGCATCCAGC GGGCCTGCCG CCTGCTGCTG CAAAGCCGGC AGTCGATCGC GGACATCTGC TTCCAGGTCG GCTATGGCAA TCTGTCGAAC TTCAACCGCC ACTTCCGTCA CGAGATGCAC GAGACGCCCA GCGAGTACCG CCGCCGGCTC GGGCAGTCTC CCGGCCCGGC GGCCGGCGGA TCGGCAGCGG CCAGCGCCGA TCCGCCATGA
|
Protein sequence | MEFSEKPSDL EVILNEPQHS FRWYEHDYPF PLARWNHHPE FEIHLIRRGT GKLLAGDYIG HFGPGHVALI GPGLPHDWIG DLAPEQCIRG RDLVLQFDGE SLLRLRERLP ELGELQPLFE RARRGVEFTG QTARHAARLL EAIGRAAGLR RLLLFLSLLE TLARAGAAES RTLASPHYSP RLDSLTTRRI NQVFDYILDD LAGEIRLSVI ARRLRMSEPA FSRFFKRTTG HTFVDLIRKL RIQRACRLLL QSRQSIADIC FQVGYGNLSN FNRHFRHEMH ETPSEYRRRL GQSPGPAAGG SAAASADPP
|
| |