Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ava_4098 |
Symbol | |
ID | 3681563 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | - |
Start bp | 5091103 |
End bp | 5092311 |
Gene Length | 1209 bp |
Protein Length | 402 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 637719446 |
Product | hypothetical protein |
Protein accession | YP_324594 |
Protein GI | 75910298 |
COG category | [K] Transcription |
COG ID | [COG2390] Transcriptional regulator, contains sigma factor-related N-terminal domain |
TIGRFAM ID | [TIGR02937] RNA polymerase sigma factor, sigma-70 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.342554 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.804601 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTTCAC GTCAAAATAT CACTGAACAG TTTACGACCT TCTTGCAATT CGATACTGAC AAAGCAATTA GTTGGGAAGT TGATGCTAGA CTGCACAGAA ATCTAATTAA CTGTCAAACA CGTCTTTCAC AGCCAGAAAA TTCCGAATCT TTTTGGGTGT GTTACTGGTA TAAAGTATGG CAAGAGAAAC CTGATGGTTT AGCCAGGGGA CATTTTTCAG CTTATTTACA AGAAGTCTGT TACTGGGCTG TTCACAAAAT AGTTACTAAT CTTTCCTCTA CCCAACACAC ACTATCTGAC TGCTTTCAAA TGGTGATCAT TCGGGTTGAT AAGGTGTTGA AAGGGTTTAA ACCCAACCTG AACTTTAACC TGAAAAATTA TGCTAGTGCG ATTTTTAGCA GCGAATTCAA AGAAATGCTG CGATCGCAAA ACGAAATTGA TATTTGTACT AATTGGCGAT TATTAAGAAA GTTAACTCAA AAGCGGCTAG TAGAGTCTTT ACAAAATCAG GGAATCCATG AGGATGTTAT CCAACGCTAT GTATTAGCGT GGAAATGCTA TCAAGCGTTA TACGCTCCTA AGCAAATTGT CGGTACTCGC AAATTATCTA GACCAGATGA TGCCACATGG ACAGCGATCG CTCAACTTTA TAATTTCCAA CGTCAAACCC AACTTTCACA ACCAGGGCCA GAGTGCAGTC GAGAAACCAT AGAGAAATGG TTGATTATTT CTGCTCAAGC CGTAAGAGAT TATTTATACC CCAATATGAT TTCCCTGAAT GTCTCAGTAG GGGAGGAAAA TTCTGGAGAT GAATATATAG ATATAGTACC GCAACTCAAG CAGGAGTCCT TGATGACAGA AATTCTGGCT CAAGAAGAAC TGCTCGACCG ACAATCACAA CAAGCGCAAA TTAGCAACGT CCTAGTTACA GCTTTAACTG AACTAGATCC AGAAGCCCAC AATATCATCC AACTTTACTA CAGAGAAGGC TTAACTCAAC AGCAGATTGC TCAACAGCTA GGAATTAAGC AGTACACCAT TTCTCGTAGG CTGGCTAAAG TCAAAGATTT TTTGCTCCTC AAACTTGTAA CGTGGAGTCA ACAATCGCTG CATATTTCTT TAAACTCGCT GGTACTCAAC TATGTAAGCA CCATTCTCGA AGAATGGTTA CAAACTTACT ATCACCGTTC TCTCTCTGAA TCACAATAG
|
Protein sequence | MRSRQNITEQ FTTFLQFDTD KAISWEVDAR LHRNLINCQT RLSQPENSES FWVCYWYKVW QEKPDGLARG HFSAYLQEVC YWAVHKIVTN LSSTQHTLSD CFQMVIIRVD KVLKGFKPNL NFNLKNYASA IFSSEFKEML RSQNEIDICT NWRLLRKLTQ KRLVESLQNQ GIHEDVIQRY VLAWKCYQAL YAPKQIVGTR KLSRPDDATW TAIAQLYNFQ RQTQLSQPGP ECSRETIEKW LIISAQAVRD YLYPNMISLN VSVGEENSGD EYIDIVPQLK QESLMTEILA QEELLDRQSQ QAQISNVLVT ALTELDPEAH NIIQLYYREG LTQQQIAQQL GIKQYTISRR LAKVKDFLLL KLVTWSQQSL HISLNSLVLN YVSTILEEWL QTYYHRSLSE SQ
|
| |