Gene Information |
Locus tag | Ava_3303 |
Symbol | |
ID | 3680295 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | + |
Start bp | 4123726 |
End bp | 4125465 |
Gene Length | 1740 bp |
Protein Length | 579 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 637718654 |
Product | hypothetical protein |
Protein accession | YP_323806 |
Protein GI | 75909510 |
COG category | [K] Transcription |
COG ID | [COG1293] Predicted RNA-binding protein homologous to eukaryotic snRNP |
TIGRFAM ID | |
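The start, end, gene length, and protein length fields above are mutually consistent. The following is a minimal Python sketch of that arithmetic. It assumes the coordinates are 1-based and inclusive (the record does not say so, but this matches the listed 1740 bp) and that the last codon of the CDS is a stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming a terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length_bp(4123726, 4125465)
print(length)                     # 1740
print(protein_length_aa(length))  # 579
```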
| |
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.1343 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAACCAG TAGACTTTAC CACCCTCACA GCTACTTGTA GCGAACTCCG CGCTCACTGG CTACCATCCC GCCTAGAGCA AGTTTATCAG CGCGATCGCT ACACTATTGC TATAGCATTA CGTACCCTCG ATAAAAGAGG TTGGTTACAA ATTTCTTGGC ATCCTCAAGC AACTCATATT TGTATTGGTG ATCCACCTCC ACGCACACCA GATACCTTTA CCTTCAGCCA ACAACTAGTC CACCAGTTGG GGGGATTAGC CTTAGTTGCA ATTGAAGCGA TCGCCCCTTG GGAGCGTGTA ATTGATTTAC AATTTGCCCG TCGCCCTGGA GATGCTGCAC TGTATCACAT CTATGGGGAA ATCATGGGTA AATACAGTAA CGTGATTCTC ACCGATGCCA ACAATCTAAT TATTACTGCT GCCCATCAAG TGAGTCAGCA ACAATCAAGT GTGCGTCCCA TCCTCACCGG ACAACCTTAT GAAACACCGC CAAAACTCAC CGGGACTATC CCCAGTTTGC AGGAAACTCA AGCACGTTGG CAAGAAAGAG TCAGTTTAGT GCCAGGAGCA ATTAAACGTC AGTTGCTCAA AAGTTATAGT GGCTTGAGTG CTGTGTTGGT AGAATCCATG TTATTGGTAG CCAACATTGC ACCAGAAACT TCTACTGATT CCCTAACTCC TGAAGACTGG CAACGATTAT TTGCACGCTG GCAAGAATGG CTACACACCT TAAATAGTGG TAAATTTCAA CCAGCTTGGA TGGCAGATGG ATATACAGTT ATGGGTTGGG GTGCTGTTGC ACCAGTCAAA GATATCCAAA CATTAATCAA CCAATACTAT ACCAAGCAAC TAAATCAACA ATTATTTGCC CAATTACGCC ATCAACTGAA TCAGAAATTA AGTAATATTT TAGGCAAATT ACGCAATAAA GCCCAAACCT TTAGCGATCG CCTACAGCAA TCAGATCGTG CTGATGAATA TCGCCAAAAA GCTGATTTAT TAATGGCGCA TCTGCAAAAT TGGGAACCGG GGATGAAAGA AATTAGCATA CCTGATTTTG AGACAGGTGA GCCTATGGCG ATCGCTCTTT CGCCTGATAA AAATGCTGTG CAGAATGCCC AAAATCTCTA CAAACAACAC CAAAAACTCA AACGCGCCCG CATAGCCGTC GAACCGCTAC TGCAAGAAGT ACAAGCAGAA ATCGATTATT TAGAACAAGT AGAAGCTGCT ATTGCCCAAA TAGATAACTA TCAAACAGCA GAAGATTTGC AAGCTTTAGA AGAAATCCGC GACGAATTAA TTGGACAGAA ATATTTAGAA GAGTTAGAGT ATCGTAGCCG CAATAACAAC GAAACTGCTA GCACTAACTT TCACAACTAT CGTACCCCTA ATGGCTTCAC AGTCTTAATC GGGCGCAACA ATCGCCAAAA TGACCAATTA ACATTTCGAG TAGCCGGAGA TTATGATTTA TGGTTCCATG CCCAAGAAAT CCCCGGAAGC CATGTACTAC TACGTTTAGA ACCGGGTGCA ATACCAGAAG TATCAGACTT ACAATATGTA GCTGATTTAA CAGCTTACTA CAGTCGCGGT CGTCAGAGTG ACCAAGTACC AGTCGTTTAC ACCCAACCCA AACACGTTTA TAAACCCAAA GGAGCTAAAC CAGGAATTGC TATTTACAAA CAGGAACGCA TCCTTTGGGG AAAACCGCAG TTAGTAGATA TAGAGAAAGT AGGAAGCTGA
|
Protein sequence | MQPVDFTTLT ATCSELRAHW LPSRLEQVYQ RDRYTIAIAL RTLDKRGWLQ ISWHPQATHI CIGDPPPRTP DTFTFSQQLV HQLGGLALVA IEAIAPWERV IDLQFARRPG DAALYHIYGE IMGKYSNVIL TDANNLIITA AHQVSQQQSS VRPILTGQPY ETPPKLTGTI PSLQETQARW QERVSLVPGA IKRQLLKSYS GLSAVLVESM LLVANIAPET STDSLTPEDW QRLFARWQEW LHTLNSGKFQ PAWMADGYTV MGWGAVAPVK DIQTLINQYY TKQLNQQLFA QLRHQLNQKL SNILGKLRNK AQTFSDRLQQ SDRADEYRQK ADLLMAHLQN WEPGMKEISI PDFETGEPMA IALSPDKNAV QNAQNLYKQH QKLKRARIAV EPLLQEVQAE IDYLEQVEAA IAQIDNYQTA EDLQALEEIR DELIGQKYLE ELEYRSRNNN ETASTNFHNY RTPNGFTVLI GRNNRQNDQL TFRVAGDYDL WFHAQEIPGS HVLLRLEPGA IPEVSDLQYV ADLTAYYSRG RQSDQVPVVY TQPKHVYKPK GAKPGIAIYK QERILWGKPQ LVDIEKVGS
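Both sequences above are printed in space-separated blocks of 10 characters. The sketch below is one possible way to work with them in Python: it removes the block spacing, computes the GC fraction, and translates the gene sequence. It assumes the amino acid assignments of translation table 11, which are the same as those of the standard genetic code (table 11 differs in its permitted start codons). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical and chosen only for this illustration.

```python
BASES = "TCAG"
# Amino acids in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the 10-character block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    # Step through complete codons only; a trailing partial codon is ignored.
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First two blocks of the gene sequence above.
print(translate("ATGCAACCAG TAGACTTTAC"))  # MQPVDF
```

Applied to the full gene sequence, `translate` can be compared with the listed protein sequence (after passing that through `clean` as well), and `gc_fraction` with the listed GC content.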