Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ava_3693 |
Symbol | |
ID | 3679112 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | + |
Start bp | 4601220 |
End bp | 4604210 |
Gene Length | 2991 bp |
Protein Length | 996 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 637719044 |
Product | hypothetical protein |
Protein accession | YP_324194 |
Protein GI | 75909898 |
COG category | [S] Function unknown |
COG ID | [COG1615] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTTGGA AATGGTGCTT TCGACTCTCA ATAGTCTTTG TAGGACTTTG GCTACTCTTG GATTTGAGTT CCCGCTTGGG GGCAGAGATT TTTTGGTTTC AAGAGGTTGG CTATCTGCAA GTATTTCTCC TGCGGCTGGC GAGTCGTGGG GCTTTATGGG TGGTTGCTGT GGGTGTAACT GCTGTCTATC TGTGGGGAAA TTTAACTTTG GCGCAACGGC TAAAGTATCC CCGGTCTTTG AAGATTGCGG AGGTTAGGCG AGAAGAAGCA GAGTTGAGTG TGGGTCTGAA AAACTTTCTC AGTCCTCAAT ATTCTCTGTT AAATGCGCCT AAAATTCATG ATGCTGGCCA CTTAAAACCT TTCAGATTGC GTTGGCTGCT ACCCTTGACT GTGGTCTTCA GCTTATTGGC AGGGTTAATT TTAGTTCACT ATGGAAAAAT AGCTCTTGCT TACTGGTATC CAGCTTTTAA CAAGAATAGT TTACCGATAA TTACTCCATT TCGTTTAGAA ACTATCTGGG AACTGGGCAG GCAGGTTTTT TCCCAAGTTT TATATCTGGG TCTCATTGTC GGCGTAGCGA TCGCTATTCT TATTTACTCA CAATTTTTCC TCAGGGCGAT CGCTGTTATT CTCAGTGTTG TGTTTGGGAC AATTCTTTTT CACAACTGGG CTAAGGTTTT ACAGTATTTC TCCCCTACAC CCTTCAACAG CACTGACCCT TTATTTGGGA AAGATATCAG CTTTTATATA TTTTCCCTGC CATTGTGGGA ATTGCTAGAA CTGTGGTTAA TGGGGATGTT TTTGTATGGC TTTATTGCTG TAACTCTTAC CTATCTCCTC TCAGCTGACA GTCTCAGTCA AGGAATTTTC CCTGGTTTTT CACCCCAACA GCAACGCCAT CTCTACGGTA TGGCTGGCTT ATTAATGTTG ATGGTGGCTT TCAGTTATTG GCTGAGTCGT TATGAGTTGG TTTATTCGCC TCGCGGGGTG AGTTATGGCG CTAGTTACAC GGATGTGGTC GTACAGTTAC CCATTTATAA CATCTTGTGT GTTCTGGGAT TAGCGATCGC ATTTTACCTG TTGTGGCGGA CAATTTTCTG GCGCGCTAAA TCTCAGTATC GCCAATTTGT CTTTTACGGA TTGGGTGTTT ATTTGTTTGT GGTCGTAGCG GCTGGGTCTG TTTTACCTAC AGTAGTCCAG TATTTGATTG TTCAACCTAA CGAATTACAA CGGGAACAAC CATACATTCA ACGTACAATT GCCTTGACTA GGCAAGCATT TAGTTTAGAA ACAATTGATG CTAGAACTTT TAACCCCCAA GGAAATTTAA CTACAGCCGC TATCCAAGCT AATGATTTGA CGATTCGTAA CATTCGTCTG TGGGATAAGC GACCACTGTT AGAAACTAAC CGCCAACTGC AACAATTCCG CCCTTACTAT CGCTTCCCTG ACGCAGATAT CGACCGCTAC ACCTTAGAAG CGGAAGCAGC CGCAAATAGA CCAGTAAACC CTAACCAGTT GCCAGCACCA ACAGAACGAC GACAGGTATT AATTGCACCC AGGGAACTAG ATTACAGTGC AGTCCCAGAG CAGGCGCAAA CATGGATCAA CCAGCATTTA ATTTATACTC ACGGTTACGG GTTTACCATG AGTCCGGTCA ATACGGCTGG GCCTGGTGGA CTACCAGAAT ATTTTGTCAA AGATATTGCT GGAAGTAACG AAGGCGCACT TTCTACTTCC AGTGAAGCAG TTCGTGACAG CATTCCTATT GGGCAACCCC GACTTTATTA CGGTGAAATT ACCAATACTT ATGTAATGAC TGGTACAAAG GTGAGGGAGT TAGACTATCC CAGTGGTAGT GATAATGCGT ACAATGCTTA TGATGGTTTG GGTGGTGTCA TTATAGGCAA TGGTTGGCGA CGGGGACTAT TTGCCATGTA TTTAAAAGAT TGGCAAATGT TGTTTACGCA GGACTTTTTA CCAGAGACAA AGGTATTATT TCGCCGGGAT GTCAAGCAGA GAATTCAGGC GATCGCACCT TTTTTAAAAT TTGATAGTGA CCCCTATTTA GTTGCGGCTG ATGGTAGTCC AGCATTTCCA GGGCAGAATA ATTACTTGTA TTGGATTGTC GATGCTTACA CGACGAGCGA TCGCTATCCC TACTCAGACC CCGATAATAA TGGCATAAAT TACATTCGTA ACTCTGTCAA AGTAGTTATT GATGCTTACA ACGGCAGTGT AAAATTTTAC ATTGCAGATG CGACAGATCC CATCATTGCT ACTTGGTCAG CTATATTTCC CCAGATGTTT CAGCCATTGA GTGATATGCC AGTTACTCTC CGCAGCCATA TCCGCTATCC ATTAGATTAC TTTGGCATCC AATCAGAGCG GTTAATGACC TATCACATGA CTGACACCCA AGTATTTTAC AACCGAGAAG ACCAATGGCA AATCCCCAAT GAAATTTATG GCAGTGAAAG CCGTCCAGTA GAACCTTATT ATTTGATTAC TAGTTTACCT ACCGTCCCCT TTGAAGAATT TCTTCTCCTG CTACCTTATA CCCCCAAACA ACGGACTAAC TTAATTGCTT GGTTAGCCGC GCGATCTGAT GGTGAGAACT ACGGTAAATT GTTACTGTAT AACTTTCCTA AGGAACGGCT TGTATTCGGG CCAGAGCAAA TAGAAGCACG TATTAACCAA GACCCAGTAA TTTCCCAGCA AATTTCCTTA TGGAATCGTC AGGGTTCGAG GGCAATTCAG GGGAATTTGT TAGTAATTCC CATCGAACAA TCTCTGTTAT ATGTGGAGCC AATTTACCTG GAAGCAACAC AAAATAGCTT ACCAACTCTC GTGCGGGTAG TCGTAGCTTA CGAAAACCGT ATTGTCATGG CACAGACCTT GGAACAAGCT TTACAGGCTA TCTTTCAGCC AGAAGTCACA CCAGCACCAG CAATTATTCG TCCTTTCGAG GAAGTTACTC CACCAGGTTA A
|
Protein sequence | MFWKWCFRLS IVFVGLWLLL DLSSRLGAEI FWFQEVGYLQ VFLLRLASRG ALWVVAVGVT AVYLWGNLTL AQRLKYPRSL KIAEVRREEA ELSVGLKNFL SPQYSLLNAP KIHDAGHLKP FRLRWLLPLT VVFSLLAGLI LVHYGKIALA YWYPAFNKNS LPIITPFRLE TIWELGRQVF SQVLYLGLIV GVAIAILIYS QFFLRAIAVI LSVVFGTILF HNWAKVLQYF SPTPFNSTDP LFGKDISFYI FSLPLWELLE LWLMGMFLYG FIAVTLTYLL SADSLSQGIF PGFSPQQQRH LYGMAGLLML MVAFSYWLSR YELVYSPRGV SYGASYTDVV VQLPIYNILC VLGLAIAFYL LWRTIFWRAK SQYRQFVFYG LGVYLFVVVA AGSVLPTVVQ YLIVQPNELQ REQPYIQRTI ALTRQAFSLE TIDARTFNPQ GNLTTAAIQA NDLTIRNIRL WDKRPLLETN RQLQQFRPYY RFPDADIDRY TLEAEAAANR PVNPNQLPAP TERRQVLIAP RELDYSAVPE QAQTWINQHL IYTHGYGFTM SPVNTAGPGG LPEYFVKDIA GSNEGALSTS SEAVRDSIPI GQPRLYYGEI TNTYVMTGTK VRELDYPSGS DNAYNAYDGL GGVIIGNGWR RGLFAMYLKD WQMLFTQDFL PETKVLFRRD VKQRIQAIAP FLKFDSDPYL VAADGSPAFP GQNNYLYWIV DAYTTSDRYP YSDPDNNGIN YIRNSVKVVI DAYNGSVKFY IADATDPIIA TWSAIFPQMF QPLSDMPVTL RSHIRYPLDY FGIQSERLMT YHMTDTQVFY NREDQWQIPN EIYGSESRPV EPYYLITSLP TVPFEEFLLL LPYTPKQRTN LIAWLAARSD GENYGKLLLY NFPKERLVFG PEQIEARINQ DPVISQQISL WNRQGSRAIQ GNLLVIPIEQ SLLYVEPIYL EATQNSLPTL VRVVVAYENR IVMAQTLEQA LQAIFQPEVT PAPAIIRPFE EVTPPG
|
| |