Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ava_C0223 |
Symbol | |
ID | 3678022 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007412 |
Strand | + |
Start bp | 257154 |
End bp | 261182 |
Gene Length | 4029 bp |
Protein Length | 1342 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 637715303 |
Product | hypothetical protein |
Protein accession | YP_320497 |
Protein GI | 75812880 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0122903 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAATTG ACCCAGAAAT CACCAGACAT AAAGAGTGGT TGGGTTTTCT CCAGCCTGTA GGGTTGGTGG TATCTCCCCC AGCGTTGGTG AAGGCGCAGG CGGTGGTCAA CCGGAATGTT GTAGATTTTC AACAATCGTT GCTGGCTGCG GTTGATGAAG ATGGCTTGAT TGCTGATTTT CCCGCTTTTA CTGTTAATGT ATTAGGCTGG GCTGAGAGTG ACCTTGTTAA ACCATCAGCA GAATATGAGA TAGCTTTACC TGATTACAGC GAAATTCTTG CGCCTACCTA TATTGTGCCT GACCCAGATA GCGACAAGCC GCAGATTTTG GTGCAAATAA TTTCTCCTGG TACCGAGTTA GATGTAGTCG CACCAGAATT AACTAAGTCT AGCAGTGGTT GGCACGCCAG CCCCCAAGCT AAATTTGAGC GTTTGTTGCG GGAGACGCAA ATACCCATAG GTTTGTTGTG TAATGGCGGG TTGCTGCGTT TGGTGTATGC GCCACGGGGC GAGTCTTCTG GACATTTGAC CTTTCCAGTG CAGGCGATGT GTGAGGTTTC TGGAAGGCTG ATTTTGGGGG CGATGGAAAT GCTGCTATCG GCATTTCGCG TGTTTAGTGC TACTACAGGT CGTTCTTTAA AAAATTTGTT GGAAGACAGC CGTAAGTACC AAGCGGAAGT TTCTACAACC TTGGCTAACC AAGTGCTGGA TGCTTTGTGG GAACTGCTGC GTGGGTTTCA AATGGCGGAT GCGGCAGTTG ATGGTAAGTT ATTAAGTGAA ATTGCTGCTA CAGATCCGCA ACACATTTAC GGTGGATTAA TCACTACGCT GATGCGGCTG GTGTTTTTGC TTTACGCTGA GGATGAAGGG CTAATGCCGC CAGATGATAT TTACCAGCGT AACTATTCTG TAACTGGCTT GTATGAACGG CTACGTGAAG ATGCAGGAAA CTACCCCGAC ACGATGGATC AGCGTTATGG GGCCTGGGTA TGGCTGTTAA GTTTATTTCG CTTAGTTTAT GATGGGGGCG GACAAACACC AGAGTATTTA CCAGCACGGC ATGGTCAGCT TTTTGACCCA GATGAATATG CGTTTTTGGA AAGTCGCCCA CGCGGTAGTA AGTTTGTGGC GGGTCAAGCA ATCGAGCCTC CGCGAATACC CGATGGTGTG ATTTATCGGT TGTTAGAAAA ATTACTAATT TTAGAAGGGG AACGGCTATC TTATAGATCC TTAGATGTAG AGCAAATTGG CTCGGTATAT GAAGGCATTA TGGGTTTTGC GGTGGAACGG GCAGAAAGTC CCAGTATTGG AGTTTACAGT AAACCCAAGG GTTCAAAGGT TTCTACAACG GTGGTAGTTG ATGTAGCAGC CATTTTGGCA GCGAAATCGG GCGATCGCCA AAAGTTACTG AAGGAGTTAG CAAATTGCGA AGTATCAGGT AATGCCCTCA AGGAACTGAA AGCAGCGCAA TCATTGGAAG ATATAGCGAT CGCACTAGGT CGAAAGGTAT CGCGGCAAAC ACCAAATTTA TTGCCTGTGG GTTCGCTGTA CTTGCAGCCA GGGGAAGAAC GCAGGCGTTC TGGTTCGCAT TATACGCCCA GGTCACTGAC TAAACCAATT GTGGAAACAA CCCTGCGTCC GGTTTTGGAA GCGTTGGGAG AAAAACCGAC TGCGGAACAA ATTTTATCTT TGAAAGTTTG TGATTTGGCG ATGGGTTCTG GTGCGTTTTT GGTAGAAACC TGTCGTCAGT TGGCTGAGAA GGTGGTGGAA GCGTGGGAAC GCGAGGAAAA CGATACCCGC CTGGGAATTA GCGATCACTC CGCCAGCCAA AGGCATCCCA ACCAAAATGC GACAATTAGC AATTATGGCA AAGAAGAACC TTTATTAATT GCGCGTCGCT TGGTGGCGCA ACGGTGTTTA TATGGTGTGG ATAAAAACCC GTTTGCGGTG AATTTGGCGA AGTTATCTTT ATGGTTGGTG ACGCTGGCGA AGGATTTACC GTTTACATTT CTTGATCATG CCCTCAAGTG TGGGGATTCG CTGGTAGGGT TGAGGAAAGA GCAGATTGGG TCTTTTGGGA AGGATGCTAC TGACGATTTA CCGCTATTTA TATATTTAAA AGAGCAACTT GACCGTGCGC GTTCTTATCG GGCGGAAATT CAGGCTTTGG ATACGCGCAG TGATGCTGAT GATGACCAAA AGCGGGATTA TCTATACAAA GTAGAACAAG AATTGTACCA AGCGCGGTTA ACGGGGGATG TAAGAATTGC GGCGTTTTTT GAAGGAAGTA ATAAGAAGCA GCGAGAGGAG AGAGAAACTG AGATTGCAGA ATTAGTAAGA AAGTGGCGAT ATCACCAAGC TGATACTGAG AGTTTGGAGG AAATTGCTAG TAGGTTGCGG AGTGGGGATA AGGGGATTAT TCCATTTAAC TGGGATATTG AGTTTCCAGA GGTTTTTGAT AGAGAAAATC CGGGGTTTGA TGCAATTGTG GGTAATCCTC CGTTTTTGGG TGGAAAGCGT ATTAGTACAG TTTTAGGCGA TGCTTACAAA GATTGGTTGC CTGTTGTAAA TCCTGAATCT AACAGTAATG CTGACTTGGT TGCACACTTT TTCCGTCGTG CTTTTGACCT GTTACGTCAG GGTGGAACTT TTGGATTAGT AGCAACTAAC ACAATTGCCC AAGGAGATAC CAGAAGTAGT GGTTTACGAT ATATCTGTCA GAACCAAGGC ACGATTTACA ATGCTCAAAA ACGCATGAAA TGGCCTGGAC AAGCAGCAGT AGTAGTAAGT GTGGTTTATG TGCTGAAGGG AACGTATAAA GGTATATATT TGCTGAATGG ACGAGAAGTT TCTCTAATTT CTGCGTTCCT GTTTCATGTG GGGACAAATG AAAATCCGGC AGTGTTGTTG GCGAATAGCA ATAAAAGCTT TATTGGCAGT TATGTTTTGG GCATGGGCTT TACTTTTGAT GATACCAACG ACGAAGCAAC ACCTATTACA GAAATGCACC GCTTAATTGA GAAAGATGGA AGAAATGCTG AACGAATATT TCCTTATATT GGAGGTGAAG AGGTTAACAG TAGCCCAACT CACGCGCATC GTCGCTATGT TATCAACTTC GGAGAAATGA GCGAAGATGA AGCGCGGAAG TGGTCAGATT TGATAGAGAT TGTTGAGATA AAAGTAAAGC CTCATCGTGA CACATTAAAG CGAGATGCTT ATCGTAAGCG GTGGTGGCAC TTTGCTGAAA AACAAGCAGC TTTATACAGA GCGATCGCTC CACTTGAAAA AGTGTTAGTT GTCTCCCGAC ACCAGCCAAA TTGGTCAGCA GCATTTATGG GAGCAAATGT TGTTTTTTCT GAAGGCTTGG TAGTTTTGGC TCTTTCACAA TACTCATCAT TTGCTCTCTT GCAATCTCGT ATTCACGAAA TTTGGGTGCG TTTTTTCGGA TCATCCCTAG AAGACAGACT TCGCTACACT CCCACAGACT GCTTTGAAAC CTTCCCCTTC CCCCAAAACT GGGAAACTAA CCCCACCCTA GAAGCCATAG GTCAAGAATA CTACGAATAT CGCGCCGCCT TAATGGTTCG CAACAACCAG GGACTAACCG ACACCTACAA CCGCTTCCAC GACCCAGAAG AACGCGACGC TGATATCCTA AAATTACGCT CACTGCACGC CGCAATGGAT AAAGCCGTAC TCGAAGCTTA CGGCTGGAGT GACATTCCCA CCGATTGCAC CTTCCTGCTA GACTACGACG ATGAGGAAGA CGAAGAAGAA ACCAGCAACG GACGACAACG CAAAAAACCT TGGCGTTACC GTTGGACAGA AGAAGTGCAT GATGAAGTTT TAGCACGCCT ACTCGACCTT AACCAAAAAA GAGCGCAAGC TGAAATTCTC GGCGGTAAAG CAGCACAAAA GAAACCCAAG CCTAAAGCTA CTAAGAAAAA AACAACCAAA ACCAAATCTA AAAAGGTTGG GGAAACTACA CCAATAATAC CGGGATTTGA TGTGGAGACT GGCTTATGA
|
Protein sequence | MAIDPEITRH KEWLGFLQPV GLVVSPPALV KAQAVVNRNV VDFQQSLLAA VDEDGLIADF PAFTVNVLGW AESDLVKPSA EYEIALPDYS EILAPTYIVP DPDSDKPQIL VQIISPGTEL DVVAPELTKS SSGWHASPQA KFERLLRETQ IPIGLLCNGG LLRLVYAPRG ESSGHLTFPV QAMCEVSGRL ILGAMEMLLS AFRVFSATTG RSLKNLLEDS RKYQAEVSTT LANQVLDALW ELLRGFQMAD AAVDGKLLSE IAATDPQHIY GGLITTLMRL VFLLYAEDEG LMPPDDIYQR NYSVTGLYER LREDAGNYPD TMDQRYGAWV WLLSLFRLVY DGGGQTPEYL PARHGQLFDP DEYAFLESRP RGSKFVAGQA IEPPRIPDGV IYRLLEKLLI LEGERLSYRS LDVEQIGSVY EGIMGFAVER AESPSIGVYS KPKGSKVSTT VVVDVAAILA AKSGDRQKLL KELANCEVSG NALKELKAAQ SLEDIAIALG RKVSRQTPNL LPVGSLYLQP GEERRRSGSH YTPRSLTKPI VETTLRPVLE ALGEKPTAEQ ILSLKVCDLA MGSGAFLVET CRQLAEKVVE AWEREENDTR LGISDHSASQ RHPNQNATIS NYGKEEPLLI ARRLVAQRCL YGVDKNPFAV NLAKLSLWLV TLAKDLPFTF LDHALKCGDS LVGLRKEQIG SFGKDATDDL PLFIYLKEQL DRARSYRAEI QALDTRSDAD DDQKRDYLYK VEQELYQARL TGDVRIAAFF EGSNKKQREE RETEIAELVR KWRYHQADTE SLEEIASRLR SGDKGIIPFN WDIEFPEVFD RENPGFDAIV GNPPFLGGKR ISTVLGDAYK DWLPVVNPES NSNADLVAHF FRRAFDLLRQ GGTFGLVATN TIAQGDTRSS GLRYICQNQG TIYNAQKRMK WPGQAAVVVS VVYVLKGTYK GIYLLNGREV SLISAFLFHV GTNENPAVLL ANSNKSFIGS YVLGMGFTFD DTNDEATPIT EMHRLIEKDG RNAERIFPYI GGEEVNSSPT HAHRRYVINF GEMSEDEARK WSDLIEIVEI KVKPHRDTLK RDAYRKRWWH FAEKQAALYR AIAPLEKVLV VSRHQPNWSA AFMGANVVFS EGLVVLALSQ YSSFALLQSR IHEIWVRFFG SSLEDRLRYT PTDCFETFPF PQNWETNPTL EAIGQEYYEY RAALMVRNNQ GLTDTYNRFH DPEERDADIL KLRSLHAAMD KAVLEAYGWS DIPTDCTFLL DYDDEEDEEE TSNGRQRKKP WRYRWTEEVH DEVLARLLDL NQKRAQAEIL GGKAAQKKPK PKATKKKTTK TKSKKVGETT PIIPGFDVET GL
|
| |