Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ava_2357 |
Symbol | |
ID | 3683414 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | + |
Start bp | 2931934 |
End bp | 2937654 |
Gene Length | 5721 bp |
Protein Length | 1906 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 637717702 |
Product | Alpha-2-macroglobulin-like |
Protein accession | YP_322870 |
Protein GI | 75908574 |
COG category | [R] General function prediction only |
COG ID | [COG2373] Large extracellular alpha-helical protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0821687 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTATTA GAATCTGTAT ACGGTGTTTT CTTGTTCTAA CCTTGGTGTT AGGAACAGGC GGATGTAATT TTTTTGGTAT TAACTCAGCT AGAGAACCAT TACCAGCAGT CTCTCCACTC ACACCGCCAA AATTACCGGA TTGGATTGAA CAAATTAGTC CCATCGGACA AGCCCAACCT CTGAACCAAA TCCGTATTCG TTTTAAAGAG GCTTTAATAC CAGTTGAAAG TTTGGATAGT CCAGAACAAC AGCAGTTATT ACAAAAGTTT GCCCTTTGGC CGCCATTACC GGGACAGTTT CGCTTTTTGA CACCGCGCAT GGTGGGTTTT CAAGCTGATA AAGCATTGCC AATAGCCACG AGATTGCAAG TTACTCTTAA AGCAGGCTTA GCAGATTTAA AAAATCATCG TTTAAACAAA GATTTATCCT GGACTTTTAA TACTCCATCT ATCGATTTAA CGAATTTACC TGGTGTGAAT CCAATGGAAA AAGCTGATGC TGAACCCATT GATTTGCAAC CGAAGTTACA ATTTACATCT AATGTAGAAC TAGATTTAGC TTCTGTACAA GAACATTTAC AGCTAATCCC AGAAGGTAAA AATGAGGGTT TGCACTTCCA GGTAACTTTG AATAAGGAAG AAAATCCTCT AAATAATGAA GAACCTCTCA AGAAATTTGA CCCTTCAGCA CGTAATTGGA TTTATAATCT TAGACCTCAA AAAAATCTGG AAAAAGCCAC CAGTTATCGT TTGGTATTTT CTCCGGGAAT ACGTCCTGCC TATGGCAACC TAGCTACAGA AAAAGAATTT GCTAGTAAGT TATCTACTTA TTCACCTTTG GCTTTTCAAA AAATTAACTT TTATGGACAA CCAGATGCTG GGGGAACTTA TGGAAGATTT ATTAAAGGTA GTCCGCAGTT AGAATTTAAT AATATTTTGG TAGCCGATTC AGCTAAAGCT AATATTCAAA TTAGCCCAGC ACCAAAAGAT ATTTCTAGGC TGTTACAAGT TAATGATGAA GATAGAATTA TCGGGATTAA TCCTTATGCC TTAGAACCTG CCAAGACTTA TACAATTACT ATTGGCGAAA ATCTCCAGGA TAAGTTTGGA CAGACTTTAG GTAAACCTGT CTCACTTAAA TATGATACTG GAGATTTAGC TGGGGATATC TGGGTTCCCT CAGACTTGAA TATTTTCCCT TCAGGTAAAG ATTTACGGTT AAATATTAGT ACTGTAAATT TACCAGAATC TAAATATAAG GCTGCTTATC GAGTAGTTAA ACCAACAGAT TTAGTCTATT TTAACTATGG TAATGATTTA TTAACGAAAC CTGCTGAATG GCAAAGCTTC CAGGTATCAG GTAAGAAAAA TCAATCAGTT GATATCACTG TTCCTCTACG AGAAAGAATC AACGCTAAAA CGGGAATGTT AGCTTATGGA GTACAAGCCC GTACTAATAA ATATCAGGAA AATGGTAAGG ATTTGTGGAG AGAACCTACG ACTTATGGTT TGGTGGAATT GACTAATTTG GGTGTATTTA GTCAATGGTT TCCTGAATCG GGGTTAATTC GGGTGAATCA TTTAACAGAT GGTGCGCCAG TAAAAGCGGC TGTTATAGAA ATTTATCAAT CAAAATTACA AGCAAAATCT CGCCCTGAAC CTGTACCTTG TGCGACAGGG AAAACTGATG AAAATGGAAC TTTTAGAATT AATCGTGCAG AATTACAGCA ATGTACTGCT GGTAGCCAAA ATTCAATTAA ATCACCAGAA TTATTAGTAA TTGCCCGTGA AAATGAAGAT TGGGCATTTA CTAGAACTGA TGAATATAGC GGTGTTTATG GATATGGTAT TGATGCAGGT TGGCAAGGTA ATAAACCTGA ATCACGGGGA GTAATCTTTT CAGATAGACA GTTGTATCAA CAAGGAGAAA AAGCTTGGTT TACTGGTTTT GCTGACTACT TACAAAATGG TGTAATCCAA CAAGATAAAA ATGCTGACTA CCAAATAACA TTGGTAAATC CTGATGGACA AAAGACCAGT TTAGGTACAC AAACTACAAA TGAGTTTGGG ACGTTTTCCT TAGAAATGCC CATCAATAAA ACTCAAAGCT TAGGCTACTA TACAATTCAA GGTAAAGGTA AGAACGGACA GGAAATTTCT GGAGAATTTC GGGTGGCTGA GTTCAAACCA CCTAATTTTA AAGTCGAAGT CAAGTTAGAT AAAGAATTTG CTTACATTGG TGATGATGTT GATATTAATG CTTCCAGTAA TTATTTATTT GGTGCGCCTG TAGAAGGTGG AGAAGCGAAA TATTTTATTA CTCGTCAACA AGCTAACTTC ATCCCTAAAG GTTGGGAAGA GTTTACTTTT GGTCGCCAAT GGTTTTGGCC GGAGGAAACC CCCACCATAT CTAGTGATGT GTTGCAAAGT AATTCTCAGT TAAATACGAA TGGTAAAAGT AGCCAAACGG TAAAGGTAGC TAAAGATTTA CCGTATCCTA TGACTTATCG GGTAGATGTA CAAGTTGCCG ATGTCTCTAA TCTTTCGGTA GCTAATTCCC AAAGTTTTAC AGCCTTACCA AGTAATCGTT TAATTGGCTT GAAAAGTAAT TTTATTGCTG ATGCAGGTAA AGCATTTCCT ATAGAAGTAG TTGTTACTAA GCCTACAGGA GAAGTAATTG CAGGTCAACG AGTCCGGCTG GAATTGCAAC AGATAAAATA CAGCAGCGTC ACTCAATTGG TAGAAGGTAG CGAAACACCA AAAAACCAAG TTGAATATAA AACAGTTGCC CAAACAGAAA TTACATCTAC TAGTAATTCG CAATCGGTGA ATTTGACCCC AACTGAATCT GGTGCATATC GAATTAGAGT TAATTTTAGT GATGCCAAAA ATGAATTAAG TGCCACAGAT TCACAAATTT GGGTGACTGG AGGAAACGCA GTCTTTTGGG GTACGCGAGA TAAAGATGTT TTAGAAGTTA AGTTAGATAA AAAAGAGTAT AAAGCTGGTG AAACTGCTAC CGCTTTAATT CAATCTCCCT ATGCAGATGC AGAATTATAC TTTGCGGTGA TTAAAGATAA ACCCATTTAT CAACAAATTA CCAAAGTTCA AGGCAACGCA CCACAAATTC AGTTTCAAGT TACGCCAGAA ATGCTACCAA ATGCAGCCGT TGAAGCTGTG TTAGTTCGAC AAGGTAAACC TATTAGTCAG GTAGAAGTAG GAAGTTTAGA TAACTTGGTA AAAATTGGCT TTACTCCTTT TAAAGTTAAC CTAGAAGATA AGTATTTAAA ACTGCAAGTT AAACCAGTCC AAACATCTTT AGAACCTGGT GCAGAAGAAA CAATCCAACT GGAATTGAAG GATAATCAAG GCAATCCCAC CAAAGGACAG TTTACAGTCA TGGTGATAAA TGAGGCGGTA CTGCAACTTT CTGGTTATCG TCCGCCGAAT TTAGTAGATA CAGTTTATGC AGAACAGCCA ATATCTACCC GCTTTACTGA TAACCGTCCA GATGTGATAT TACAACCGCA AGATATAGCA AAACCCAAAG GCTGGGGTTA CGGCGGTGGT TTCTCCACAG GTGCAGCAAA TACTCGCACC CGCACCAACT TTCAACCCTT AGCTTACTAC AATGGTTCTG TATTGACCGA TGCTAACGGT AACGCACAGA TAACCTTCAA ATTACCAGAT GATTTAACTA CGTGGCGTGT GATGGCTGTG GCTACAGATG GAAACCTGCG TTTTGGCAAT GGGGACGCGA CATTTATCAC CACCAAACCC TTACTAACTA ATGCCATCTT GCCACAATTT GTCCGTCCAG GCGATCGCAT CCTCGCTGGT TTATCCGTCA CTAATAACAC CGGAAATCGA GGGAATCTCT CAATTAACGG TGAACTTAGC GGGACTGTGA AGTTTAACAG CAAAAATCCC ACAACTACTA CGTTGCAAAC CCAAGCTGAA TCTGCAACTC AAGCCTATCG CTTCCCAATG GTGGCGGATA GTGTGGGATT TGGTAAAGTT CGCTTCACCA CTCAGCTAAA TGGTACAGCC GATGCTTTTG AATTACCTCT GCAAGTAAAA CCACTGGAAA TCACCGAACA AGTAGTTGAG ACTGGTGTCA GCCAAAAACA AATCAAAATT CCCTTAAATG TTGATAAAAA TATCTTTCCC GAAGCCGGCG GTTTAGATAT TCAATTAGCG AGTACTTTGA TTCCCGAAAT TAAAGCACCA GCAAAAGAAG TATTAACAGA TAATGATTTG CCATTCACAG AACCATCTGC AAGTCAATTA ATCATTGCGA CAAATTTACA AACTCTTGCC CAAAAATATG GTCAAACATT TGCAGAATTT AATTCTAGCC AACAGGCAAA TTTAGCAGTT GAAAAATTGC GAAAACTACA AATCTCTGAT GGTGGTTTTG CGGCTTTCCC TGGACAGGAA AAATCAGACC CTTGGGTTTC TAGTTATGCG GCTGAATCTT TAGTAAAAGC TAGTCAAGTC TTCCCCGACT TGGTTGACTC AGGAATGCTA TCTCGCCTCA AAACCTATTT GCAAAAAGTT CTGGCAAACC CCGGAGAATA TGATTTTTGC AAACAGCAAC TATGTAAAAG GCAACTACAA CTTAATGCTT TAATAGCCCT AGCAGAACTG GGAGATAAAC GCAACACATT TCTAACAGAT ATTTATGAAC AAAGTAACAA ATTTGATTTA GTCACTCAAA TTAAACTAGC GCGATACTTA TCTCAATTCC CCGAATGGCA AGATGAATCT CAGCAATTGC TAAACAAGCT GCAACAAAAC ATCTATGAAA CTGGACGCAC AGCAGTTGTG AGTTTACCAC CTAGTTGGGG ATGGATGAGT TCACCAACCG CAGTACAAGC CCAAGCTTTA CGCTTATTTA TCGCCCAACA AAGCCAACCA AAACTAATAG ATAAATTACT CCAAAGTTTA CTTGCATTAC GCCGGGATGG AACATGGCAA ACTGACTATA ACAATGCCCA AGCACTAACA GCCTTAGTAG AATATAGCCA ACTACAACCC ACACCACCTA ATTTTGTCGC CACAGTGCAG TTAGCCGGTA AGAAGTTAGG AGAAAATCGC TTTGCAGGCT ATAAAAATCC CAGCCTCCAG CTAAATGTAC CGATGAATCA ACTACCCCGT GGTCGCCATG ATTTAACGCT ACAAAAATCG GGTAATGGAA CTCTACACTA CTTGGTTGCT TATAACTATC GCCTGCAAGG AAATCAACCA GGACGCTTTA ACGGCTTAAG CATAACACGA GAAATAAGTC AAGTAAATGC AGAGAAAGTT TTACGAAAAA CAGGTATTTA CGCCCTCGAT CAACCCTTGA CTTTAGCTCC CGGACAAGTG TTTGATATTG GTTTAGAAAT TATCGCCGAT CGCCCGGTAG ATCATCTAGT AATTAAAGAT CCCCTACCAG CAGCTTTAGA AGCCGTTGAC GCGAGTTTCC AAACCACCAC CGCCGCATTA CAAGCAAAAG CCGATAGTTG GGAACTGGGT TTTAGGAATA TTTATAGCGA TCGCATTATC GCCTATGCCG ACCACCTAGA ACCAGGAGTT TACAGCCTCC ATTATTTGGT ACGTTCTGTT ACCCCTGGGA CTTTTTCCTG GCCTGGTGCG GAAGTTCACC TGCAATATGC ACCAGAAGAA TTTGGACGCA CTGCGGAAAT GAAACTAATA GTAGAGGAGA CAGAAAAGTA A
|
Protein sequence | MIIRICIRCF LVLTLVLGTG GCNFFGINSA REPLPAVSPL TPPKLPDWIE QISPIGQAQP LNQIRIRFKE ALIPVESLDS PEQQQLLQKF ALWPPLPGQF RFLTPRMVGF QADKALPIAT RLQVTLKAGL ADLKNHRLNK DLSWTFNTPS IDLTNLPGVN PMEKADAEPI DLQPKLQFTS NVELDLASVQ EHLQLIPEGK NEGLHFQVTL NKEENPLNNE EPLKKFDPSA RNWIYNLRPQ KNLEKATSYR LVFSPGIRPA YGNLATEKEF ASKLSTYSPL AFQKINFYGQ PDAGGTYGRF IKGSPQLEFN NILVADSAKA NIQISPAPKD ISRLLQVNDE DRIIGINPYA LEPAKTYTIT IGENLQDKFG QTLGKPVSLK YDTGDLAGDI WVPSDLNIFP SGKDLRLNIS TVNLPESKYK AAYRVVKPTD LVYFNYGNDL LTKPAEWQSF QVSGKKNQSV DITVPLRERI NAKTGMLAYG VQARTNKYQE NGKDLWREPT TYGLVELTNL GVFSQWFPES GLIRVNHLTD GAPVKAAVIE IYQSKLQAKS RPEPVPCATG KTDENGTFRI NRAELQQCTA GSQNSIKSPE LLVIARENED WAFTRTDEYS GVYGYGIDAG WQGNKPESRG VIFSDRQLYQ QGEKAWFTGF ADYLQNGVIQ QDKNADYQIT LVNPDGQKTS LGTQTTNEFG TFSLEMPINK TQSLGYYTIQ GKGKNGQEIS GEFRVAEFKP PNFKVEVKLD KEFAYIGDDV DINASSNYLF GAPVEGGEAK YFITRQQANF IPKGWEEFTF GRQWFWPEET PTISSDVLQS NSQLNTNGKS SQTVKVAKDL PYPMTYRVDV QVADVSNLSV ANSQSFTALP SNRLIGLKSN FIADAGKAFP IEVVVTKPTG EVIAGQRVRL ELQQIKYSSV TQLVEGSETP KNQVEYKTVA QTEITSTSNS QSVNLTPTES GAYRIRVNFS DAKNELSATD SQIWVTGGNA VFWGTRDKDV LEVKLDKKEY KAGETATALI QSPYADAELY FAVIKDKPIY QQITKVQGNA PQIQFQVTPE MLPNAAVEAV LVRQGKPISQ VEVGSLDNLV KIGFTPFKVN LEDKYLKLQV KPVQTSLEPG AEETIQLELK DNQGNPTKGQ FTVMVINEAV LQLSGYRPPN LVDTVYAEQP ISTRFTDNRP DVILQPQDIA KPKGWGYGGG FSTGAANTRT RTNFQPLAYY NGSVLTDANG NAQITFKLPD DLTTWRVMAV ATDGNLRFGN GDATFITTKP LLTNAILPQF VRPGDRILAG LSVTNNTGNR GNLSINGELS GTVKFNSKNP TTTTLQTQAE SATQAYRFPM VADSVGFGKV RFTTQLNGTA DAFELPLQVK PLEITEQVVE TGVSQKQIKI PLNVDKNIFP EAGGLDIQLA STLIPEIKAP AKEVLTDNDL PFTEPSASQL IIATNLQTLA QKYGQTFAEF NSSQQANLAV EKLRKLQISD GGFAAFPGQE KSDPWVSSYA AESLVKASQV FPDLVDSGML SRLKTYLQKV LANPGEYDFC KQQLCKRQLQ LNALIALAEL GDKRNTFLTD IYEQSNKFDL VTQIKLARYL SQFPEWQDES QQLLNKLQQN IYETGRTAVV SLPPSWGWMS SPTAVQAQAL RLFIAQQSQP KLIDKLLQSL LALRRDGTWQ TDYNNAQALT ALVEYSQLQP TPPNFVATVQ LAGKKLGENR FAGYKNPSLQ LNVPMNQLPR GRHDLTLQKS GNGTLHYLVA YNYRLQGNQP GRFNGLSITR EISQVNAEKV LRKTGIYALD QPLTLAPGQV FDIGLEIIAD RPVDHLVIKD PLPAALEAVD ASFQTTTAAL QAKADSWELG FRNIYSDRII AYADHLEPGV YSLHYLVRSV TPGTFSWPGA EVHLQYAPEE FGRTAEMKLI VEETEK
|
| |