Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ava_4140 |
Symbol | |
ID | 3681216 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | - |
Start bp | 5161596 |
End bp | 5163398 |
Gene Length | 1803 bp |
Protein Length | 600 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 637719486 |
Product | S-layer region-like |
Protein accession | YP_324634 |
Protein GI | 75910338 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0020254 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0305803 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTAAAA TTTTATGGAA TTGTGGGCTA GTTGTCCCAG CCCTTTTTAA TGTGCTTTTC ATTCTTTCTT CAGGTGCAAT GGCAGAAGCC CCACCGCAGG CGATCGCACC GGAAACACTG AATCTATCTG AATCAAAATC AGGGATTGAG GAAACTTCGT CAAATAGTGT AACTCAGGCT GAATCAAATC AGATTGCCAA TGTAGAGGTA ATTGGCGCGC CTGATTACTT ACTATCTCAA ACAGAAAATC TATTTTCTCA AGAAGATATC ACAGATGAAG AAAACCCCAT AGAACAAGTT ACATCTGTTT CTCAGTTATC TGATATTCAG CCTACAGATT GGGCATTTCA AGCTTTACAA TCCCTGGTGG AACGCTATGG AGCGATCGCA GGTTATACAG ATGGTACATT CAAAGGCGAT CGCGCTCTTA CCCGCTATGA ATTTGCGGCT GGTCTAAATA CTGCTTTAGA CCGGTTCAAT CAAATCATTG CCAATTCTAC AGCAGACTTA GTTAGACAAG AAGACTTAGA CACAATCAAG AAGTTAAGAG AAGAATTTTC TACGGAATTG GCTGCACTGC GAGGAAGACT TGATACAGTA GATGCAAAGT TAGAAACCAT CGAGAAACAA CAATTTTCTA CTACAGTTAA ACTCACAGGT CGAGCGCAGA TAGTTATTGG CTCACTTTTT GCTGGTAATA ACGTTATCAC TGGGCGGCCA GCACCCCGCG TAGTAACACA GCAAGGATCA GTGTCTTTGC GATTAAATGC CAGCTTTACA GGTAAAGATT CACTGGGTAT AACGCTGGGG GGAGGAAATA TTCAATCATT AGGACAAACA AGAGCTGGAT TATTAGGCAC TTTTGATGGC AGAACTGCTG ATAACTCCAG TATTACCAGA CCACCCAATG ATATTTCTGT TAGTGGTGTA CGTTATCGGT TTCCTTTTGG TTCAAATACC CAAGTCAACA TTTATGCTTT ATCCGATGGA GCTAATGAGC TAGGTTTTAC CGTTCCGATT AATCCATACT TTGAAAGTAG TCTAGCAACT GGTTCTAATG GGATTTCCCG ATTCTCACGA CGAGCTTTAG TCTATCAATA TGGAGATGCT GGCGGTGGAA TAGCAGTACT CCACAGATTA AATCAACAGT TCCAATTGGG AGTAGCTTAT AGCGCACCTA ACGCCAATAA CCCTGGCCCC AATACCGGCT TCTTCACAGG CCGATATTTA GCTTTAGGAC AGATACTATA CACCAGTCCT CAGAGGAATT TTCGGGCGGC TCTAACTTAC GTTAATACTT ATAGTCCACC AAACGCCCAA GGTTTAAGTG GAACAAACTT TGGCCCAGCA GCAGGAAGTA ACTTGGTCAA TAGCACCGTA GCAGGAACGG GGACAGTAGC AAATCTTTAC GGCGTACAAG CTTTTTATCA ATTTAGTCCC AAGTTTGCTA TGAATGGTTG GGTAAGTTAT GGCGCACACC GCTATTTAGG ACGCGGTGAT GGCCGAGCTA TGGATTGGGC TGTAGGAATG TCGTTCCCGG ATCTTGGAAA AAAGGGAAGT CTAGGGGGAT TGTTTGTGGG TATGGCTCCA ACACTGATCA GTCTTGGCAA AAATGTGAAT TTGGGAGCAG GCTTAGGACA AGCAGACAAA GACCTTTCCC TACATATTGA AGGATTCTAC CAATACAAAA TTAACGATAA AATCGACATT ACACCAGGTT TTATTTGGGT TACAGCGCCA GATTCCAATG CCAACAATCC TGATAGTGTA TATGCTTGGA TTCGTACTAC CTATAGGTTT TAG
|
Protein sequence | MFKILWNCGL VVPALFNVLF ILSSGAMAEA PPQAIAPETL NLSESKSGIE ETSSNSVTQA ESNQIANVEV IGAPDYLLSQ TENLFSQEDI TDEENPIEQV TSVSQLSDIQ PTDWAFQALQ SLVERYGAIA GYTDGTFKGD RALTRYEFAA GLNTALDRFN QIIANSTADL VRQEDLDTIK KLREEFSTEL AALRGRLDTV DAKLETIEKQ QFSTTVKLTG RAQIVIGSLF AGNNVITGRP APRVVTQQGS VSLRLNASFT GKDSLGITLG GGNIQSLGQT RAGLLGTFDG RTADNSSITR PPNDISVSGV RYRFPFGSNT QVNIYALSDG ANELGFTVPI NPYFESSLAT GSNGISRFSR RALVYQYGDA GGGIAVLHRL NQQFQLGVAY SAPNANNPGP NTGFFTGRYL ALGQILYTSP QRNFRAALTY VNTYSPPNAQ GLSGTNFGPA AGSNLVNSTV AGTGTVANLY GVQAFYQFSP KFAMNGWVSY GAHRYLGRGD GRAMDWAVGM SFPDLGKKGS LGGLFVGMAP TLISLGKNVN LGAGLGQADK DLSLHIEGFY QYKINDKIDI TPGFIWVTAP DSNANNPDSV YAWIRTTYRF
|
| |