Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ava_2908 |
Symbol | |
ID | 3681413 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | - |
Start bp | 3613066 |
End bp | 3615249 |
Gene Length | 2184 bp |
Protein Length | 727 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 637718253 |
Product | lipopolysaccharide biosynthesis |
Protein accession | YP_323414 |
Protein GI | 75909118 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis |
TIGRFAM ID | [TIGR01007] capsular exopolysaccharide family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.0000198552 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACCCCAC CAATTGTTAA GAGCTATATT ATTGCTTTTG AAAAATATAA ATGGATTGGA TTAGCCAGCT TTGCTTTAGT TGTAGCGGGG TCAACAGTGG TGGCTATCCA ACCAGAACCA GCACCTACTT ATACAGCAAG CGGCACACTG ACATATTCTC GACCCCCGGT ATCATTTTCT ACGACAGGTA GCGAAATTCA GCAACAAGGT CAGGAATTAA ACCAGGAAGT TCTGCTATCA GATCAAGTAA TTGATAATGT GTCGGCAAAG GTGAAGATTC CGTCAAGAAG AATTGCTTCC AGCGTAGCGA TTACACCGCC GAGAAGAAAC TCACGTACTG GAGAATTAGA ATCTAATGTC ATTTCTCTGG CATATAAAGA TAGTGATCCC AAGCGGGCGC AGGAAATATT GCTGGAGCTG ATGCAGGCAA TGGTCAAGCT CAGTAGTGAT ATTAATACTG GGCGACTAAA AGCTATTATT GGCAAAATCA ATGAACGGAT ACCACAAGCA AGGTCTGAGC TACAAGCAGC CGAAAAAAGA TTGGAACAGT ACGATCGCCG GGAAAGACCT GCTATATTAG CCGCAGAAAA TGGTAGTTTA CTTGGTGCTG TTACTGGTAG CCAAAATCAG CAACGGGTAA TTCGCCAAAC TATTACGGGA ATTGAGGCGC AGATTTTCAG CTTGCAAGAT AAGTTGGGTT TAGATGTCGG TCAATCTTAT GTTTCTTCGG CGTTGAGTGC TGACCCAATT ATCGCCAACT TGCGATCGCA AATTTATCAG ACAGAATCAC AAATCGCTTT ACTTCGTAAA GATTTGCGCC CAGAACACCC AAATATGATT CAGTTGCAGC GTCAAAAACA AGCATCTGAA GAATTACTCC AACAAAGGGC GGCGGAAGTA ATTGGCGGTG GTGGTACAGC AGCACCCTTA GCAACAAATA TTAGTGGTAT CCGCGCCCAA AGTAGTTTAG ATCCAGCCCG ACAGCAGCTA GCTAACCAGA TGGTAGCTTT GCAAACGCAA AAAGAAACCC TACAACAACA ATTAGCCCAG CAAATACGAG ATGAAATCCG GCTGCGGCAA GAATATTCCC AAATACCCAA CAAACAATTA GAGCGATCGC GCTTAGAACA AGCAGTAGGG CTGAAGAAAG CAGTTTATGA TCAAATGCAA GCCAAGCTCA CCGATGCTCA AACAGCCGAA GCAGAGACAG TAAGCAGCTT TAGCATTGCT CAACCCCCCG TGGTAGCGGC TGATGCCAAA AAACCCAAAA GTGTACCTTT AACCTTGGGG GTAGGTGGTT TCTTAGGATT GATAGTCGGT GGTGGGGTGA TTTTCTTACT AGGTTCCTTG GAAGGAACAC TCCGCACCAG AGAAGCTATC AGAGACAGCC TCAAACAACG GGATGTGGCA ATGTTGGGAG AAATACCTGT ATTGCCAGTG GATGATTTAC CACCGGAAGC TCTACCTGTC ATCCTTTCTC TAGATTCTTT GTATTTAGAG TTTTATGAGA AATTACGCAG TAACTTAAGG CGCATTGGTG GTAGAAACTT AAAAGTGGTT TTGGTAACTA GTACTAGTAG CCAGGAAGGT AAAACAACCA GCGCTTATAA CCTGGGTATA GCCTCCGCCC GCGCTGGCAA AAGAACCTTG ATCATTGAGA CAGATTTGCG ATCGCCTTCA CGTTCCACCT CTTTGAGAGT TTCTCCCGAC GAGGATGCCA CACTTGAACC CCTGCGTTAT TATGGCAGCT TAAGTGAATG TATTCGCTTA GTCCCTGAAG TCGAAAATTT ATACATCATC CCTAGCCCTG GGCCTGTGCG TCAATCTGCC GCTATTTTAG AATCTAGCGA AATGCGGCGG CTCATGGAAG ATGTGAGAGA ACGCTATGAT TTAGTTATTT TAGATACTAG TCCGTTAAGT GTATCTAATG ACCCCTTGTT AATTCAACCC TATAGTGATG GCATAGTACT CGTATCACGA GTTAACTATA CACAAGACAG TATGATGGCT GAAGCCATTG ATCAACTAGT GGAAGCAGAA CTAGGACTAG TGGGAGTTAT TATCAACGGG GCTGATATCA CCGTTTCTTT ACCACCCTCG GCTGAATTCT CCGATTCAGT ACCTGGGGAA GAACGGACAA GAGATGAAGA GTCAGAAATT TCTGTTGGTG TAAGCAATAA CTAG
|
Protein sequence | MTPPIVKSYI IAFEKYKWIG LASFALVVAG STVVAIQPEP APTYTASGTL TYSRPPVSFS TTGSEIQQQG QELNQEVLLS DQVIDNVSAK VKIPSRRIAS SVAITPPRRN SRTGELESNV ISLAYKDSDP KRAQEILLEL MQAMVKLSSD INTGRLKAII GKINERIPQA RSELQAAEKR LEQYDRRERP AILAAENGSL LGAVTGSQNQ QRVIRQTITG IEAQIFSLQD KLGLDVGQSY VSSALSADPI IANLRSQIYQ TESQIALLRK DLRPEHPNMI QLQRQKQASE ELLQQRAAEV IGGGGTAAPL ATNISGIRAQ SSLDPARQQL ANQMVALQTQ KETLQQQLAQ QIRDEIRLRQ EYSQIPNKQL ERSRLEQAVG LKKAVYDQMQ AKLTDAQTAE AETVSSFSIA QPPVVAADAK KPKSVPLTLG VGGFLGLIVG GGVIFLLGSL EGTLRTREAI RDSLKQRDVA MLGEIPVLPV DDLPPEALPV ILSLDSLYLE FYEKLRSNLR RIGGRNLKVV LVTSTSSQEG KTTSAYNLGI ASARAGKRTL IIETDLRSPS RSTSLRVSPD EDATLEPLRY YGSLSECIRL VPEVENLYII PSPGPVRQSA AILESSEMRR LMEDVRERYD LVILDTSPLS VSNDPLLIQP YSDGIVLVSR VNYTQDSMMA EAIDQLVEAE LGLVGVIING ADITVSLPPS AEFSDSVPGE ERTRDEESEI SVGVSNN
|
| |