Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ava_1045 |
Symbol | |
ID | 3678597 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | - |
Start bp | 1271148 |
End bp | 1273358 |
Gene Length | 2211 bp |
Protein Length | 736 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 637716381 |
Product | lipopolysaccharide biosynthesis |
Protein accession | YP_321564 |
Protein GI | 75907268 |
COG category | [D] Cell cycle control, cell division, chromosome partitioning [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0489] ATPases involved in chromosome partitioning [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis |
TIGRFAM ID | [TIGR01007] capsular exopolysaccharide family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.0728146 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.010052 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTTCCC ATCAAGACGA TCAAGACCTG ATTAACTTCC AACAGTATTG GCTGATTGCA AAAAGACGTT GGTTACTGAT AGCCACGATT ATGGGATCGG TGTTTGGAGT CACAGGGTTG CTTACTTTTA GTCAAAAACC AATCTATGAA GCAGAGGGTA AACTCCTCTT TAATAAGCAA AGTGGCGTTT CTTCCCTAAC AGGTGTAAGT GAGCAATTGG GACAACTTAG TGGGGTGACC AATCTTTCTA ACCCTTTGGA AACTGAGGCA GAAATTATTC GCTCAAACCC CATAATTCAA AAAACCATTG CTGAGTCGCA GCTTAAAGAT GAACAAGGAC AACCTTTAGA AATAGATGAT TTCCTACGAA GCTTAAAAAT AAAAAGTGTC CGAGGAACTG ATATCTTACA ACTTACCTAT CGTAGTGCTA ATCCTGAAGA AGCAACAGCG ATTATTAATA GATTGATGAA CACTTATCTG GAGAACAATG TTCGCAATAA TCGTTCTGAA GCTACAGCCG CAAGGGAATT TTTAAGCAAG CAATTACCTC TAGTCGAGGC GAAAGTCACA GAAGCAGAAG CAGCACTACG TCGATTCAAA GAAAAATACG AGGTTGTTTC CCTACAAGAA GAAGCTATAC AAGGAGTCAA AAGGCTGAAT GATTTATCCG GCCAAGTGAC TCAACTGCGC GCACAGTTAG TTGATGCTAG AACTCGTTCT GGTGCTTTGC AAAATCAATT AGCATTGAAC ACAAAGCAAG CTATGGCACT GAGCAGCTTA AGCCAATCCA ATGCAGTACA ACAAGTGCTA TCAGAATACC AAAAGGTTCA AGACCAGTTA GCTGTGGAGA GGTCACGATT TACTGAGGAA CACCCGGTTA TTGCTAATTT ATTAAATAAA GAACAAGCTC TCAAAGAGCA GCTAGAAGGA AGAGTGAGCA AAACTTTAGG TAGTTGGCAG CCGATTCCAG AGCAAGATTT ACAAATAGGG GAACTCAAAC AGACTCTAAC CGCTAATTTA GTGCAGGTGG AAGTTGAACG ATTGGGATTA GAAAATCAAG TTGGTGTGTT GATGAAGGCA TTTGTACTTT ATCAAGCACG TCTGAGAGTC TTACCCAAGT TGGAACAACA ACAACTACAG TTACAACGAC AGCTACAGAT TGCCCAAACA ACCTATGAAC AAATGCTAAA ACGATTGCAA GATGTTGAGG TGGTGGAAAA TCAGAATGTG GGTAATGCCA GGATTGTTTC TGAAGCTTTA CTTCCCAAAA CACCAGTTTC TCCTCGGATT GTCTTGAATT TGGCATTAGG GGGATTTTTA GGCTTCTTCT TAGCTATTGG TGCAGCTTTG TTGCTAGAGG CTGGAGACAA GTCTGTGAGG ACACAAGAAG AAGCTCAACA GTTGCTGGAT TATCCTTTGT TGGGTACAAT TCCAGCTTTT GACCAAAAAG CTAGACTGGC TCGTGGTGAG AGTATCACCG AATTACCCGT ACTCAATAAC CCTTACTCTT CAGTTAATGC AGCCTTTGAA ATGCTGCAAA TCAATTTGGG CTTCTCCTTT TCTGATAAGA AACTGAAGGT AATTGTGGTT AGCAGTTGTG TGATGAATGA AGGCAAGTCT TTTATTGCTG CTAATTTGGC GGTAGCGACT GCCCAAATGG GAAGACGAGT ATTATTGATT GATGCAGATA TGCGTCGCCC TCGTCAACAT GAAATGTGGC AACAGCCTAA CTTGATGGGT TTAAGCAATG TTTTAGTGGG TCAGGCTACC CTAGCCGAAG CTGCTAAGGA AGTGGTAATT AATCTAGAAT TACTTACTTC TGGTACCATA CCGCCTAACC CTGCGGCTCT ACTAGACTCA CAACGTATGA ATGGGTTACT CCAACAAGCT GCTGAAGATT ATGACTACGT AATTATTGAC ACTCCACCTT TAAGTGTTTT GGCGGATGCT TCGATCATAG GCAAAATGGC AGATGGAATG TTATTGGTTG CACGTCCTGG TGTGCTTAAT TCTGCTGCGG CTAAGACTAC AAAGACACTC ATTGAGCATT CGCGTGTGTC AGTGTTGGGA ATGGTGGTGA ATTGTGTAGC TACTGATAGT AATGACTACG GTTACTACTA CTCCCATAAA AATACTGGAG ATAATAATTC TGGCAAAAAG GATAGGATTA AGTCAAATTT AAGTAAAATC ACTGGATTGA GGCTGCTTTA A
|
Protein sequence | MPSHQDDQDL INFQQYWLIA KRRWLLIATI MGSVFGVTGL LTFSQKPIYE AEGKLLFNKQ SGVSSLTGVS EQLGQLSGVT NLSNPLETEA EIIRSNPIIQ KTIAESQLKD EQGQPLEIDD FLRSLKIKSV RGTDILQLTY RSANPEEATA IINRLMNTYL ENNVRNNRSE ATAAREFLSK QLPLVEAKVT EAEAALRRFK EKYEVVSLQE EAIQGVKRLN DLSGQVTQLR AQLVDARTRS GALQNQLALN TKQAMALSSL SQSNAVQQVL SEYQKVQDQL AVERSRFTEE HPVIANLLNK EQALKEQLEG RVSKTLGSWQ PIPEQDLQIG ELKQTLTANL VQVEVERLGL ENQVGVLMKA FVLYQARLRV LPKLEQQQLQ LQRQLQIAQT TYEQMLKRLQ DVEVVENQNV GNARIVSEAL LPKTPVSPRI VLNLALGGFL GFFLAIGAAL LLEAGDKSVR TQEEAQQLLD YPLLGTIPAF DQKARLARGE SITELPVLNN PYSSVNAAFE MLQINLGFSF SDKKLKVIVV SSCVMNEGKS FIAANLAVAT AQMGRRVLLI DADMRRPRQH EMWQQPNLMG LSNVLVGQAT LAEAAKEVVI NLELLTSGTI PPNPAALLDS QRMNGLLQQA AEDYDYVIID TPPLSVLADA SIIGKMADGM LLVARPGVLN SAAAKTTKTL IEHSRVSVLG MVVNCVATDS NDYGYYYSHK NTGDNNSGKK DRIKSNLSKI TGLRLL
|
| |