Gene Information |
Locus tag | Ava_1116 |
Symbol | |
ID | 3678531 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | + |
Start bp | 1360588 |
End bp | 1362813 |
Gene Length | 2226 bp |
Protein Length | 741 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 637716452 |
Product | lipopolysaccharide biosynthesis |
Protein accession | YP_321635 |
Protein GI | 75907339 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis |
TIGRFAM ID | [TIGR01007] capsular exopolysaccharide family |
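
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS length includes the stop codon; the record does not state either convention. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Ava_1116.
length = gene_length(1360588, 1362813)
print(length)                  # compare with "Gene Length"
print(protein_length(length))  # compare with "Protein Length"
```

With these assumptions the computed values agree with the reported Gene Length (2226 bp) and Protein Length (741 aa).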
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0704492 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGTTCAAA CTAATCTAAG TACAAATATA AACAGCGCTG ATTCAGAACC CAGTTACGGC AAAATCTTTT CAGTTTTTCT GCGTAGGTCC CCTTGGTTCT TAACGGCATT TTTAACAACA ATTGCCCTAG CAGCCCTGAT GACTGCAAGA ACCAAGCCTA CTTACAAAAG CACAATGCAA CTGTTAGTAG AACCTAACTA TCAAGGCAAA AGAGAGGGTG GTACTCCAGA GACCCAGTTT CTCGAACCTG ATATACAAAT AGATACTGCT ACACAACTGA ACTTGATGCA AAGTTCAGGA CTAATTCACA AAGCAGTTGA CAAATTGCGA TCAGAATATC CAAATATTAC TGTAGCTGAT ATTAAAAATG CCTTAGTCTT AAATCAAGTT AGGACAAAAG AAGATAATGT CGCTACTAAA ATTTTTCAAG TAGATTATAC TTCTGGCGAC CCTGAACAAA CTCAAAAAGT TCTCGGTGCT ATCAGGCAAG TCTATGTGGA ATATAACAAA CAGCAGCAAG ACAACCGACT CAAAAAGGGT CTGCAAGTTA TTAGAGAACA GTTGAGCAAA GCCAGTGAAG AAGTGAATGC AGCTGAAGCC AACTTACAAA GATTCCGCAG AAACCAAAAT TTAATTAACC CAGAAGAACA AGCAAAAGCT CTAGAAGTTG CTTTAAATAC TATTGAGCAA GAACGTCGTA CTACTCGTTC TACATACGAA GAGGCTTTAG CAAAACAAAA ATCTTTAGAA GAGCAATTAA ATCGTTCTCC CAAAAATGCT TTAGTTACTT CTCGTTTGAG TCAATCCGTC CGCTACCAAG GCTTACTCAA CGAAATTCAA AAAACAGAAC TGTTACTAGC TCAAGAACGT TTACGCTTCA CAGATGAAAC CCCCAGTGTC CAGAAGCTAA AAGAACAGTT GCAAAGCCAA AAAGAATTAT TGCAACAGGA AGTAGGTAGA ACTCTAGGTG TTAAATCTGC TAATGCCTTC CGTGGAGGAG AACCGCTTTT AGAACAGGGA CAGTTTGGGG AAATTGATCT GACTTTGGCT AGTCAATTAG TAGAAACCCA GACAACCATA GTTTCATTGA CTGCCCGTGA TCAAACTCTG GCACAAAAAG AAAATCAACT ACGCCTAGAA ATTAAACGTT TTCCGCCCCT GTTGGCTTAT TACAACCGTA TACAACCACA GTTGCAATTC AGCCGTGAGA GACTAGAGCA GTTATTACGG GCAGAACAAC AACTACGGCA AGAACTGGCT AAAGGGGGAT TTAACTGGGA AGTGGTGGAA GAACCCCAGT TAGGGATGAA ATTAGGCCCA AATCTCCAAC AGAATTTGAT GTTGGGTGCT GTTGTGGGGT TAATGCTGGG AGGTATTGCT GCCTTTATTC GAGAAGCTGC TGATGATTCT GTTCACACCA CAGCTGAGTT AGAAAAACAA GTAGCTTTAC CTTTATTAGG AAGCACCCCG AAATTGCCAC CAGCAAAGGG TAGAGAATCA GTCATCAAGT TGCCTTTTGG CAAGCCTGAG GTTTTAGGGC CTTGGACTGT GGAGGTGCTA CAGTCTTCCA TCCGTTGGGA ATCGTTGGAT CTGATTTACA AAAATATCGA GCTGTTAAAC TCGGTATCTA GCTTAAAGTC CTTGATGATT ACCTCACCTC TACTGGATAG AGGTAAATCT GGTTTGGCCT TGGGTTTAGC TATGAGTGCA GCCCGGTTAC ACAAACGGGT GCTACTGATT GATGCGAATT TGCGAGATCC TAGCCTCCAT GAACATCTCA ATCTTCCCAA CGACCAAGGG 
CTATCAACTT TGTTAGCCAG CGAAATCACT TTACCCGAAC AGATTGGTAT TCATAACGTA GGTTCTGCTT ACATCGATAT TTTGACAGCT GGTCCCTCGC CCAGTGACTC AGCAAATCTC TTGAGTTCTC CACGGATGAA ACAGTTAATG GCAGCATTTG AGGAGAACTA CGATTTGGTG ATTATTGATG CGCCACCAGT TATAGGTCTA GTAGATGCTT TGTTGACAGC ATCATCTTGT AGAAGTGTGG TGATGGTAGC TAGTCTGGGA AAAGTGACTC GCAATCAAAT CGCTCAAGCT ACAGCTATGT TGAGCAAGTT AAACCTGATT GGTGTTGTGG CTAATGGAGT TTCTAACTCT GATAGCACCT ATGTACCTTA TGCGCGGGAA TACCGTTTTG CTTTGCAACA AGCGGTGGAA AAGTAA
|
Protein sequence | MVQTNLSTNI NSADSEPSYG KIFSVFLRRS PWFLTAFLTT IALAALMTAR TKPTYKSTMQ LLVEPNYQGK REGGTPETQF LEPDIQIDTA TQLNLMQSSG LIHKAVDKLR SEYPNITVAD IKNALVLNQV RTKEDNVATK IFQVDYTSGD PEQTQKVLGA IRQVYVEYNK QQQDNRLKKG LQVIREQLSK ASEEVNAAEA NLQRFRRNQN LINPEEQAKA LEVALNTIEQ ERRTTRSTYE EALAKQKSLE EQLNRSPKNA LVTSRLSQSV RYQGLLNEIQ KTELLLAQER LRFTDETPSV QKLKEQLQSQ KELLQQEVGR TLGVKSANAF RGGEPLLEQG QFGEIDLTLA SQLVETQTTI VSLTARDQTL AQKENQLRLE IKRFPPLLAY YNRIQPQLQF SRERLEQLLR AEQQLRQELA KGGFNWEVVE EPQLGMKLGP NLQQNLMLGA VVGLMLGGIA AFIREAADDS VHTTAELEKQ VALPLLGSTP KLPPAKGRES VIKLPFGKPE VLGPWTVEVL QSSIRWESLD LIYKNIELLN SVSSLKSLMI TSPLLDRGKS GLALGLAMSA ARLHKRVLLI DANLRDPSLH EHLNLPNDQG LSTLLASEIT LPEQIGIHNV GSAYIDILTA GPSPSDSANL LSSPRMKQLM AAFEENYDLV IIDAPPVIGL VDALLTASSC RSVVMVASLG KVTRNQIAQA TAMLSKLNLI GVVANGVSNS DSTYVPYARE YRFALQQAVE K
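
The gene sequence begins with GTG, yet the protein sequence begins with M. This is consistent with translation table 11, in which GTG is an accepted alternative initiation codon and is read as methionine when it starts the CDS. The sketch below illustrates this on the first 21 and last 12 bases of the gene sequence. The codon table is deliberately partial (only the codons needed here), the start-codon set is a subset of those allowed by table 11, and `translate` is a hypothetical helper, not a library function.

```python
# Partial codon table: only the codons that occur in the two fragments below.
CODONS = {
    "GTG": "V", "GTT": "V", "CAA": "Q", "ACT": "T", "AAT": "N",
    "CTA": "L", "AGT": "S", "GAA": "E", "AAG": "K", "TAA": "*",
}

# Subset of the initiation codons permitted by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str, first_is_start: bool = True) -> str:
    """Translate a DNA fragment codon by codon, stopping at a stop codon.

    If first_is_start is True and the first codon is a recognized start
    codon, it is translated as M regardless of its usual meaning.
    """
    seq = "".join(dna.split()).upper()  # drop the spaces used for 10-base grouping
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and first_is_start and codon in START_CODONS:
            residues.append("M")
            continue
        aa = CODONS[codon]  # raises KeyError for codons outside the partial table
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 21 bases of the gene sequence, as grouped in the record.
print(translate("GTGGTTCAAA CTAATCTAAG T"))
# Last 12 bases of the gene sequence; here GTG is internal, so it is read as V.
print(translate("GTGGAAAAGTAA", first_is_start=False))
```

The two fragments translate to the first seven residues (MVQTNLS) and the last three residues (VEK) of the protein sequence. For a full translation, a complete table 11 implementation such as the one in a sequence analysis library would be the more appropriate tool.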