Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ava_1034 |
Symbol | |
ID | 3678702 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | + |
Start bp | 1257449 |
End bp | 1259344 |
Gene Length | 1896 bp |
Protein Length | 631 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 637716370 |
Product | glycosyl transferase family protein |
Protein accession | YP_321553 |
Protein GI | 75907257 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.370681 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGTTAA AATTGAGCGT TCTTGGCGCT GTTGAGCGGT GCTTCAATAA TCTAGTAAAG CGTCCAGCCC TTGCTGTGAC TGCCTCAATT CTGTGGTTGA TTCTAATTGG TTGGATAGGC TATGGATGGA ACTTAGGCAG TGTTGGCTTG GTTGACGAAA CAGAGCCATT ATTTGCTGAA GCTTCACGGC AAATGTTAGT TACGGGTGAT TGGATCACGC CGTTTTTCAA TGGTCAAACT CGTTTTGACA AACCTGCTTT AGTTTATTGG TGTCAAGCGA TCGCCTATGC GGTGTTTGGT GTGAATGAGT GGGCAGTGCG TCTTCCCTCA GCGTTAGCAG CAATGGGAGC AGTGTCCCTA GCTTTTTATA CAGTTCATTG GTCTATAACG AAAAAGGATG AGTTAGAACA AGTGACACTG CCAACTCGCC GCTACTTAAC AGCTGGTGTA GCCGCAGGTG TCATGGCACT CAATGCACAA ATGATTGTCT GGGGAAGAAC TGGTGTCTCA GATATGCTCT TAACTGGGTG TATCGCCTCA GCTTTGTTAT GCTTTTTCCT CGGATACGCG GCAATGGAGT CTGGAGAAAG ACAGGAAGCA GGAGATGGGG GAATGCCGAA CCCAAAGGGC AGGAGGAATA AGCGATCGCT ATTCCCTAAC AAGTGGTATT TGGCTTGTTA TGTGCTAACT GCTGGGGCAA TTTTGACTAA AGGCCCTGTG GGTATTGTTT TACCGGGGAT CATTGTCTTG GTATTTTTGC TATATGTGGG GCAGTTGCGA ACAGTACTGC GGGAAATGCG CCTCGTTTTA GGAACAGTTA TTATCTTAGG TTTATCTGTT CCCTGGTATG CCTTGGTGAT TTGGCGTAAT GGTGAGAGTT ATATCAACTC TTTTTTTGGA TATCACAATG TGGAGCGTTT TACGGAAGTA GTTAATGGTC ACTCGGCTCC TTGGTATTTT TACTTTGTGA TAGTGACACT ATTTTTCGCC CCATATTCGG TCTATTTACC TTTAGCACTT TTCAGACTGA AGTTTTGGCA GCGATCGCAC TGGCAAAATC AAGAACGTTC TCAACAGTTG GGTTTATTTG CCTGTATTTG GTTTCTGAGC GTTTTTAGTT TCTTTACGAT CGCTGTTACC AAACTGCCAA GCTACGTCTT ACCTTTAATG CCGGCGGCCG CTATTTTAGT AGCATTGTTA TGGAGTGATT TCTTCCCCAG TGGTGAACAA ACAAACAAGA TAGAGATTAC TTATCCATCT TCTCTTTTAC TAGCCAGTGG CTGGGTAAAT GTTATATTTT TAACCATTGT GGCAGTAGCG TCGTTTCACA CATATCATCT GTTGGGTAAT GATGATGCAG CCCCCAACTT CCGCCAAAAT TTACAAGATT CTGGATTACC GGCGATCGGC GGCTGGCTTT GGCTCGCCGG GGCAATTTTT GTTGCTGTTT TAATATTGCG TCGCTATTGG CATTCTATTA TCGGCGTTAA TATGCTCGGA TTCGCGGCCT TTTTGCTAGT TGTCACCATG CCAGCTTTGT TTTTGATGGA TCAAGAGCGT CAACTACCAT TAAGAGACTT ATCTGCTGTT GTAGCTCAAG TACAACAACC AAAAGAAGAA ATAATGATGG TTGGTTTCAA AAAGCCAAGT GTAGTTTTTT ATAGTCACAA ACAGATAAAT TTTGTCCAGA CAACAGAAGA GGGTGTGGAA TATATCCATA ATTTAGCTAA TCAAGCAGTT AAACCATCTT CCCTATTACT TGTGACTAAC AAAAAAAACT TTTTCAAAAT GGACTTACCG CCAGATAATT ACGAAAATTT AGAAATTCAA GGTGCTTACC AATTGACTCG GATTAATTTC AAGAAGATGA AAACTGAAAA AGTTAAAATT TCCTGA
|
Protein sequence | MRLKLSVLGA VERCFNNLVK RPALAVTASI LWLILIGWIG YGWNLGSVGL VDETEPLFAE ASRQMLVTGD WITPFFNGQT RFDKPALVYW CQAIAYAVFG VNEWAVRLPS ALAAMGAVSL AFYTVHWSIT KKDELEQVTL PTRRYLTAGV AAGVMALNAQ MIVWGRTGVS DMLLTGCIAS ALLCFFLGYA AMESGERQEA GDGGMPNPKG RRNKRSLFPN KWYLACYVLT AGAILTKGPV GIVLPGIIVL VFLLYVGQLR TVLREMRLVL GTVIILGLSV PWYALVIWRN GESYINSFFG YHNVERFTEV VNGHSAPWYF YFVIVTLFFA PYSVYLPLAL FRLKFWQRSH WQNQERSQQL GLFACIWFLS VFSFFTIAVT KLPSYVLPLM PAAAILVALL WSDFFPSGEQ TNKIEITYPS SLLLASGWVN VIFLTIVAVA SFHTYHLLGN DDAAPNFRQN LQDSGLPAIG GWLWLAGAIF VAVLILRRYW HSIIGVNMLG FAAFLLVVTM PALFLMDQER QLPLRDLSAV VAQVQQPKEE IMMVGFKKPS VVFYSHKQIN FVQTTEEGVE YIHNLANQAV KPSSLLLVTN KKNFFKMDLP PDNYENLEIQ GAYQLTRINF KKMKTEKVKI S
|
| |