Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ava_3121 |
Symbol | |
ID | 3680867 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | + |
Start bp | 3876674 |
End bp | 3878293 |
Gene Length | 1620 bp |
Protein Length | 539 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 637718465 |
Product | glycosyl transferase family protein |
Protein accession | YP_323624 |
Protein GI | 75909328 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGGAAG GAAGCTTTAT TTGGGGTCAT CTGGAAAGAC AGCACCGTGC AGTTGATAAA TGGATTGATC GCTTATGGCT GGTAGTGTTG TTAATTGCAG CATTACTATT ATTCAGCGTT AACTTAGGAG GATTACCCCT GCAAGATTGG GACGAGGGAA CTATAGCCCA AATTGCTAGG GAAATAACAC AAGAATCGGC CAATTCCATA CGTTGGCTAT ATCCCACCCT CAGAGGGCAA CCTTATCACG ATACACCGCC TCTCATGCAC ATAGTGATTG CAGGGGCTTA CCATTTAGGA GGTGTGAATG AATGGACGAC ACGCTTACCT GGGGCAATTT TAACTGCCTT ATCAGTACCA TTACTATATT GTCTTGGTAG AGAGATATTT CGTCAACGCT GGGCAGCGAT TTATAGTGCC TTAATCTACT TAACAATGTT GCCAGTAGTC CGTTATGGGC GCTTGGCAAT GTTAGAAGGG GCTGTGGTGA GTTTCTTGTT GGTGATGATG CTGTGTGTGT TGCGATCGCG TCGAGATTTA CGTTACTGTC TCGGCATAGG TTTAAGTTTA GGATTAATTT GTCTCACCCA AGGACTATCC GGCTTTTTAT TAGGTGCAGT CGTCCTTGTG TTTTTGTTTT GGGATACACC AAGACTACTT ACCAGTTATT ATCTATGGAT AGCGATCGCC ATTGGAATTT TGCCTGTAGC TGGTTGGTAT AGCGCTCAAC TACTACACTA TGGCAATGAT TTTGTCCAGA ATGGCTTACT TCTTCAATCC CTACAACAAG CGGGTATAGT TAACCACAAG AATGCTCAAC CATCTTGGTT TTACATAGTT GAACTCCTCA AGTGGACTTG GCCTTGGTTA ATCTTCTTAC CACAAACAAC CAAGTTACTT TGGGAAAATC GCAATCTTAG CTGGGCAAGA CTCATACTTG TATGGAGCAT AGTTTATTTG CTACTCATTT CTCTCATGAG CGTGAAACTT GCTTGGTACA TATTCCCCAT TTACCCAAGC CTAGCCTTAG CTTTTGGCGC ACAGTTAGCA GAAATAGAAA ACTTACCTTT ATTATCATCC TATCCCCGCG CTTGGGTCGC TGGTTTGTCA ATCTTAGCCG TGGTAGCTTC AGCTGCTAGC ATTCACTTTA GTTGGGGAAT AGCTGCCAAA ACTGACTTAC AGCTAATTTT TGCCGCAGTC GCTTTGACGA TGATTATGGG AGCGATTTTA GCAGAACGAG GCGACGGACA ATTTCTCAAA ATATTGTTTT GGGGTAGTTA TATTTCCCTG CTGCTGTTGA TGAAATCTAA CTACTGGGTT TGGGAATTAT GGCAAGCTTA CCCAGTAAAA CCTGTAGCGG CAATGATAGA GAAAGTAAAT CCAGCTGCAA AAAAGATTTA TACATCTTTT CCCTATCATC GCCCATCATT GGATTTTTAT AGCGATCGCC ATATTATTCC CGCCTCTGCC GAAGAACTAA AACATCATTG GCACTACGAC AGACAACCAT ATCTACTCAT CAGTTCATCA GATTTCCAAC GCCTGCAATT AGACTCCGTT CAGATACTCG ATAAAACTGA AGGCTGGCAA TTAATTACCA AAGAAACCAA TCGATTGTAA
|
Protein sequence | MQEGSFIWGH LERQHRAVDK WIDRLWLVVL LIAALLLFSV NLGGLPLQDW DEGTIAQIAR EITQESANSI RWLYPTLRGQ PYHDTPPLMH IVIAGAYHLG GVNEWTTRLP GAILTALSVP LLYCLGREIF RQRWAAIYSA LIYLTMLPVV RYGRLAMLEG AVVSFLLVMM LCVLRSRRDL RYCLGIGLSL GLICLTQGLS GFLLGAVVLV FLFWDTPRLL TSYYLWIAIA IGILPVAGWY SAQLLHYGND FVQNGLLLQS LQQAGIVNHK NAQPSWFYIV ELLKWTWPWL IFLPQTTKLL WENRNLSWAR LILVWSIVYL LLISLMSVKL AWYIFPIYPS LALAFGAQLA EIENLPLLSS YPRAWVAGLS ILAVVASAAS IHFSWGIAAK TDLQLIFAAV ALTMIMGAIL AERGDGQFLK ILFWGSYISL LLLMKSNYWV WELWQAYPVK PVAAMIEKVN PAAKKIYTSF PYHRPSLDFY SDRHIIPASA EELKHHWHYD RQPYLLISSS DFQRLQLDSV QILDKTEGWQ LITKETNRL
|
| |