Gene Information |
Locus tag | Ava_4186 |
Symbol | |
ID | 3680989 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | + |
Start bp | 5243542 |
End bp | 5245386 |
Gene Length | 1845 bp |
Protein Length | 614 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 637719533 |
Product | glucose-methanol-choline oxidoreductase |
Protein accession | YP_324680 |
Protein GI | 75910384 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2303] Choline dehydrogenase and related flavoproteins |
TIGRFAM ID | |
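
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a complete CDS ending in a single stop codon. Neither convention is stated in the record itself, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 5243542
END_BP = 5245386
GENE_LENGTH_BP = 1845
PROTEIN_LENGTH_AA = 614


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Compare the computed values with the fields reported in the record.
    print(gene_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, both comparisons agree with the reported Gene Length and Protein Length fields.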
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.00132881 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGCCTCA GTCAATTGGT CAATGTTGAT TCTGACACAG TATCTAAAAC CGTTTATGAT GCTGTCATAG TTGGAACAGG GGTAGCTGGA GCCATTGTTG CCAAAGAATT GAGTCAACAG GGCAAGAAGG TTTTAATCAT TGAAGCTACA GTACACAAAG ATTTAACTCT TGCAGGCTTT CAAAGTTATG TTGATACCTT CTACAAAGCG GTGGATAAAA ATCCTAACTC TCCCTATCCC GCTAACGCTA ATGTTCCCAG CCCCACTGAT TACAACGACT ATTTTATAGA GAAGGGACCC ATGCCTCTTG CGGGTTCCTA CACCAGGGTT CTGGGCGGAA CAACCATGCA CTGGGAAGCA AAAACCCCAC GGATGTTACC GGAAGATTTC AAGTTATCCA GTACCTACGG ACAAGGTCTT GATTGGCCGA TTAGCTATCA CGACCTTGAG CCGTATTACC GCAAAGCCGA GCATGAAATG GGCGTTTGTG GAGACGTAGA AGAGCAGAAA AAACTAGGCC TTGAATTTCC CCAAGACTAT GTTTTTCCCA TGGAAAAGCT CCCCCCCTCT TACTTGGATC AAAAAGTCAT TGAGAAAGTC GATGGCACCA AGGTTGAACT CTATGGCAAG ACTCACACAA TCAGCTTTGC TACATTTCCT CAAGCTCGTA ATGGAGTACC TAACCCTAAA TATGATCAAG GTAACCTCTT TGTTCCCGAT GGAGTTACTA CTGTCCATCC TGTTCAATAT GGTGAACGTT GCCAAGGAAA TGCTAACTGT GTTCCTATCT GTCCAGTTCA GGCAAAATAT GATGCTCGCA GAACTTTAAG TAAAGCCTTT GAAACAGGTA GAGTTCATGT ACTCTACCAA GCAGTGGCTT ATAAGCTGGA ATACGATCGC CAAAATGGTC GCATCACTGC CATTCACTAC AAAAACTATA AAAAACCCAA CTCTAGCGAT TACACCACAG GAATCGCTAA GGGAACCGTA TTTGTCCTAG CGACTAATGC AGTCGAAAAC GCTCGGCTAT TGCTCGGCTC TGACTTGCCT AACACCAGTG GATTGATAGG ACGCTATTTA ATGGATCACC CCTTCACCTT AGCTTGGGCT TTGATGCCTG AAGTCACTGG TACTATGCGG GGGCCTCTTG TAACATCAGG AATTGGCACA TTCCGTCAAG GTGATTTTCG TAAGAAACAA TCCGCCTTTG CCGTAGACAT TCACAACGAT GGTTGGGGAT GGGCTACCGG ATCACCTAAG AGTGAAGTAG AAGACGCGAT CGATAACAAA AACAAATATG GACAAGAACT GCGTCAGACA TTGATTAGTC GAATTTCCCG ACAACTGTTG TTGGCTTTCA TGTGTGAATT ACTCCCTGAG TACGGTAACC GAGTCACTAT CGATCCCAGA CACAAAGATA AACTGGGTAA TTATCGTCCC GTCATTAATT TCAACCTACC CGACTACAGT CGTAGAACCC TTGCTTACAC CAGAAAAGTC TCCCGCGTCA TCTTCGAGCG TTTAGGTGCA GAAGACTATA CTCACTACGA TCCTCAAGAC CCTGCCTATT TTGAGTTTGA GGGTGAAGGC TACGTATACA AAGGTGGAAA CCACTTTTCG GGAACTCATA TTATGGGAAC AACGCCCTCG AATTCAGTTG TTGATAGTTA TCTGCGTTCT TGGGATCATA AAAATCTTTT CCTCGTCGGC GCAGGAAGTA TGCCCACCAT TGGTAGTTCC AATACAACCT TAACCATTGC CGCATTGAGT TTTAGAACTG CCGAGCATAT CCTGCAAGAA 
TTAAATTCTT ATAATTTGCC AGCTGCTAAT TTGCAAGCTC GCTAA
|
Protein sequence | MSLSQLVNVD SDTVSKTVYD AVIVGTGVAG AIVAKELSQQ GKKVLIIEAT VHKDLTLAGF QSYVDTFYKA VDKNPNSPYP ANANVPSPTD YNDYFIEKGP MPLAGSYTRV LGGTTMHWEA KTPRMLPEDF KLSSTYGQGL DWPISYHDLE PYYRKAEHEM GVCGDVEEQK KLGLEFPQDY VFPMEKLPPS YLDQKVIEKV DGTKVELYGK THTISFATFP QARNGVPNPK YDQGNLFVPD GVTTVHPVQY GERCQGNANC VPICPVQAKY DARRTLSKAF ETGRVHVLYQ AVAYKLEYDR QNGRITAIHY KNYKKPNSSD YTTGIAKGTV FVLATNAVEN ARLLLGSDLP NTSGLIGRYL MDHPFTLAWA LMPEVTGTMR GPLVTSGIGT FRQGDFRKKQ SAFAVDIHND GWGWATGSPK SEVEDAIDNK NKYGQELRQT LISRISRQLL LAFMCELLPE YGNRVTIDPR HKDKLGNYRP VINFNLPDYS RRTLAYTRKV SRVIFERLGA EDYTHYDPQD PAYFEFEGEG YVYKGGNHFS GTHIMGTTPS NSVVDSYLRS WDHKNLFLVG AGSMPTIGSS NTTLTIAALS FRTAEHILQE LNSYNLPAAN LQAR
|
| |
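
The sequences above are printed in space-separated blocks of ten characters, so they need to be normalized before any downstream use. The following is a minimal sketch, with hypothetical helper names, of stripping the block spacing and computing a GC fraction. It is demonstrated only on the first three blocks of the gene sequence. The 45% GC content reported in the record is not recomputed here.

```python
def normalize_sequence(blocked: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


if __name__ == "__main__":
    # First three 10-base blocks of the gene sequence in the record.
    excerpt = "ATGAGCCTCA GTCAATTGGT CAATGTTGAT"
    seq = normalize_sequence(excerpt)
    print(len(seq))          # number of bases after removing the spacing
    print(gc_fraction(seq))  # GC fraction of the excerpt only
```

To apply this to the full record, pass the complete Gene sequence field (both lines) to `normalize_sequence` first.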