Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_10950 |
Symbol | alg44 |
ID | 7760039 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 1046532 |
End bp | 1047698 |
Gene Length | 1167 bp |
Protein Length | 388 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643803999 |
Product | alginate biosynthesis protein Alg44 |
Protein accession | YP_002798301 |
Protein GI | 226943228 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00119266 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATACCG CGACCCTTAA TGTAAACGTG GTCCACGAGT CGGAAGCGCA ACGCCAACAC GCGCGCGTCA AGCTGCCCGG TAAGATCCGC TTCCTCGGCC CCAACCGGGA AACCATCGAG CAGCGTCTGA TCGACATCTC CGCCGGTGGC TTCAGCTTCG CCAGCGGCAA GCCGGTCACC CAGCAGGGTG CCTTCCACCG CGGCAAGCTG CTGTTCCAGC TGGACAGCCT GGGCCTGGCC ATGGACGTGG AGTTCCAGGT GCGCAACCTC GACCCGGAAA GCGGCCGCAC CGGCTGCCAG TTCCACGGTC TGGGCGCACG CGAGATCTCC ACCCTGCGCC AGATGATCAC CTCGCACCTG AGCGGCGAGT TGGTCACCGT CGGCGATGTG ATCTGCACCC TGCAGCGCGA CAACTTCACC AAGGCGCGCA AGGGCAAGGG CCTGGCCCAG CAGACCATGT TCGAACGCTT GCGCGCGGTC AGCTTCAGCC TGGCGATCTT CATCGTCGGC CTCGGCGCCT TCGGCCTGAT CCTCAAGCAG CTCTACGACC TCTACTTCGT CACCCATGCC GAGTCGGGCA TGGTCAGCGT GCCCAGCATG GAAGTGACCA TGCCCCGCGA AGGCACGGTG CAGAGCCTGG TCGGCCCGGA CGGCCTGGTC GCCAACGGTG CGCCGATCGC CAGCTTCTCC GCCTCCATGC TGGAAATGCT CAAGGGCCAC CTGAGCGAGG AGCAGCTCAA CCCGGCCAAC GTCGAGAAGC TGTTCACCCG GCAGATGAAG GGCACCCTGA CCAGCCCGTG CGATTGCAAG GTGGTCGCCC AGCGCGTGGC CGACGGCCAG TTCGCCTCCA AGGGCCAGGT GATCTTCGAG CTGCTGCCGC GCGACGCCGC CGCTACCGTC GAGGCCCGTT TCCGCTACCA CGACTTCGCC AAGGTCAAGC CGGGCACCCA GGTCACTTTC AGCGTTCCCG GCGAGGACCA GCCGCGCCGC GGCCGGATCG TCAGCACCGC CCTGCAGAAT GAAGGACTGT CCAGCGATAT CCGCGTGCTC ATCCAGCCCG AACAGCCCCT GGACAGCGCC CTGGCCGGCC AGCCGGTGGA AGTGGTCATC GACCACGGCC CTTCCTACGA CTGGCTGATC GACAAGGCCG TGACCGCCGG ACTCTGA
|
Protein sequence | MNTATLNVNV VHESEAQRQH ARVKLPGKIR FLGPNRETIE QRLIDISAGG FSFASGKPVT QQGAFHRGKL LFQLDSLGLA MDVEFQVRNL DPESGRTGCQ FHGLGAREIS TLRQMITSHL SGELVTVGDV ICTLQRDNFT KARKGKGLAQ QTMFERLRAV SFSLAIFIVG LGAFGLILKQ LYDLYFVTHA ESGMVSVPSM EVTMPREGTV QSLVGPDGLV ANGAPIASFS ASMLEMLKGH LSEEQLNPAN VEKLFTRQMK GTLTSPCDCK VVAQRVADGQ FASKGQVIFE LLPRDAAATV EARFRYHDFA KVKPGTQVTF SVPGEDQPRR GRIVSTALQN EGLSSDIRVL IQPEQPLDSA LAGQPVEVVI DHGPSYDWLI DKAVTAGL
|
| |