Gene Information |
Locus tag | Avin_05290 |
Symbol | |
ID | 7759485 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 507134 |
End bp | 508273 |
Gene Length | 1140 bp |
Protein Length | 379 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643803449 |
Product | hypothetical protein |
Protein accession | YP_002797757 |
Protein GI | 226942684 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG5653] Protein involved in cellulose biosynthesis (CelD) |
TIGRFAM ID | |
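The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal Python sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon. The function names are hypothetical and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from this record (Avin_05290).
length_bp = gene_length(507134, 508273)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

With the values in this record, the coordinates span 1140 bp, and 1140 / 3 - 1 gives the 379 aa listed under Protein Length.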
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0316409 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGGACTATC GCTTCGAATG GACCACCAGC CTCTGCGCGC CCGACTTTCC CCACGAGGCC TACGCCAGGC TCCATGCTCT GGTGCCGCAG GCCACCCCGT TCAACCGGCT CGGCTGGCTG CGGGCCGCTG AGCGCGCCCT CGAAGCGGAA CAGCGCCTGC ACGTGCTGCT GGCCTGGGAA GGCCGCGAAC TGCGCCTGTG CCTGCCGCTG GTGCGTAGCC GCGAGCGCCG CCTCGGCCTG CGCTGGACGA TCCTGCGCCA TCTCGGCTAT CCGCTCGGCG ACCGCATCGC CCTGCTCTGC CAACTCGACG AAAACGGCCG GCGCCGGGCG CGCGAGGCCA TCCACCGGCA TCTGCCGCAT GCCCTCCTGC AACTGCACGA ACTGGCCGCG GATGGCGAGC AGCAAACCCT GCTCGAAGGC TGGGCCGTGG CCAGTTCGAG CCACGAAAAG CGCACGAGCT GCCGGGTGCC GGTGCACGCG ATCGGCGAGG ACGACCGGCG CGAACCGTCC GGCGACCTGC GCTACAAGTT GCGCCGGGCG CGCAAGCGCG CGGCCGCCTG CGACGCTCGG ATACGCCGGC TGAGCCCGGA CGGCGCGAGC ATCGGCGCCG CACTGGAGAC CATCGCCGCG GTCGAGCGGG CCAGTTGGAA GGGCGCGCAG GGCGTCGGCA TCTTCTCCGG CGAACGGCGC CGGCAATGGA TGCGCGAGGC CTTCGGCGAA CTCGCCGCGG ACGGCCTGGT GCGCATCGTC CTGCTCGAAC ACGGCGGCCG CTGCATCAGC TACCGCCTCG GGCTCCTCGA ACACGGCCGG CTGTACGACT ACAACCTGGC CTTCCTGCCG GACTACGCCG AACTCGGCAG CGGCCGCCTG CTGCTGGACG AATGGATCCG CTGGGGCCTG GAGGAGGGCT GGCGGTACGT CGACGCCTCG CGGGTCAGCC TGCGCGATTC CAGCCATCAA CTGCACGAAC GCATGACCGG CGCGGTCCTG CAACTGCGCT GGAGCCTCTA CTCCCGGCGC CCCGAGGGCA TCGCCCTGGG CCTCGCCTAC CGCCTCTGGA GCGCCCTCAA GGCCCGGCGC CGGAGCGCCA TGGTCTGCGC CTCCGCCGAC ACAGGAGAAC GCCCATGCCC AACCGAGTGA
Protein sequence | MDYRFEWTTS LCAPDFPHEA YARLHALVPQ ATPFNRLGWL RAAERALEAE QRLHVLLAWE GRELRLCLPL VRSRERRLGL RWTILRHLGY PLGDRIALLC QLDENGRRRA REAIHRHLPH ALLQLHELAA DGEQQTLLEG WAVASSSHEK RTSCRVPVHA IGEDDRREPS GDLRYKLRRA RKRAAACDAR IRRLSPDGAS IGAALETIAA VERASWKGAQ GVGIFSGERR RQWMREAFGE LAADGLVRIV LLEHGGRCIS YRLGLLEHGR LYDYNLAFLP DYAELGSGRL LLDEWIRWGL EEGWRYVDAS RVSLRDSSHQ LHERMTGAVL QLRWSLYSRR PEGIALGLAY RLWSALKARR RSAMVCASAD TGERPCPTE
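The sequences above are displayed in blocks of ten characters. As a rough sketch, the following Python helpers strip the spacing, compute GC content, and translate codons. It assumes that the spaces are display formatting only and that the gene sequence is already given in the coding orientation (it begins with ATG and ends with TGA, even though the gene lies on the minus strand). The helper names are hypothetical. The amino-acid assignments of translation table 11 match the standard code, so one compact codon table is used here.

```python
BASES = "TCAG"
# Amino acids in the conventional TCAG codon order; "*" marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(blocked: str) -> str:
    """Remove the display whitespace and normalise to upper case."""
    return "".join(blocked.split()).upper()


def gc_percent(blocked: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean_sequence(blocked)
    return 100.0 * sum(1 for base in seq if base in "GC") / len(seq)


def translate(blocked: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean_sequence(blocked)
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks of the gene sequence in this record.
print(translate("ATGGACTATC GCTTCGAATG GACCACCAGC"))  # MDYRFEWTTS
```

The translated residues match the first block of the protein sequence in this record. Running `gc_percent` on the full gene sequence would be one way to compare against the GC content field.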