Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_29900 |
Symbol | |
ID | 7761891 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 3092437 |
End bp | 3094641 |
Gene Length | 2205 bp |
Protein Length | 734 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643805863 |
Product | Protein-tyrosine kinase wzz family protein |
Protein accession | YP_002800131 |
Protein GI | 226945058 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis |
TIGRFAM ID | [TIGR01005] exopolysaccharide transport protein family [TIGR01007] capsular exopolysaccharide family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.131777 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCACCCCA CTTCCATATC CAGCCGGCAA GACCGCGAGG AAGCCGGCAT CGACCTGCGC AAGTTGCTCG GCGCCCTGCT CGACCACAAG TGGCCGATCC TCGGCCTGAC CCTCCTGTTC TTCCTGGGCG GCACCCTCTA CGGCCTGTTC GCCACACCGA TCCACCGCAC CAGCGCCCTG GTTCAGATCG AGAAGAAAAG CGGCACCGTT CCGGGACTGG AGATGGGGGA CTTCACCCAG GCCTCCAGCG CCAGGACGGA AATCGAACTG ATCCGCTCGC GCAGCCTGAT CGGCCAGGCG GTCGACAACC TGCACCTCGA CGTCCGGGCC ACGCCCCAGC GCTTTCCGCT GCTCGGCGAC TACCTCGCCC GCCGCCACCG GGGCCCGGAG CCCGCGCCGC CCTTGCTCGG TCTGCAGGAA TACGCCTGGG GGGGCGAGAA GATCGACATC CTGCGCTTCG AGGTCGAGCC GAGCCGGATC GGCAAGGCTT TCCTGCTCAT CGCCGGGGAA GACGGCGCCT TCACCCTGCA GGAAAGCGAG GGCGAGCCGC TGCTGCGGGG ACGGGTCGGC GAAGCCATCG AACGGAACGG CATCCGCCTG CAGATCCGCG AACTGCAGGC CCGGCCGGGT ACCCGCTTCA CCCTGAAGAA GGCGAGCCGT TCGAGCACCG TGCGCAGTTA CCGGAACGCC TTGCAGCTCA CCGAACTGGG CACCGACACC GGCCTGATCA CCGTGGCCAT GGAACACCCC GACCCCGAGC ACGCCAGCCG AGTGGTGGAC GAGATCGGCC GCCTGTTCGT CCGCCAGAAT ATCGAACGCA TGTCCGCCGA GGCCAACGGC AGCCTGGAGT TCGTCCGCAG CCAGTTGCCG GAAGTGCGAC GCGAACTGGA CCAGGCCGAA GGCGCCCTGA ACGACTACCG CAAGCGACAC GGCTCCGTCG ATATCGCCAT GGAAACCGGC ACCGTGCTCA GCCAGGCCGT CGACCTGGAA ACGCGGATCT CGGAACTGAG GATGCAGCAG GCCGAACTCG ACCGCCGCTT CACCCCCGAG CACCCGGCCT ACAAGACCCT GCTGCTACAG ATCCAGGGCC TGACCCAGCG CAAGAACGAA ATCGCCAGAA GGGTGCAGAG CTTGCCGGAA ACTCAGCAGG AACTCCTGCG CCTGTCGCGC GACGTGCAGG TCGGCACCAC TATCTACACC CAATTGCTGA ACAAGGCCCA GGAACTGGAT CTGGTCCGCG CCGGCACCAT CGGCAACGTG CGCATCGTCG ATCCGGCGGT CATCGACGGC GGCCCGGTCG CCCCGAACAA GAGCCTCATC CAGGTGCTCG CCACCCTGGC CGGAGCGCTG CTGGCGATCG GCTTCGTGCT GCTGCGCCGC TTGCTCGATC CGGGCCTGGA AACCCCGGAA GCCATCGAAC AACTGGGCCT GCCGGTGTAC GCGGCGGTAC CCTTCAGCGC CCACCAGGTC CATGCCAAAA TCCGCCGCCG CCTGCCGGCC GGCGATCCCG CCATCTCGCC GCTGCTCGCT CTCGGCCACC CGAACGATCC GGCGGTGGAA GCCCTGCGCA GCCTGCGGAC CAGCCTGCAC TTCATCACGC TGGGCGCCCA GGACAACCGC CTGGTGATTT CCGGTCCCGG CCCGCAGGCG GGCAAGAGCT TCATCTGCGC CAACCTGGCC GCCGTCGTCG CCCAGGCCGG CAAGCGCGTG CTGCTGATCG ACGTGGACAT GCGCAAGGGC CACCTGCACA AGCTGCTCGG CATGCCGGCG AGCCCGGGGC TGGCGGAGCT GCTCGGCGGG CACTGCACGC TCGCCGACGC GCTCCACCCG ACCCCGCTCG AAGGCCTGTT CCTGCTGCCA CGGGGCCAGC TCCCGCCGAA TCCCTCGGAG CTGCTGATGC GCCCCGAGTT CGCGGCCACC CTCGAACAGG CCAGCGCGAG CCACGACCTG GTCATCCTCG ACACCCCGCC GCTGCTGGCC GTCACCGACG CCGCCATCGT CGGCCGGCAA GCGGCCACCA CGCTGATCGT GACCCGCTTC GGCGTGAGCT CCGCCCACGA AATCGAGATG ACCGTCAGGC GCTTCGCGCA GAGCGGCATC GAAATCAAGG GCGCCATCCT CAATGGCCTG GAAAAGCGCG CCGCCACCTA CGGCTACGGC CACGCCGCCT ACCACCACTA CGAGTACAAG TCGGACAACG CCTGA
|
Protein sequence | MHPTSISSRQ DREEAGIDLR KLLGALLDHK WPILGLTLLF FLGGTLYGLF ATPIHRTSAL VQIEKKSGTV PGLEMGDFTQ ASSARTEIEL IRSRSLIGQA VDNLHLDVRA TPQRFPLLGD YLARRHRGPE PAPPLLGLQE YAWGGEKIDI LRFEVEPSRI GKAFLLIAGE DGAFTLQESE GEPLLRGRVG EAIERNGIRL QIRELQARPG TRFTLKKASR SSTVRSYRNA LQLTELGTDT GLITVAMEHP DPEHASRVVD EIGRLFVRQN IERMSAEANG SLEFVRSQLP EVRRELDQAE GALNDYRKRH GSVDIAMETG TVLSQAVDLE TRISELRMQQ AELDRRFTPE HPAYKTLLLQ IQGLTQRKNE IARRVQSLPE TQQELLRLSR DVQVGTTIYT QLLNKAQELD LVRAGTIGNV RIVDPAVIDG GPVAPNKSLI QVLATLAGAL LAIGFVLLRR LLDPGLETPE AIEQLGLPVY AAVPFSAHQV HAKIRRRLPA GDPAISPLLA LGHPNDPAVE ALRSLRTSLH FITLGAQDNR LVISGPGPQA GKSFICANLA AVVAQAGKRV LLIDVDMRKG HLHKLLGMPA SPGLAELLGG HCTLADALHP TPLEGLFLLP RGQLPPNPSE LLMRPEFAAT LEQASASHDL VILDTPPLLA VTDAAIVGRQ AATTLIVTRF GVSSAHEIEM TVRRFAQSGI EIKGAILNGL EKRAATYGYG HAAYHHYEYK SDNA
|
| |