Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_04540 |
Symbol | |
ID | 7759412 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 429787 |
End bp | 432033 |
Gene Length | 2247 bp |
Protein Length | 748 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643803376 |
Product | hypothetical protein |
Protein accession | YP_002797684 |
Protein GI | 226942611 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG2982] Uncharacterized protein involved in outer membrane biogenesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAACGC TCGGCAAAAT CCTTGGCCTG CTCGTCCTCG GGCTGCTGCT GGTCGTAGTC GCCCTCGGCT TCGCCCTTAC TCAGTTGTTC GATCCCAACG ACTACAAGGA CGAAATCCGC CAGCTCGCCC GCGACAAGGC CAATCTGGAA CTGACCCTGA ACGGCGACAT CGGCTGGAGT CTCTTCCCCT GGCTGGGCCT GCAGTTGCAG CAGACCAGCG TGGCCAGCGC CCGCACCCCC GACCAGCCGT TCGCCGACCT GGACATGATC GGCCTGTCGG TACGAGTGCT GCCGCTGCTG CGCCGGGAGA TCCAGATGAG CGACATCCGC GTCGACGGCC TCGAACTGCA CCTGCGCCGC GACAAGCAGG GCCGCGGCAA CTGGGAGGAC ATCGGCCGTC CGGCACGGAC GGCCGGGCAG CCGGAGCCGG AGGCCACCGC CGGGACAGAC ACGGCACCGG CCGAAGCGCC CGCCGCGGCC GATGCGGCCG ATGCGGGCAG CGCCCGGGCG CTGCGCCTGG ACATCGACAG CCTGGCGCTG AACAACGCCC GGGTGGACTA CCTGGACGAG CGCAGCGGCC AGAAATTCAG TGCCGAGAGC ATCCAGCTCA ACAGCGGCGC GATACGCCCG GACAGCGACA TCCCCCTCAA GCTGACCGCC TTCTTCGGCA GCAACCAGCC GGTCATGCGC GCGCGCACCG ATCTCGAGGG CGTGCTGCGC TTCGACTCCG CGCTCCAGCG CTTCCAGTTG GGCAACGCCC GGCTGTCCGG CGAAGCCTCG GGCGAACCGC TCAAGGGCAA GAGCCTGAAC TTCGCCGCCC AGGGCGAGTT GCTCGCCGAC CTGGCCGCCC AGGTGGCCGA ATGGAACGGC CTGAAGCTCA GCGCCAACCA GTTGCGCGCC ATCGGTGAAC TGAAGGTGCG CGAGCTGGAC AAGATCCCGC AACTGAGCGG CGGCCTGTCC ATCGCCCAGT TCAACCCGCG CGAATTCCTC GCCGGCCTCG GCCAGGAGCT GCCGGCGACG GCCGATCCCG CCAGCCTGAC CCGCCTCGAA CTGACCACCC AGTTGGGTGG CTCGCCGAGC GCCGTGGCCT TCGACAAGCT CGACCTGAAG CTGGACGACA GCCAGTTCAC CGGCCGCATC GCCGTCGCCG ACCTCGCCCG CCAGGCCCTG CGCGTACAGC TCAAGGGCGA CAAGCTGGAC CTCGACCGCT ACCTGGCGCC GAAGAGCGAG AAGCGAGAGG CCGCCGGCGC CGCGCGCCAG GCCGAAGTCA AGGGCGCCGT GGCCAGCGCC ATCCAGAACA GCGACACCCC GCTGCCCGAT GCCCCCACCC AGCAGGCTTG GAGCGAAGAC CCCGTGCTGC CGGTCGACAC CCTGCGCAAA CTGGACCTGC AGGCCAGCCT CGACATCGGC CAGTTGACCA TCGAGCGCCT GCCGGTCGGC AACGCCCACC TGCAGGCCAG CGCCAAGGAC GGCCTGCTGA CCCTGCAGAG CCTGCGCGGC GAACTCTTCG GCGGCGGCTT CGAGACGCAG GCCCGCCTTG ACGCCCGTCC GTCCACGCCC CTGCTGAGCC TGCAGCAGCG CCTCAGCCGG ATTCCGGTGG AGAAATTCAT CCGGACGGAA GGCAAGGAAA CGCCTCCCAT CAAGGGCCAG CTCGATCTGA ACGCCGACCT GGAAACCCGC GGTAACAGCC AGAAGGCCTG GATCGAAGGA CTCAACGGCA CGGCCAGCTT CGCCCTCCAC AAAGGCGTGC TGCCCGAAGC CAACCTCGAA CAGCAGCTCT GCCTGGCCAT CGCCACCCTC AACCGCAAGG GCCTGAACAA TCCGCCCAAG GCCAAGGACA CGCCCTTCGA GGAACTCAAG GGCAACCTGC GCTTCACCAA CGGCGTCGCC AGCAACCCCG ACCTCAAGGC GCGCATTCCC GGCCTGACGG TCAACGGCAA CGGCGACCTG GACCTGCGCG TGCTCGGCAT GAACTACCGC ATCGGCGTGA TCATTGAAGG CGACAGGCGC GCCATGCCCG ATGCCGCCTG CCAGGTCAAC GAGCGCTATG TCGGCCTGGA ATGGCCGGTC CGCTGCCGCG GCCCGCTGGA ACTGGGCGCC AAGGCCTGCC GCCTGGACCG CGACGGCCTG CGCCAGATCG CCGCCAAGCT GGCCGGCGAG AAGCTCAACG AGAAGCTCGA AGAAAAACTC GGCGACAAGG TCAGCCCGGA ACTCAAGGAT GCGCTGAAGG GGCTGTTCAA CCGATGA
|
Protein sequence | MKTLGKILGL LVLGLLLVVV ALGFALTQLF DPNDYKDEIR QLARDKANLE LTLNGDIGWS LFPWLGLQLQ QTSVASARTP DQPFADLDMI GLSVRVLPLL RREIQMSDIR VDGLELHLRR DKQGRGNWED IGRPARTAGQ PEPEATAGTD TAPAEAPAAA DAADAGSARA LRLDIDSLAL NNARVDYLDE RSGQKFSAES IQLNSGAIRP DSDIPLKLTA FFGSNQPVMR ARTDLEGVLR FDSALQRFQL GNARLSGEAS GEPLKGKSLN FAAQGELLAD LAAQVAEWNG LKLSANQLRA IGELKVRELD KIPQLSGGLS IAQFNPREFL AGLGQELPAT ADPASLTRLE LTTQLGGSPS AVAFDKLDLK LDDSQFTGRI AVADLARQAL RVQLKGDKLD LDRYLAPKSE KREAAGAARQ AEVKGAVASA IQNSDTPLPD APTQQAWSED PVLPVDTLRK LDLQASLDIG QLTIERLPVG NAHLQASAKD GLLTLQSLRG ELFGGGFETQ ARLDARPSTP LLSLQQRLSR IPVEKFIRTE GKETPPIKGQ LDLNADLETR GNSQKAWIEG LNGTASFALH KGVLPEANLE QQLCLAIATL NRKGLNNPPK AKDTPFEELK GNLRFTNGVA SNPDLKARIP GLTVNGNGDL DLRVLGMNYR IGVIIEGDRR AMPDAACQVN ERYVGLEWPV RCRGPLELGA KACRLDRDGL RQIAAKLAGE KLNEKLEEKL GDKVSPELKD ALKGLFNR
|
| |