Gene Information |
Locus tag | Avin_33130 |
Symbol | btuB |
ID | 7762209 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 3386626 |
End bp | 3389697 |
Gene Length | 3072 bp |
Protein Length | 1023 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643806179 |
Product | TonB-dependent vitamin B12 receptor |
Protein accession | YP_002800443 |
Protein GI | 226945370 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG4206] Outer membrane cobalamin receptor protein |
TIGRFAM ID | [TIGR00708] cob(I)alamin adenosyltransferase; [TIGR01779] TonB-dependent vitamin B12 receptor |
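The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; neither convention is stated in the record itself. The function names are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Gene Information table for Avin_33130.
length_bp = gene_length(3386626, 3389697)
print(length_bp)                  # 3072
print(protein_length(length_bp))  # 1023
```

Under these assumptions both results agree with the listed Gene Length (3072 bp) and Protein Length (1023 aa).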
| |
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGATTCGCA AGCCCTGTTT CGGGCCGGCG GCGTCCGCGC TGCTGGGCCT TTTTCCTTTT TCCCCGCTCC TGGCGGCCGA CGAACCCCTC GCCCTCGACG ATCTGGTGGT GACCGCCGCA CGCAGTGCCC AGAGCCTGCG CGACAGTCTC GCCGCAGTGA GCCTGATCGA GCGCGACGAC ATCGAGCGCA GCCAGGCGCA GTCGGTGCCG GAGCTGCTCA AGAAGGTGCC GGGGGTGTCC ATCGCCAACA ATGGCGGGCC GGGCAAGTCC ACTTCCGTCT ACCTGCGCGG CACCGAATCC GACCACGTGC TGGTGCTGAT CGATGGCGTA CGGGTCGGTT CGGTGACCAG CGGCACCGCC GCCTGGCAGG ACCTGCCGGT GGAGATGATC GAGCGCATCG AGGTGGTCCG CGGGCCGCGC TCCAGCCTGT ACGGCTCGGA GGCCATCGGC GGGGTGATCC AGATCTTTAC CCGCAAGGGT GGCGACGGCG CGCCCAGGCC GTTCTTCTCC GCCGGCTACG GCACCCACGA CAGCTACACC GGCAGCCTCG GCGTCTCCGG CGGCAACGCC CACGGCTGGT ACAGCCTGGC GCTGAGCAGC GCGGACAGCG ACGGCATCAA CGTCAAGCGT CCCGGCGCCA GCGGCTACGA GAGCGACGCC GACGGCTACC GCAACCACTC CGCCTCGCTG CGCGCCGGCT GGCGCTTCGA CAACGGTCTG GAGCTGGAGG GCAGCTTCCT GCGCGCCAAG TCGCACAACG ATTACGATCA GGTGAACAGC CGGCGCACGT CCGGCTTCTC CGCCAATGCC GACGGCGAGC AGAACGTGGT CAGCGGTCGC GCCCGTTTCA GCCCGCTGGC GTTCTGGCAG GTCACCCTGC AGGCGGGGCG CAACGAGGAC AAGTCCGACA CCTACCAGGA CGGCCACTTC TATTCGCGTT TCGACAGCCG CCGCGACAGC GCCAGTTGGC AGAACGACCT GACCCTGGCC GAAGGCCACA TCCTCACCCT GGGCGTCGAC TACCAGCGCG AGGAAGTCAA CGGCAGCACC GACTACGACG AGGATTCGCG GGAAAACAAC GGCGCCTTCA TCCAGTATCT CGGCGAATAC GGTCGCCACG ACTGGCAGGT GTCCCTGCGC CGCGACAACA ACGAGCAGTT CGGCCAGCAC GAGACCGGCA ACATCGCCTA CGGCTACGCC CTGACCGACG CCCTGCGCGC CACCATCAGC TACGGCAGCG CCTTCAAGGC ACCGACCTTC AACGAGCTGT ACTATCCCTT CTACGGCATC GCCGACCTCG AAGAAGAGAC CTCGCACAGC CTGGAGGTCG GCCTGTCCGG CTCGCACGCC TGGGGACACT GGTCGCTGAA CGCCTATCGC ACCAAGGTGA ACGACCTGAT CGTCTACGAT TCTTCCATCC AGGGGCCGGC GAACCTCGAC GAGGCGCGTA TCCGCGGCCT CGAGCTGGAG GTCGGCAGCC GTACCTTCGG CTGGGACTGG AGCGCCAACT ACAGCCTGCT GGAGCCGGAA AACAGCGGCT CGGGCACCAA CGACGGCAAC GAGCTGCCGC GCCGCGCCGC GCAGATGTTC AACCTGGAGC TGGACAGGCG CTTCGGCGAT TTCGCCGTCG GCGCCACGTT GCACGCCGAG GGCCGGCGCT ACGACGACGT GGCCAACGAC GACGAACTGT CCGGCTATGC CACCGTGGAT CTGCGCGGCG AATACCGCAT CAGCCCGGAG TGGCGCCTGC AGGCGCGTGT CGCCAACCTG CTGGACGCCG ATTACCAAAC CGCCTGGACT 
ACAACCAGCC GGGGCAGGCG GTGTACCTTA CCGTCCGCTA TCAGGCGCTG TGAGGCGCCG CCGGCCGGCG GGCCGGAACC CTACAAGGAG AAAGCGCATG CTCAATCTGT CCTCCCGCAG CCAACTGATC GTCGGTGCGC TGCTGGCCCT GCTGATGGCC GTGACCCGTG GTCATCACTT CGCCACTCTC GATCTGCCGA GCGCCTCCTG GGCGGTGTTC TTCCTCGCCG GCGTCCTGCT GCGCCCGCGC TGGGCGTTCC CGGCGCTGTT CCTGGAAGCC TCGCTGCTCG ACTTCGTCGC CATCGGCTGG ATGGGCGCGA GCGACTGGTG CCTGTCGCCG GCCTACTGGC TGCTGGTGCC GGCCTACGGC TCGCTGTGGC TGGGCGGACG CCTCTATGCC CGCCTGCAAC GCGACAGCCT GGTCGCGCTC GGCCTGGCGG TGGTCTTCGG CGCTTTCGTC TGCTACCTGT TCTCCGGCGG CGGTTTCTAC TTCTTCTCCG GCCGCTACGC GGAGCCGACC TTCGCCGGCT TCGTGCAGCG CCTGATCGCC TACTACCCGC GCAACCTGGC CGGCCTGGCC CTCTACGTGG GCCTCGCCGC GCTGCTCTAC GCGGGCTTCG CCACCCGCCT GAAGACGCTG CGGGTGCAGG ACGCGCGCGG ATGAGCGAGT CGCCGGAGCG GGACGCCCGC CACAAGGCAC GCATGCAGCG CAAGAAAGCG GTGGTCGACG CAAGGATCGC CCGCGCCGGC GACGAGCACG GCCTCTTGCT GGTGCACACC GGCAACGGCA AGGGCAAGAG CAGCGCCGCC TTCGGCATGG CCGCCCGCGC TCTGGGGCAC GGCATGCGGG TCGGCGTGGT GCAGTTCGTC AAGGGCGCCG CCAGCACTGG CGAGGAAGCC TTCTTCCGGC GTTTTCCCGA GCAGGTGTGC TACCACGTGA TGGGCGAGGG CTTCACCTGG GAAATCCAGG ACCGCCAGCG CGACATCGAC AGGGCCCGCG AGGCCTGGAA GGTGGCCCGC GAACTGCTCG GCGATCCGTC GATCGGCCTG GTGCTGCTCG ATGAACTGAA CATCGCGCTG AAATACGGCT ATCTGGAGCT GGAGCCGGTG CTCGCCGACA TCCGCGCCCG GCCGTGGCAC CAGCACGTGG TGGCGACCGG CCGTGGCGCG CCGCCCGGAC TGATCGAGGC GGCCGACACC GTGACCGAGA TGAGCCCGGT CAAGCACGCC TTCCAGGCCG GGGTGAAAGC GCAGAAGGGG ATCGAGTTCT GA
|
Protein sequence | MIRKPCFGPA ASALLGLFPF SPLLAADEPL ALDDLVVTAA RSAQSLRDSL AAVSLIERDD IERSQAQSVP ELLKKVPGVS IANNGGPGKS TSVYLRGTES DHVLVLIDGV RVGSVTSGTA AWQDLPVEMI ERIEVVRGPR SSLYGSEAIG GVIQIFTRKG GDGAPRPFFS AGYGTHDSYT GSLGVSGGNA HGWYSLALSS ADSDGINVKR PGASGYESDA DGYRNHSASL RAGWRFDNGL ELEGSFLRAK SHNDYDQVNS RRTSGFSANA DGEQNVVSGR ARFSPLAFWQ VTLQAGRNED KSDTYQDGHF YSRFDSRRDS ASWQNDLTLA EGHILTLGVD YQREEVNGST DYDEDSRENN GAFIQYLGEY GRHDWQVSLR RDNNEQFGQH ETGNIAYGYA LTDALRATIS YGSAFKAPTF NELYYPFYGI ADLEEETSHS LEVGLSGSHA WGHWSLNAYR TKVNDLIVYD SSIQGPANLD EARIRGLELE VGSRTFGWDW SANYSLLEPE NSGSGTNDGN ELPRRAAQMF NLELDRRFGD FAVGATLHAE GRRYDDVAND DELSGYATVD LRGEYRISPE WRLQARVANL LDADYQTAWT TTSRGRRCTL PSAIRRCEAP PAGGPEPYKE KAHAQSVLPQ PTDRRCAAGP ADGRDPWSSL RHSRSAERLL GGVLPRRRPA APALGVPGAV PGSLAARLRR HRLDGRERLV PVAGLLAAGA GLRLAVAGRT PLCPPATRQP GRARPGGGLR RFRLLPVLRR RFLLLLRPLR GADLRRLRAA PDRLLPAQPG RPGPLRGPRR AALRGLRHPP EDAAGAGRAR MSESPERDAR HKARMQRKKA VVDARIARAG DEHGLLLVHT GNGKGKSSAA FGMAARALGH GMRVGVVQFV KGAASTGEEA FFRRFPEQVC YHVMGEGFTW EIQDRQRDID RAREAWKVAR ELLGDPSIGL VLLDELNIAL KYGYLELEPV LADIRARPWH QHVVATGRGA PPGLIEAADT VTEMSPVKHA FQAGVKAQKG IEF
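The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any length or composition check. The following is a minimal sketch of that cleanup, applied only to the first three blocks of the gene sequence for illustration; the helper names are hypothetical. Note that the gene begins with TTG while the protein begins with M: under translation table 11, TTG is an accepted alternative start codon and is read as methionine at the initiation position.

```python
def clean_sequence(raw: str) -> str:
    """Remove the block spacing and line breaks used in the display format."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First three display blocks of the gene sequence (a 30 bp excerpt, not the full gene).
raw_excerpt = "TTGATTCGCA AGCCCTGTTT CGGGCCGGCG"
excerpt = clean_sequence(raw_excerpt)

print(len(excerpt))                    # 30
print(excerpt[:3])                     # TTG (alternative start codon in table 11)
print(round(gc_fraction(excerpt), 3))  # 0.633 for this excerpt only
```

Running the same functions on the full cleaned gene sequence would allow its length and GC fraction to be compared against the Gene Length and GC content fields of the record.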