Gene Avin_33130 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_33130 
SymbolbtuB 
ID7762209 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp3386626 
End bp3389697 
Gene Length3072 bp 
Protein Length1023 aa 
Translation table11 
GC content68% 
IMG OID643806179 
ProductTonB-dependent vitamin B12 receptor 
Protein accessionYP_002800443 
Protein GI226945370 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG4206] Outer membrane cobalamin receptor protein 
TIGRFAM ID[TIGR00708] cob(I)alamin adenosyltransferase
[TIGR01779] TonB-dependent vitamin B12 receptor 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGATTCGCA AGCCCTGTTT CGGGCCGGCG GCGTCCGCGC TGCTGGGCCT TTTTCCTTTT 
TCCCCGCTCC TGGCGGCCGA CGAACCCCTC GCCCTCGACG ATCTGGTGGT GACCGCCGCA
CGCAGTGCCC AGAGCCTGCG CGACAGTCTC GCCGCAGTGA GCCTGATCGA GCGCGACGAC
ATCGAGCGCA GCCAGGCGCA GTCGGTGCCG GAGCTGCTCA AGAAGGTGCC GGGGGTGTCC
ATCGCCAACA ATGGCGGGCC GGGCAAGTCC ACTTCCGTCT ACCTGCGCGG CACCGAATCC
GACCACGTGC TGGTGCTGAT CGATGGCGTA CGGGTCGGTT CGGTGACCAG CGGCACCGCC
GCCTGGCAGG ACCTGCCGGT GGAGATGATC GAGCGCATCG AGGTGGTCCG CGGGCCGCGC
TCCAGCCTGT ACGGCTCGGA GGCCATCGGC GGGGTGATCC AGATCTTTAC CCGCAAGGGT
GGCGACGGCG CGCCCAGGCC GTTCTTCTCC GCCGGCTACG GCACCCACGA CAGCTACACC
GGCAGCCTCG GCGTCTCCGG CGGCAACGCC CACGGCTGGT ACAGCCTGGC GCTGAGCAGC
GCGGACAGCG ACGGCATCAA CGTCAAGCGT CCCGGCGCCA GCGGCTACGA GAGCGACGCC
GACGGCTACC GCAACCACTC CGCCTCGCTG CGCGCCGGCT GGCGCTTCGA CAACGGTCTG
GAGCTGGAGG GCAGCTTCCT GCGCGCCAAG TCGCACAACG ATTACGATCA GGTGAACAGC
CGGCGCACGT CCGGCTTCTC CGCCAATGCC GACGGCGAGC AGAACGTGGT CAGCGGTCGC
GCCCGTTTCA GCCCGCTGGC GTTCTGGCAG GTCACCCTGC AGGCGGGGCG CAACGAGGAC
AAGTCCGACA CCTACCAGGA CGGCCACTTC TATTCGCGTT TCGACAGCCG CCGCGACAGC
GCCAGTTGGC AGAACGACCT GACCCTGGCC GAAGGCCACA TCCTCACCCT GGGCGTCGAC
TACCAGCGCG AGGAAGTCAA CGGCAGCACC GACTACGACG AGGATTCGCG GGAAAACAAC
GGCGCCTTCA TCCAGTATCT CGGCGAATAC GGTCGCCACG ACTGGCAGGT GTCCCTGCGC
CGCGACAACA ACGAGCAGTT CGGCCAGCAC GAGACCGGCA ACATCGCCTA CGGCTACGCC
CTGACCGACG CCCTGCGCGC CACCATCAGC TACGGCAGCG CCTTCAAGGC ACCGACCTTC
AACGAGCTGT ACTATCCCTT CTACGGCATC GCCGACCTCG AAGAAGAGAC CTCGCACAGC
CTGGAGGTCG GCCTGTCCGG CTCGCACGCC TGGGGACACT GGTCGCTGAA CGCCTATCGC
ACCAAGGTGA ACGACCTGAT CGTCTACGAT TCTTCCATCC AGGGGCCGGC GAACCTCGAC
GAGGCGCGTA TCCGCGGCCT CGAGCTGGAG GTCGGCAGCC GTACCTTCGG CTGGGACTGG
AGCGCCAACT ACAGCCTGCT GGAGCCGGAA AACAGCGGCT CGGGCACCAA CGACGGCAAC
GAGCTGCCGC GCCGCGCCGC GCAGATGTTC AACCTGGAGC TGGACAGGCG CTTCGGCGAT
TTCGCCGTCG GCGCCACGTT GCACGCCGAG GGCCGGCGCT ACGACGACGT GGCCAACGAC
GACGAACTGT CCGGCTATGC CACCGTGGAT CTGCGCGGCG AATACCGCAT CAGCCCGGAG
TGGCGCCTGC AGGCGCGTGT CGCCAACCTG CTGGACGCCG ATTACCAAAC CGCCTGGACT
ACAACCAGCC GGGGCAGGCG GTGTACCTTA CCGTCCGCTA TCAGGCGCTG TGAGGCGCCG
CCGGCCGGCG GGCCGGAACC CTACAAGGAG AAAGCGCATG CTCAATCTGT CCTCCCGCAG
CCAACTGATC GTCGGTGCGC TGCTGGCCCT GCTGATGGCC GTGACCCGTG GTCATCACTT
CGCCACTCTC GATCTGCCGA GCGCCTCCTG GGCGGTGTTC TTCCTCGCCG GCGTCCTGCT
GCGCCCGCGC TGGGCGTTCC CGGCGCTGTT CCTGGAAGCC TCGCTGCTCG ACTTCGTCGC
CATCGGCTGG ATGGGCGCGA GCGACTGGTG CCTGTCGCCG GCCTACTGGC TGCTGGTGCC
GGCCTACGGC TCGCTGTGGC TGGGCGGACG CCTCTATGCC CGCCTGCAAC GCGACAGCCT
GGTCGCGCTC GGCCTGGCGG TGGTCTTCGG CGCTTTCGTC TGCTACCTGT TCTCCGGCGG
CGGTTTCTAC TTCTTCTCCG GCCGCTACGC GGAGCCGACC TTCGCCGGCT TCGTGCAGCG
CCTGATCGCC TACTACCCGC GCAACCTGGC CGGCCTGGCC CTCTACGTGG GCCTCGCCGC
GCTGCTCTAC GCGGGCTTCG CCACCCGCCT GAAGACGCTG CGGGTGCAGG ACGCGCGCGG
ATGAGCGAGT CGCCGGAGCG GGACGCCCGC CACAAGGCAC GCATGCAGCG CAAGAAAGCG
GTGGTCGACG CAAGGATCGC CCGCGCCGGC GACGAGCACG GCCTCTTGCT GGTGCACACC
GGCAACGGCA AGGGCAAGAG CAGCGCCGCC TTCGGCATGG CCGCCCGCGC TCTGGGGCAC
GGCATGCGGG TCGGCGTGGT GCAGTTCGTC AAGGGCGCCG CCAGCACTGG CGAGGAAGCC
TTCTTCCGGC GTTTTCCCGA GCAGGTGTGC TACCACGTGA TGGGCGAGGG CTTCACCTGG
GAAATCCAGG ACCGCCAGCG CGACATCGAC AGGGCCCGCG AGGCCTGGAA GGTGGCCCGC
GAACTGCTCG GCGATCCGTC GATCGGCCTG GTGCTGCTCG ATGAACTGAA CATCGCGCTG
AAATACGGCT ATCTGGAGCT GGAGCCGGTG CTCGCCGACA TCCGCGCCCG GCCGTGGCAC
CAGCACGTGG TGGCGACCGG CCGTGGCGCG CCGCCCGGAC TGATCGAGGC GGCCGACACC
GTGACCGAGA TGAGCCCGGT CAAGCACGCC TTCCAGGCCG GGGTGAAAGC GCAGAAGGGG
ATCGAGTTCT GA
 
Protein sequence
MIRKPCFGPA ASALLGLFPF SPLLAADEPL ALDDLVVTAA RSAQSLRDSL AAVSLIERDD 
IERSQAQSVP ELLKKVPGVS IANNGGPGKS TSVYLRGTES DHVLVLIDGV RVGSVTSGTA
AWQDLPVEMI ERIEVVRGPR SSLYGSEAIG GVIQIFTRKG GDGAPRPFFS AGYGTHDSYT
GSLGVSGGNA HGWYSLALSS ADSDGINVKR PGASGYESDA DGYRNHSASL RAGWRFDNGL
ELEGSFLRAK SHNDYDQVNS RRTSGFSANA DGEQNVVSGR ARFSPLAFWQ VTLQAGRNED
KSDTYQDGHF YSRFDSRRDS ASWQNDLTLA EGHILTLGVD YQREEVNGST DYDEDSRENN
GAFIQYLGEY GRHDWQVSLR RDNNEQFGQH ETGNIAYGYA LTDALRATIS YGSAFKAPTF
NELYYPFYGI ADLEEETSHS LEVGLSGSHA WGHWSLNAYR TKVNDLIVYD SSIQGPANLD
EARIRGLELE VGSRTFGWDW SANYSLLEPE NSGSGTNDGN ELPRRAAQMF NLELDRRFGD
FAVGATLHAE GRRYDDVAND DELSGYATVD LRGEYRISPE WRLQARVANL LDADYQTAWT
TTSRGRRCTL PSAIRRCEAP PAGGPEPYKE KAHAQSVLPQ PTDRRCAAGP ADGRDPWSSL
RHSRSAERLL GGVLPRRRPA APALGVPGAV PGSLAARLRR HRLDGRERLV PVAGLLAAGA
GLRLAVAGRT PLCPPATRQP GRARPGGGLR RFRLLPVLRR RFLLLLRPLR GADLRRLRAA
PDRLLPAQPG RPGPLRGPRR AALRGLRHPP EDAAGAGRAR MSESPERDAR HKARMQRKKA
VVDARIARAG DEHGLLLVHT GNGKGKSSAA FGMAARALGH GMRVGVVQFV KGAASTGEEA
FFRRFPEQVC YHVMGEGFTW EIQDRQRDID RAREAWKVAR ELLGDPSIGL VLLDELNIAL
KYGYLELEPV LADIRARPWH QHVVATGRGA PPGLIEAADT VTEMSPVKHA FQAGVKAQKG
IEF