Gene Avin_11040 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_11040 
Symbol 
ID7760048 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp1055269 
End bp1056276 
Gene Length1008 bp 
Protein Length335 aa 
Translation table11 
GC content68% 
IMG OID643804008 
ProductCytochrome bd ubiquinol oxidase, subunit II 
Protein accessionYP_002798310 
Protein GI226943237 
COG category[C] Energy production and conversion 
COG ID[COG1294] Cytochrome bd-type quinol oxidase, subunit 2 
TIGRFAM ID[TIGR00203] cytochrome d oxidase, subunit II (cydB) 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGCATCG ATCTTCCCCT GATCTGGGCG GTGATCATCA GCTTCGGCGT GATGATGTAC 
GTGGTGATGG ACGGCTTCGA CCTGGGCATC GGCATGCTCT ATCCGTTCTT CAAGGACGCC
GGCGACCGCG ACGTGATGAT GAACACCGTG GCGCCGGTCT GGGACGGCAA CGAAACCTGG
CTGGTGCTCG GCGGCGCCGG GCTGTTCGCG GCCTTCCCGC TGGCCTATGC GGTGGTCCTC
TCGGCGCTCT ACCTGCCCCT GGTGTTCATG CTGATCGGCC TGGTGTTCCG CGGCGTGGCC
TTCGAGTTCC GCTTCAAGGC CCGCGAGGAG CGGCGGCATA TCTGGGACAA GGCGTTCATC
GGCGGTTCGC TGGCCGCCAC CTTCTTCCAG GGGGTGGTCC TCGGCGCCTT CATCGAGGGC
ATCCCGGTCG TCGACGGCCG CTTCGCCGGC GCTGCGCTCG ACTGGCTGGC GCCCTTCCCG
CTGTTCTGCG GCCTGGGCCT GGTCGTCGCC TACAGCCTGC TCGGCTGCAC CTGGCTGATC
ATGAAGACCG AGGGTCCCCT GCAGCGACGC ATGCACACTC TGGCCAGGCC GCTGGCCCTG
GCGTTGCTGG CGGTGATCGG CATCCTCAGC CTGTGGACGC CGCTGGCCCA CGAGGCCATC
GCCCGGCGCT GGTTCAGCCT GCCCAACCTG ATCTGGTTCC TGCCGGTGCC GCTGCTGGTG
CTGCTGAGCA TCCGGGCCCT GCTGCGTTCG GTGGCGAACG ACGATCAGGT ACGGCCCTTC
CTGCTCACCC TGGCGCTGGT CTTCCTCGGC TACAGCGGCC TGATCATCAG CCTGTGGCCG
AACATCATCC CGCCCTCGAT CAGCCTCCGC GACGCCGCCG CGCCGCCGCA GAGCCAGGGC
TTCGCGCTGG TCGGCGCGCT GCTGATCATC CCCGTCATCC TCGGCTACAC CGCCTGGAGC
TACTACGTGT TCCGCGGCAA GGTGCGGCAC GGCGAAGGCT ATCACTGA
 
Protein sequence
MGIDLPLIWA VIISFGVMMY VVMDGFDLGI GMLYPFFKDA GDRDVMMNTV APVWDGNETW 
LVLGGAGLFA AFPLAYAVVL SALYLPLVFM LIGLVFRGVA FEFRFKAREE RRHIWDKAFI
GGSLAATFFQ GVVLGAFIEG IPVVDGRFAG AALDWLAPFP LFCGLGLVVA YSLLGCTWLI
MKTEGPLQRR MHTLARPLAL ALLAVIGILS LWTPLAHEAI ARRWFSLPNL IWFLPVPLLV
LLSIRALLRS VANDDQVRPF LLTLALVFLG YSGLIISLWP NIIPPSISLR DAAAPPQSQG
FALVGALLII PVILGYTAWS YYVFRGKVRH GEGYH