Gene Avin_01020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_01020 
Symbol 
ID7759069 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp103098 
End bp104255 
Gene Length1158 bp 
Protein Length385 aa 
Translation table11 
GC content68% 
IMG OID643803028 
Productcytochrome c oxidase, subunit II 
Protein accessionYP_002797344 
Protein GI226942271 
COG category[C] Energy production and conversion 
COG ID[COG1622] Heme/copper-type cytochrome/quinol oxidases, subunit 2 
TIGRFAM ID[TIGR02866] cytochrome c oxidase, subunit II 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.24737 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGCGAC ATCCACGACT CTGGATGGGT CTTCTGTCGT GCTCGGCCCT GTCCCAGGCG 
CAGGCCGCCT GGGATGTGAA CATGCGGCCG GGGGTCACGG AAGTCAGCCG TTCCGTCTTC
GATCTGCACA TGACCATCTT CTGGATCTGC GTGGCCATCG GCGTGCTGGT GTTCGGCGCG
ATGTTCTGGT CGATGTTCGC CCACCGCCGT TCGCGTCGCC CGCAGCCCGC CCACTTCCAC
GAGAACACCC GGGTCGAGGT GCTGTGGACG GTGATCCCGC TGTTGATCCT GATCGCCATG
GCGGTGCCGG CGACCCGCAC CCTGCTGCAC ATCTACGACC CGTCCGAGCC CGACCTGGAC
ATCCAGGTCA CCGGCTACCA GTGGAAGTGG CACTACAAGT ACCTGGGCGA GGACGTGGAG
TTCTTCAGCA ACCTGGCCAC CGACCGCAAC GCCATCGGCA ACCAGGCGCC GAAGAACGAC
CACTACCTGC TGGAGGTGGA CGAACCGCTG GTGATCCCGG CCGGCGCCAA GGTGCGCTTC
CTGGTCACCG CGGCGGACGT CATCCACTCC TGGTGGGTAC CGGAACTGGC GGTGAAGAAG
GACGCCATCC CCGGCTTCAT CAACGAGACC TGGACCCGCG TCGCCGAGCC GGGCCTCTAC
CGCGGCCAGT GCACCGAACT GTGCGGCAAG GATCACGGCT TCATGCCGGT GGTGGTGGAG
GTCAAGGCTC CGGCCGACTA CGCCGCCTGG CTGGCCGGCA AAAAGGCCGC CGCCGCCGAG
GCCACGGCGC AGGCCGGCAA GGCCTGGACC CTGGAGGAAC TGGTCGCCCA GGGCGAGCGG
GTCTACCGGA CCGCCTGCGT CGCCTGCCAC CAGCCGACCG GCGAGGGCCT GCCGCCGGCA
TTCCCGGCGC TCAAGGGTTC GAAGATCGCC ACCGGACCGA AGGAAGGCCA CATGAACATC
GTCATCGACG GCAAGCCGGG CACTGCCATG GCCGCCTTCG GCAAGCAGCT CTCGGACGTC
GACCTGGCGG CGGTGATCAC CTACGAGCGC AACGCCTTCG GCAACGCGCT CGGCGACAGC
GTCACCCCGC AGGACATCCA CGCCTTCCGG CAGGCCCGGG AAACCGGCCA GGGCATGCAG
CCCGCCCAAC CCCAATAG
 
Protein sequence
MMRHPRLWMG LLSCSALSQA QAAWDVNMRP GVTEVSRSVF DLHMTIFWIC VAIGVLVFGA 
MFWSMFAHRR SRRPQPAHFH ENTRVEVLWT VIPLLILIAM AVPATRTLLH IYDPSEPDLD
IQVTGYQWKW HYKYLGEDVE FFSNLATDRN AIGNQAPKND HYLLEVDEPL VIPAGAKVRF
LVTAADVIHS WWVPELAVKK DAIPGFINET WTRVAEPGLY RGQCTELCGK DHGFMPVVVE
VKAPADYAAW LAGKKAAAAE ATAQAGKAWT LEELVAQGER VYRTACVACH QPTGEGLPPA
FPALKGSKIA TGPKEGHMNI VIDGKPGTAM AAFGKQLSDV DLAAVITYER NAFGNALGDS
VTPQDIHAFR QARETGQGMQ PAQPQ