Gene Avin_26070 details


Gene Information

Locus tag: Avin_26070
Symbol:
ID: 7761516
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Azotobacter vinelandii DJ
Kingdom: Bacteria
Replicon accession: NC_012560
Strand:
Start bp: 2663463
End bp: 2665703
Gene Length: 2241 bp
Protein Length: 746 aa
Translation table: 11
GC content: 71%
IMG OID: 643805486
Product: oxidoreductase
Protein accession: YP_002799759
Protein GI: 226944686
COG category: [C] Energy production and conversion
COG ID: [COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs
TIGRFAM ID:
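
The coordinate and length fields above can be cross-checked against each other. The sketch below is illustrative only and not part of the record: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2663463
END_BP = 2665703
GENE_LENGTH_BP = 2241
PROTEIN_LENGTH_AA = 746


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Codons that encode amino acids, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # End - start + 1 should reproduce the reported gene length.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    # Gene length / 3, minus the stop codon, should give the protein length.
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks agree with the record: 2665703 - 2663463 + 1 = 2241 bp, and 2241 / 3 - 1 = 746 aa.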


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCAGAGC TGCACTTCCC GGACGAGACG GCCGCCACCG GCGGCATCCG CAACTTCAGC 
CGCCGCGCCT TCCTGCGCGG CACCGGCGGC CTGGCGCTGG GCATCTACTT CGCGCCGCTG
CTCGGGCGCG GGAGCCTGGC CGGAGCGGCC GGCGATTTCG AGCCGAACGC CTTCCTGCGC
ATCGGCTCCG ACGGCCTGGT CACGGTGATC GCCAAGCACG TCGAGATGGG CCAGGGCAGC
TACACCGGCC TCGCCACCCT GGTCGCCGAG GAACTGGATG CGGACTGGGG CCGCGTGCGC
GTCGAGGGCG CCCCGGCGAA TACCGAGCTG TACAAGAACC TTGCCTTCGG CCTGCAAGGG
ACCGGTGGCA GCAGCGCCAT CGCCAACTCC TTCGAGCAGA TGCGCAAGGC CGGCGCCAGC
GCCCGTGCCA TGCTGGTGGC CGCCGCCGCC GAGCAATGGC GGGTGCCGGC GGAGCAGATC
GGGGTGCGCC AGGGCGTGGT CGAGCACGCC GCCTCGGGCC GCAAGGCCGG CTTCGGCGAG
CTGGCCGAGG CCGCGGCCAG GCAAGCGCTG CCCGCCGATG TGAAGCTCAA GGAGCCGAAG
GACTTCGGGC TCATCGGCCG GCAGACGCTG CCGCGCACCG ACAGCCCGGA CAAGACCGAC
GGCAGCGCGG TCTTCACCCA GGACATCCAC CTGCCGGACA TGCTGGTGGC CGTCCCCCTC
CATCCGCCAC GCTTCGGCGC CAGTCCGGCG AAAGTCGATG CGAGCAAGGC CAGGGCGCTG
CCGGATGTGG TCGCGGTGGT CGAGTATCCC GGCGACGAGC ACCGTTTCGC CGGCGTCGCC
GTGCTGGCCC GCAATACCTG GGCGGCGATC CGGGGACGCG ACGCGCTGCA GGTGGAATGG
GACGAGAGCA AGGCCTTCCG CCTCGGCAGC GCGGAGATCC TCGCCCGGTA CCGCGATCTG
GCCGGCCAGC CCGGCAAGGT CGCGCGCGAC GAAGGCGACG CCGGCGGCGC GCTGGAGCGG
GCGGCCAGGC GCATCGACGC CGAGTTCGAA TTCCCCTATC TGGCCCACGC CGCCATGGAG
CCGCTGAACT GCGTGGTCAG GCTGGACGAG GGGCGCTGCG AAATCTGGAA CGGCGAGCAG
TGGCAGTCCG GCGACCAGCG ACTGGTGGCG CAGTTGCTGG GCATTGCCCC GGAGCATGTG
TCGATCACCC AGCTCTACGC CGGCGGTAGC TTCGGCCGGC GCGCCAACCC GCACTCGGAC
TACGTGCTGG AGGCGGTGTC CATCGCCAGG GCGGCCCGTG ACCAGGGGCT GAAGGGGCCG
GTGAAGATGG TCTGGCCCCG CGAGAACGAC ACCCGCGGCG GCCACTACCG GCCGCTGTTC
CTGCACCGGG CCAGCCTTGG CCTGGACGAA GAAGGCAGGC TGACGGCCTG GCAGCATCGC
CTGGTGGGAC AGTCGTTCAT GCAGGGCACG CCGTTCGAAT CGGTGATGCT CAAGGACGGC
ATCGACAAGG TGGCGGTGGA GGGCGTCGAC AATCTCGCCT ACGCGGTCCC CAACCTGCGC
GTCGAACTGC ACCTGGCGCA GGACATCGGC GTGCCGACCC AGTGGTGGCG CTCGGTGGGC
CACACCCATA CCGCCTATGC GGCGGAAACC CTGATCGACG AAGCCGCCCA GGCCGCCGGC
AAGGACCCCT ACCAGTACCG CCGGGCGCTG CTGGACAGGC AGCCGCGCCA CCTCGGCGTG
CTCGACCTGG CGGCGGAGAA GGCCGGCTGG ACGAGCGCCT TGCCGCCGGG AGCGCAGGGC
GAGCGGCGCG GGCGCGGCAT CGCGGTGCAC GAGTCGTTCG GCAGCTTCGT GGCGCAGGTG
GTGGAAGTGA CCCTGAAGCC CGACCACAGC TACACGGTGG ACAGGGTCGT CTGCGCGGTG
GACTGCGGCG TGGCGATCAA CCCGGACGTG ATCCGCGCGC AGATGGAAGG CGGCATCGGC
TTCGCCCTGT CGGCGGCCAT GCACAGCGCC ATCACCCTGA AGGATGGCGT GGTCGAGCAG
TCGAACTTCC ACGACTTCCA GGTGCTCCGC CTCAACGAGA TGCCGCGGGT CGAGGTGCAC
ATCGTGCCCT CGGCGGAAGC ACCCAGCGGC GTCGGCGAGC CGGGCGTGCC GCCGCTGGCG
CCGGCGCTGG CCAATGCGCT GTTCGCCGCG AGCGGCAAGC GCATCCGCCG GCTACCGATC
GGCAAGCAGT TGCAGGCATA G
 
Protein sequence
MAELHFPDET AATGGIRNFS RRAFLRGTGG LALGIYFAPL LGRGSLAGAA GDFEPNAFLR 
IGSDGLVTVI AKHVEMGQGS YTGLATLVAE ELDADWGRVR VEGAPANTEL YKNLAFGLQG
TGGSSAIANS FEQMRKAGAS ARAMLVAAAA EQWRVPAEQI GVRQGVVEHA ASGRKAGFGE
LAEAAARQAL PADVKLKEPK DFGLIGRQTL PRTDSPDKTD GSAVFTQDIH LPDMLVAVPL
HPPRFGASPA KVDASKARAL PDVVAVVEYP GDEHRFAGVA VLARNTWAAI RGRDALQVEW
DESKAFRLGS AEILARYRDL AGQPGKVARD EGDAGGALER AARRIDAEFE FPYLAHAAME
PLNCVVRLDE GRCEIWNGEQ WQSGDQRLVA QLLGIAPEHV SITQLYAGGS FGRRANPHSD
YVLEAVSIAR AARDQGLKGP VKMVWPREND TRGGHYRPLF LHRASLGLDE EGRLTAWQHR
LVGQSFMQGT PFESVMLKDG IDKVAVEGVD NLAYAVPNLR VELHLAQDIG VPTQWWRSVG
HTHTAYAAET LIDEAAQAAG KDPYQYRRAL LDRQPRHLGV LDLAAEKAGW TSALPPGAQG
ERRGRGIAVH ESFGSFVAQV VEVTLKPDHS YTVDRVVCAV DCGVAINPDV IRAQMEGGIG
FALSAAMHSA ITLKDGVVEQ SNFHDFQVLR LNEMPRVEVH IVPSAEAPSG VGEPGVPPLA
PALANALFAA SGKRIRRLPI GKQLQA
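
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. It is not the tool that produced this record. It assumes the 10-base blocks are display formatting only, and it uses the amino acid assignments of the standard genetic code, which translation table 11 shares for internal codons (table 11 differs in which start codons it permits). The helper names are hypothetical, and only the first and last lines of the gene sequence are included as sample input.

```python
from itertools import product

# Codon order TTT, TTC, TTA, TTG, TCT, ... paired with the amino acid
# string used for the standard code (same assignments as table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(blocked: str) -> str:
    """Remove the spaces and line breaks used to display the sequence."""
    return "".join(blocked.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First and last lines of the gene sequence, copied from this record.
# The last line starts in frame because the 37 preceding lines hold
# 60 bases each.
FIRST_LINE = "ATGGCAGAGC TGCACTTCCC GGACGAGACG GCCGCCACCG GCGGCATCCG CAACTTCAGC"
LAST_LINE = "GGCAAGCAGT TGCAGGCATA G"

if __name__ == "__main__":
    print(translate(FIRST_LINE))  # compare with the start of the protein
    print(translate(LAST_LINE))   # compare with the end of the protein
```

Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the reported GC content, and `translate` on the full sequence can be compared with the complete protein sequence.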