Gene Avin_38840 details


Gene Information

Locus tag: Avin_38840
Symbol: dnaE
ID: 7762773
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Azotobacter vinelandii DJ
Kingdom: Bacteria
Replicon accession: NC_012560
Strand:
Start bp: 3927954
End bp: 3931484
Gene Length: 3531 bp
Protein Length: 1176 aa
Translation table: 11
GC content: 66%
IMG OID: 643806747
Product: DNA polymerase III subunit alpha
Protein accession: YP_002800999
Protein GI: 226945926
COG category: [L] Replication, recombination and repair
COG ID: [COG0587] DNA polymerase III, alpha subunit
TIGRFAM ID: [TIGR00594] DNA-directed DNA polymerase III (polc)
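
The start, end, gene length and protein length listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon (the gene sequence in this record ends in TGA). The function names `gene_length` and `protein_length` are illustrative and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3927954
END_BP = 3931484


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length, "bp;", protein_length(length), "aa")
```

Under these assumptions the result agrees with the record: 3531 bp and 1176 aa.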


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCCGTTT CCTTCGTCCA CCTGCGCGTG CACACCGAGT TTTCCCTGGT CGACGGCCTG 
GTACGGGTCA AGCCGCTGAT CAAGGCCGTG GCCGGGGCTG GCATGCCCGC GGTGGCGGTC
ACCGACCAGA GCAACATGTG CTCACTGGTG AAGTTCTACA AGGCGGCCAT GGCGGGTGGG
GTCAAGCCGA TCTGCGGCGC CGATATCTGG CTGGCCAGCG CCGAGCCCGA TGCTCCGCTG
TCGCGCCTGA CCCTGCTGGC GATGGATGCC AAGGGTTACC GCAACCTCAC CGAGCTGGTT
TCCCGGGGCT GGACCGAGGG TCAGCACGGC GATGGCCTGG TGATCATCCA GCGCGACTGG
GTGAAGGAGG CCGCCGAAGG GGTGATCGCC CTGTCCGGGG CGAAGGAAGG GGAGATCGGC
CAGGCGCTGC TCAACGGCCA TGTCGACCAG GCCGCAGCGC TGCTCGAGGA GTGGCTGGCG
GTGTTTCCCG AGCGCTTCTA TCTGGAAGTG CAGCGTACCG GCCGGGTCGG CGACGAGGAG
TACCTGCATG CCGCCGTCGA ACTGGCCGAT CGCTACGGCG CTCCGCTGGT GGCGACCAAC
GACGTGCGCT TTCTCAAGCA GGAGGACTTC GAGGCCCACG AGACCCGCGT CTGCATCGGC
GAGGGACGCA CCCTCGGCGA CCCGCGGCGG CCGCGTAACT ACTCCGACCA GCAGTACCTG
AAGACCCCGG CGGAGATGTG GGAGCTGTTT TCCGACCTGC CCGAAGCGCT GGAGAACAGC
GTGGAGATCG CCCGGCGCTG CAACATCGAA GTGCGCCTCG GCAAGTACTT CCTGCCCAAC
TTCCCGGTGC CGGCCGGCAT GACCATCGAC GACTACCTGC GCCAGGTGGC CTACGAGGGG
CTGGAGGAGC GTCTGGCGGT GCTCTGGCCG CAGGAGACCA CGCCCGACTA CGCGGATAAG
CGCGAGGTCT ACGTCGAGCG TCTGGAGTTC GAGCTGAACA CCATCATCCA GATGGGCTTT
CCTGGCTACT TCCTGATCGT GATGGACTTC ATCAAGTGGG CGAAGAACAA CGGCGTGCCG
GTCGGTCCCG GCCGCGGCTC CGGCGCCGGC TCGCTGGTCG CCTACGCGCT GAAGATCACC
GACCTCGATC CCCTGGCCTA CGACCTGCTG TTCGAGCGCT TCCTCAACCC CGAGCGGATC
TCCATGCCCG ACTTCGACGT CGACTTCTGC ATGGACGGCC GCGACCGGGT CATCGACTAC
GTGGCCGACA CCTACGGGCG CAACGCCGTG AGCCAGATCA TCACCTTCGG CTCGATGGCG
GCCAAGGCGG TGGTGCGCGA TGTGGCGCGG GTGCAGGGCA AGTCCTACGG GCTGGCCGAC
CGCCTGTCGA AGATGATCCC CTTCGAGGTC GGCATGACCC TGGAGAAGGC CTACGAGCAG
GAGGAGATGC TCCGCGAGTT CCTCAAGAAC GACGAGGAGG CCCAGGAGAT CTGGGACATG
GCCCTGAAGC TCGAGGGCAT CACCCGCGGC ACCGGCAAGC ACGCCGGCGG CGTGGTGATC
GCGCCGACCA AGCTCACCGA TTTCGCGCCC ATCGCCTGCG ATGCGGACGG CGGCGGCCTG
GTCACCCAGT TCGACAAGGA CGACGTGGAG GCGGCCGGCC TGGTCAAGTT CGACTTCCTC
GGGCTGCGCA CCCTGACCAT CATCAAGTGG GCGATGGAAA CCATCCACCG CGAGCAGCGG
CGCCGGGGCG AAACCGAACT GGTGGACATC GACCGCATCG CGCTGGACGA CAAGGCCACC
TACGCGCTCC TGCAGAGGGC CGAGACCACC GCGGTGTTCC AGCTCGAATC GCGCGGCATG
AAGGAACTGA TCAAGAAGCT CAAGCCCGAC AACATCGAGG ACATGATCGC GCTGGTCGCG
CTGTTCCGCC CGGGCCCGCT GCAGTCGGGC ATGGTGGACG ACTTCATCAA CCGCAAGCAT
GGCCGCGCGG AGCTTTCCTA CCCGCACCCC GATTACCAGT ACGCGGGCCT GGAGCCGGTC
CTGAAACCCA CCTACGGCAT CATCCTGTAT CAGGAGCAGG TGATGCAGAT CGCCCAGGTG
ATGGCCGGCT ACACCCTCGG CGGCGCGGAC ATGCTGCGTC GCGCCATGGG CAAGAAGAAG
CCCGAGGAAA TGGCCAAGCA GCGCGGCGGC TTCGTCGAAG GCTGCGCGAA GAATGGCATC
GATGCCGAGC TGGCGGGCAA CATCTTCGAT CTGGTGGAAA AGTTCGCCGG TTACGGGTTC
AACAAGTCGC ACTCGGCCGC CTATGGCCTG GTTTCCTACC AGACGGCCTG GCTGAAGACC
CGTTATCCGG CGCCCTTCAT GGCCGCGGTG CTTTCCGCGG ACATGCACAA CACCGACAAG
GTGGTGACCC TGATCGAGGA ATGCCGCAGC ATGAAGCTGC GCATCGTGGC GCCGGACGTG
AACAACTCCG AGTTCATGTT CACCGTCGAC GACGAGGGGC GCATCGTCTA CGGCCTGGGG
GCGATCAAGG GCGTCGGCGA GGGGCCGGTC GAGGCCATCG TCGAATGCCG CGCCGCCGGC
GGTCCCTTCA CGGATCTGTT CGACTTCTGC GCGCGGGTCG ATCTCAAGCG GATCAACAAG
CGTACCCTCG AGGCGCTGAT CCGTGGCGGC GCGCTGGACC GCCTGGGACC GTACTTCGCC
GACGAGCCCA AGGCCTACCA GGCCAACATC GACCGCAACC GGGCGGTCCT GCTGGCCGCC
GTGGAGGAAG CCGTGCAGGC CGCCGAGCAG ACCGCGCGCA GCGCCGACAG CGGCCATCTG
GACCTGTTCG GCGGGCTGTT CGCCGAGCCC GAGGCGGACG TCTATGCCAA TCACCGCAAC
GCCCGCGAGC TGTCGCTCAA GGATCGCCTC AAGGGCGAAA AGGACACCTT GGGCCTGTAC
CTGACCGGCC ATCCGATCGA CGAGTACGAA GGCGAGGTCC GCCGCTTCGC CCGCCAGCGC
ATCGTCGAGC TGCGCCCGGC GCGGGAACTC CAGACGATCG CCGGACTGAT CGTCAACCTG
CGGGTAATGA AGAACAAGAA GGGCGACAAG ATGGGTTTCA TCACCCTCGA CGATCGCTCC
GCGCGGATTG AGGCCTCCCT GTTCGCCGAT GCCTTCGCCG GCGCCCAGGC GCTCCTGCAG
ACCGATGCCC TGGTAGTTGT GGAGGGCGAG GTCAGCAACG ACGACTTCTC CGGTGGCCTG
CGCCTGCGCG CCAAGCGGGT GATGAGCCTG GAGGAGGCAC GCACGGGACT GGCCGAGAGC
CTGCGGCTGC GGGTCGCCAG CGAGGCGCTG GAAGGGGATC GCCTGCGCTG GCTGGCGGAG
CTCTGCAGCC GCTATCGGGG CGCCTGCCCC ATCACCCTGG ACTACATCGG GCGCGAGGCG
CGTGCCCTGC TGCAGTTCGG CGAAGCGTGG CGGATCGATC CGGCGGACAG CCTGATTCAG
ACGCTGCGTG ACCAGTTCGG CAAGGACAAC GTCTTTCTGC ACTACCGCTG A
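
The gene sequence above is laid out in blocks of ten bases separated by spaces and line breaks. The sketch below shows one way to strip that layout and compute the GC fraction, which could then be compared with the GC content reported in the Gene Information section. The helper names `clean_sequence` and `gc_content` are hypothetical, and the short strings in the demonstration are only the first two blocks of the gene sequence, not the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Fraction of G and C bases in a sequence (whitespace is ignored)."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


if __name__ == "__main__":
    # First two ten-base blocks of the gene sequence, for illustration only.
    sample = "ATGTCCGTTT CCTTCGTCCA\n"
    print(clean_sequence(sample))
    print(round(gc_content(sample) * 100), "% GC in the sample")
```

To apply this to the whole gene, paste the full block-formatted sequence into `gc_content` as a single string.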
 
Protein sequence
MSVSFVHLRV HTEFSLVDGL VRVKPLIKAV AGAGMPAVAV TDQSNMCSLV KFYKAAMAGG 
VKPICGADIW LASAEPDAPL SRLTLLAMDA KGYRNLTELV SRGWTEGQHG DGLVIIQRDW
VKEAAEGVIA LSGAKEGEIG QALLNGHVDQ AAALLEEWLA VFPERFYLEV QRTGRVGDEE
YLHAAVELAD RYGAPLVATN DVRFLKQEDF EAHETRVCIG EGRTLGDPRR PRNYSDQQYL
KTPAEMWELF SDLPEALENS VEIARRCNIE VRLGKYFLPN FPVPAGMTID DYLRQVAYEG
LEERLAVLWP QETTPDYADK REVYVERLEF ELNTIIQMGF PGYFLIVMDF IKWAKNNGVP
VGPGRGSGAG SLVAYALKIT DLDPLAYDLL FERFLNPERI SMPDFDVDFC MDGRDRVIDY
VADTYGRNAV SQIITFGSMA AKAVVRDVAR VQGKSYGLAD RLSKMIPFEV GMTLEKAYEQ
EEMLREFLKN DEEAQEIWDM ALKLEGITRG TGKHAGGVVI APTKLTDFAP IACDADGGGL
VTQFDKDDVE AAGLVKFDFL GLRTLTIIKW AMETIHREQR RRGETELVDI DRIALDDKAT
YALLQRAETT AVFQLESRGM KELIKKLKPD NIEDMIALVA LFRPGPLQSG MVDDFINRKH
GRAELSYPHP DYQYAGLEPV LKPTYGIILY QEQVMQIAQV MAGYTLGGAD MLRRAMGKKK
PEEMAKQRGG FVEGCAKNGI DAELAGNIFD LVEKFAGYGF NKSHSAAYGL VSYQTAWLKT
RYPAPFMAAV LSADMHNTDK VVTLIEECRS MKLRIVAPDV NNSEFMFTVD DEGRIVYGLG
AIKGVGEGPV EAIVECRAAG GPFTDLFDFC ARVDLKRINK RTLEALIRGG ALDRLGPYFA
DEPKAYQANI DRNRAVLLAA VEEAVQAAEQ TARSADSGHL DLFGGLFAEP EADVYANHRN
ARELSLKDRL KGEKDTLGLY LTGHPIDEYE GEVRRFARQR IVELRPAREL QTIAGLIVNL
RVMKNKKGDK MGFITLDDRS ARIEASLFAD AFAGAQALLQ TDALVVVEGE VSNDDFSGGL
RLRAKRVMSL EEARTGLAES LRLRVASEAL EGDRLRWLAE LCSRYRGACP ITLDYIGREA
RALLQFGEAW RIDPADSLIQ TLRDQFGKDN VFLHYR