Gene Avin_33710 details


Gene Information

Locus tag: Avin_33710
Symbol: algE5
ID: 7762266
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Azotobacter vinelandii DJ
Kingdom: Bacteria
Replicon accession: NC_012560
Strand:
Start bp: 3445248
End bp: 3448244
Gene Length: 2997 bp
Protein Length: 998 aa
Translation table: 11
GC content: 67%
IMG OID: 643806232
Product: Secreted mannuronan C5-epimerase
Protein accession: YP_002800496
Protein GI: 226945423
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG5434] Endopolygalacturonase
TIGRFAM ID: [TIGR03661] type 1 secretion C-terminal target domain (VC_A0849 subclass)
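
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Number of residues for an unspliced CDS, assuming one trailing stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the record for Avin_33710.
length = gene_length_bp(3445248, 3448244)
print(length)                     # 2997
print(protein_length_aa(length))  # 998
```

Both results agree with the Gene Length (2997 bp) and Protein Length (998 aa) fields.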


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATTACA ACGTCAAAGA TTTCGGGGCG CTGGGCGATG GCGTCAGCGA CGATACGGCC 
GCCATCCAGG CGGCGATCGA TGCCGCCTAC GCGGCCGGCG GCGGCACCGT CTACCTGCCG
GCCGGCGAAT ACCGGGTCAG CGGTGGCGAG GAGCCTTCCG ACGGTTGCCT GACCATCAAG
AGCAACGTCT ACATCGTCGG CGCGGGGATG GGCGAGACGG TGATCAAGCT GGTCGACGGC
TGGGATCAGG ACGTCACCGG CATCGTCCGC TCGGCCTATG GCGAGGAGAC CAGCAACTTC
GGCATGAGCG ACCTGACCCT CGACGGCAAC CGCGACAACA CCAGCGGCAA GGTCGACGGC
TGGTTCAACG GCTACATTCC CGGCGAGGAC GGCGCCGACC GCGACGTGAC CCTGGAGCGG
GTGGAAATCC GTGAAATGTC CGGTTACGGT TTCGATCCGC ACGAGCAGAC CATCAACCTG
ACGATCCGCG ACAGCGTGGC CCACGACAAC GGCCTCGACG GCTTCGTCGC CGATTTCCAG
ATCGGCGGGG TGTTCGAGAA CAACGTCTCG TACAACAACG ACCGCCACGG CTTCAACATC
GTCACCAGCA CCAACGACTT CGTCCTGAGC AACAACGTCG CCTACGGCAA CGGCGGCGCC
GGCCTGGTGA TCCAGCGCGG CTCCTACGAC GTGGCTCACC CCTACGGCAT CCTGATCGAC
GGCGGCGCCT ACTACGACAA CGGCCTGGAA GGCGTGCAGA TCAAGATGGC CCACGACGTC
ACCCTGCAGA ACGCCGAGAT CTACGGCAAC GGCCTCTATG GGGTGCGCGT CTACGGCGCC
GAGGACGTGC AGATCCTCGA CAACTACATC CACGACAATT CGCAGAGCGG TTCCTACGCG
GAAATCCTCC TGCAGTCCTA CGACGATACC GCCGGGGTGT CCGGCAATTT CTACACCACC
ACCGGCACCT GGATCGAAGG CAACACCATC GTCGGCTCGG CCAACTCCAC CTACGGCATC
CAGGAGCGCG CCGACGGCAC CGACTACAGC AGCCTCTACG CCAACAGCGT CAGCAATGTG
CAGAGTGGCT CGGTGCGCCT CTACGGCACC AACTCCGTCG TCTCCGACCT GCCCGGCACC
GGCCAGCAGG CGACCCTCGA AGGCACGACC GGCAACGACA CGCTGACCGG CAGCGACGCC
CACGAGACGC TGCTCGGCCT GGACGGCGAT GACCGCCTGA ACGGCGGCGC CGGCAACGAC
ATCCTCGACG GCGGGGCGGG GCGCGACAAC CTGACCGGCG GCGCGGGCGC CGACCTGTTC
CGCGTCTCCG CGCGCACCGA CAGCTACCGC ACCGACAGCG CCAGCTTCAA CGACCTGATC
ACCGACTTCG ACGCCGACGA GGACAGCATC GACCTGTCGG CGTTGGGCTT CACCGGGCTG
GGCGACGGCT ACAACGGCAC CCTCGCCGTG GTGCTCAACA GCGCCGGGAC CCGCACCTAC
CTGAAGAGCT ATGAGGCGGA TGCCGAGGGC CGGCGTTTCG AGATCGCCCT GGACGGCAAC
TTCGCCGGCC TGCTCGACGA CGGCAACCTG ATCTTCGAGC GTCCCGTCAT CGAAGGGGAC
GCCGGCAACA ACGCCCTGCT CGGCACCTCG GCCGCCGAGA CGCTGCTCGG CCATGCCGGC
AACGATACGC TGGACGGCGC CGGCGGCGAC GACATCCTGG TCGGCGGTGC CGGGCGCGAC
ACCCTCACCG GTGGGGCCGG GGCCGACCTG TTCCGCTTCG ACGCGCTGTC CGACAGCCAG
CGCAACTACA CCACCGGCGA CAACCAGGGC GACCGCATCG TCGACTTCAG CGTGGGCGAA
GACAAGCTCG ACGTGTCGGC GCTGGGCTTC ACCGGGCTGG GCGACGGCTA CAACGGCACC
CTCGCCGTGG TGGTCAACAG CGCCGGCGAC CGCACCTACG TGAAAAGCTA CGAGAACGGC
GCCGACGGCT ACCGCTTCGA GTTTTCCCTC GACGGCAACT ATCTGGAGCT GCTCGGCAAC
GAGGATTTCA TCTTCGCCAC GCCCAGCGGC CAGCAACTCC TCGAAGGCAG CGCCGGCAAC
GACAGCCTGC AGGGCACGGC CGCCGACGAA ATCGTCCACG GTGGGGCAGG GCGCGACACC
CTGAGCGGCG GGGCCGGGGC CGACGTGTTC CGCTTCAGCG AACTGACCGA CAGCTACCGC
ACCGCGAGTA CCAGCTTCGC CGATCTGATT ACCGACTTCG ATCTGGCCGA CGACCGCATC
GACCTGTCCG GGCTCGGTTT CAGCGGCCTG GGCGACGGCT ACGACGGCAC CTTGGCCGTG
GTGGTCAACA GCACCGGCAC CCGCACCTAC CTGAAGAGCT ACGAGGCCAA CGCCGCCGGC
GAACGCTTCG AGATCGCCCT GGACGGCGAC CTGTCCGCGT TCACCGGGGC CAACCTGATC
CTCGACGAGC GCGTCGTGCT GGAAGGCAGC GACGGCAACG ACACGCTCGA CGGCGGCAGT
GCGGCCGAGG AATTGCTCGG CGGGGCCGGC AACGACAGCC TGGACGGCGG CGCCGGCAAC
GATATCCTCG ACGGCGGCGC CGGGCGCGAC ACTCTCACCG GCGGCAGCGG CGCCGACGTG
TTCCGCTACG ACGACGCACT CGACAGCTTC CGCAACTACG GCACCGGCGT GACCGGCACC
GACACCATCA CCGACTTCAC CCCCGGCGAG GATCTGATCG ACCTGTCCGC GCTCGGCTAC
ACCGGGCTGG GCGACGGTTA CAACGGCACC CTTGCCGTGG TGCTCAACGG CGACGGCACC
AGAACCTACC TGAAGGACCG CGAGAGCGAC GCCGAAGGCA ACCAGTTCGA GATCGCCCTG
GGCGGCGATC TCGTCGACCG GCTTGATGCG GGCGACTTCA TCTTTGCCGA GGCAGCCGCG
ACCACCGCGA TCGAGGTGGT CGGCGGTACG CCGACCGAGG AGCAGTTGGT TGCTTGA
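
The gene sequence is displayed in space-separated blocks of 10 bases. A minimal sketch of how such text could be joined into one contiguous string and its GC percentage computed is shown below. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the short inputs are toy examples rather than the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block separators, line breaks) and upper-case."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two blocks of the gene sequence, joined into one string.
print(clean_sequence("ATGGATTACA ACGTCAAAGA"))  # ATGGATTACAACGTCAAAGA

# Toy example: 6 of 10 bases are G or C.
print(gc_percent("GGCC AATT GG"))  # 60.0
```

Pasting the full gene sequence into `gc_percent` would give a value to compare against the GC content field of the record.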
 
Protein sequence
MDYNVKDFGA LGDGVSDDTA AIQAAIDAAY AAGGGTVYLP AGEYRVSGGE EPSDGCLTIK 
SNVYIVGAGM GETVIKLVDG WDQDVTGIVR SAYGEETSNF GMSDLTLDGN RDNTSGKVDG
WFNGYIPGED GADRDVTLER VEIREMSGYG FDPHEQTINL TIRDSVAHDN GLDGFVADFQ
IGGVFENNVS YNNDRHGFNI VTSTNDFVLS NNVAYGNGGA GLVIQRGSYD VAHPYGILID
GGAYYDNGLE GVQIKMAHDV TLQNAEIYGN GLYGVRVYGA EDVQILDNYI HDNSQSGSYA
EILLQSYDDT AGVSGNFYTT TGTWIEGNTI VGSANSTYGI QERADGTDYS SLYANSVSNV
QSGSVRLYGT NSVVSDLPGT GQQATLEGTT GNDTLTGSDA HETLLGLDGD DRLNGGAGND
ILDGGAGRDN LTGGAGADLF RVSARTDSYR TDSASFNDLI TDFDADEDSI DLSALGFTGL
GDGYNGTLAV VLNSAGTRTY LKSYEADAEG RRFEIALDGN FAGLLDDGNL IFERPVIEGD
AGNNALLGTS AAETLLGHAG NDTLDGAGGD DILVGGAGRD TLTGGAGADL FRFDALSDSQ
RNYTTGDNQG DRIVDFSVGE DKLDVSALGF TGLGDGYNGT LAVVVNSAGD RTYVKSYENG
ADGYRFEFSL DGNYLELLGN EDFIFATPSG QQLLEGSAGN DSLQGTAADE IVHGGAGRDT
LSGGAGADVF RFSELTDSYR TASTSFADLI TDFDLADDRI DLSGLGFSGL GDGYDGTLAV
VVNSTGTRTY LKSYEANAAG ERFEIALDGD LSAFTGANLI LDERVVLEGS DGNDTLDGGS
AAEELLGGAG NDSLDGGAGN DILDGGAGRD TLTGGSGADV FRYDDALDSF RNYGTGVTGT
DTITDFTPGE DLIDLSALGY TGLGDGYNGT LAVVLNGDGT RTYLKDRESD AEGNQFEIAL
GGDLVDRLDA GDFIFAEAAA TTAIEVVGGT PTEEQLVA
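
The relationship between the gene and protein sequences can be illustrated by translating codons directly. The sketch below builds a codon table from the standard NCBI base ordering (TCAG). Translation table 11 assigns the same amino acids to codons as the standard code and differs only in the start codons it permits, which does not matter here because the gene begins with ATG. The `translate` function is a hypothetical helper, not a library call.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text: str, to_stop: bool = True) -> str:
    """Translate a nucleotide string; whitespace is ignored, unknown codons become 'X'."""
    seq = "".join(text.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks (30 bases) of the gene sequence.
print(translate("ATGGATTACA ACGTCAAAGA TTTCGGGGCG"))  # MDYNVKDFGA
```

The output matches the first block of the protein sequence, MDYNVKDFGA.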