Gene Avin_51180 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_51180 
SymbolalgE2 
ID7763963 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp5191069 
End bp5194065 
Gene Length2997 bp 
Protein Length998 aa 
Translation table11 
GC content68% 
IMG OID643807943 
ProductSecreted mannuronan C-5 epimerase 
Protein accessionYP_002802177 
Protein GI226947104 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG5434] Endopolygalacturonase 
TIGRFAM ID[TIGR03661] type 1 secretion C-terminal target domain (VC_A0849 subclass) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATTACA ACGTCAAAGA TTTCGGGGCG CTGGGCGACG GCGTCAGCGA CGATACGGCC 
GCCATCCAGG CGGCGATCGA TGCCGCCTAC GCGGCCGGCG GCGGCACCGT CTACCTGCCG
GCCGGCGAAT ACCGGGTCAG CGGCGGCGAG GAGCCTTCCG ATGGTTGCCT GACCATCAAG
AGTAATGTCC ATATCGTCGG CGCGGGGATG GGCGAGACGG TGATCAAACT GGTCGACGGC
TGGGACCAGG ACGTCACCGG CATCGTCCGC TCGGCCTACG GCGAGGAGAC CAGCAACTTC
GGCATGAGCG ACCTGACCCT CGACGGCAAC CGCGACAACA CCAGCGGCAA GGTCGACGGC
TGGTTCAACG GCTACATTCC CGGCGAGGAC GGCGCCGACC GCGACGTGAC CCTGGAGCGG
GTGGAAATCC GCGAAATGTC CGGTTACGGT TTCGACCCGC ACGAGCAGAC CGTCAACCTG
ACGATCCGCG ACAGCGTGGC CCACGATAAC GGCCTCGACG GCTTCGTCGC CGATTTCCAG
ATCGGCGGGG TGTTCGAGAA CAACGTCTCG TACAACAACG ACCGCCACGG CTTCAACGTG
GTCACCAGCA CCAACGACTT CGTCCTGAGC AACAACGTCG CCTACGGCAA CGGCGGCGCC
GGGCTGGTGG TGCAGCGCGG CTCGTCCGAC GTGGCGCACC CCTACGACAT CCTGATCGAC
GGCGGCGCCT ACTACGACAA CGGCCTGGAA GGCGTGCAGA TCAAGATGGC CCACGACGTC
ACCCTGCAGA ACGCCGAGAT CTACGGCAAC GGCCTATACG GGGTGCGCGT CTACGGCGCC
GAGGATGTGC AGATCCTCGA CAACTACATC CACGACAATT CGCAGAACGG TTCCTACGCG
GAAATCCTCC TGCAGTCCTA CGACGATACC GCCGGGGTGT CCGGCAATTT CTACACCACC
ACCGGCACCT GGATCGAAGG CAACACCATC GTCGGCTCGG CCAACTCCAC CTATGGCATC
CAGGAGCGCG ACGACGGCAC CGACTACAGC AGCCTCTACG CCAACAGCGT CAGCAATGTG
CAGAACGGCT CGGTGCGCCT CTACGGCGCC AACTCCGTCG TCTCCGACCT GCCCGGCACC
GGCCAGCAGG CGACCCTCGA AGGCACGGCC GGCAACGACA CGCTTGGCGG CAGCGACGCC
CACGAGACGC TGCTCGGGCT GGACGGCAAC GACCGCCTGA ACGGCGGCGC CGGCAACGAC
ATCCTCGACG GCGGCGCCGG GCGCGACAAC CTGACCGGCG GCGCGGGCGC CGACCTGTTC
CGCGTCTCCG CGCGCACCGA CAGCTACCGC ACCGACAGCG CCAGCTTCAA CGACCTGATC
ACCGACTTCG ACGCCAGCCA GGACCGCATC GACCTGTCCG CGCTGGGCTT CACCGGGCTG
GGCGACGGCT ATGACGGCAC CCTGCTGCTG CAGGTCAGTG CCGACGGCAG CCGCACCTAT
CTGAAGAGCC TGGAGGCGGA CGCCGAGGGG CGGCGTTTCG AGATCGCCCT GGACGGTAAC
TTCGCCGGCC TGCTCGGCGC CGGCAACCTG CTCTTCGAAC GCACCGCCAT CGAGGGGGAT
GCCGGCGACA ACGCCCTGCT CGGTACCTCG GCCGCCGAGA CATTGCTCGG CCATGCCGGC
AACGACACGC TCGACGGCGG GGCCGGCGAC GACATCCTGG TCGGCGGCGC CGGGCGCGAC
AGCCTCACCG GCGGCGCCGG AGCCGACGTG TTCCGCTTCG ACGCGCTGTC CGACAGCCAG
CGCAACTACG ACATCGGCGA CAACCAGGGC GACCGCATCG CCGACTTCGC GGTGGGCGAA
GACAAGCTCG ACGTATCGGC GCTGGGCTTC ACCGGGCTGG GCGACGGCTA CAACGGCACC
CTCGCCCTGG TACTCAACAG CGCCGGCGAC CGCACCTACG TGAAAAGCTA CGAGAACGGC
GCCGACGGCT ACCGCTTCGA GTTTTCCCTC GACGGCAACT ATCTGGAGCT GCTCGGCAAC
GAGGATTTCA TCTTCGCCAC GCCCAGCGGC CAGCAACTCC TCGAAGGCAG CGCCGGCAAC
GACAGCCTGC AGGGCACGGC CGCCGACGAG GTGATCCACG GCGGCGGCGG GCGCGACACG
CTGGCCGGCG GGGCCGGGGC CGACGTGTTC CGCTTCAGCG AACTGACCGA CAGCTACCGA
ACCGACAGTG CCAGCTATGC CGATCTGATC ACTGACTTCG ATGCCAGCGA GGATCGTATC
GACCTGTCCG GCCTCGGCTT CAGCGGTCTG GGCAACGGCT ACGGCGGTAC CCTGGCGCTG
CAGGTGAACA GCGCCGGCAC CCGCACCTAC CTGAAGAGCT TCGAGACGAA CGCCGCCGGC
GAGCGTTTCG AGATCGCCCT GGACGGCGAC CTGTCCGCGC TCGGCGGGGC CAACCTGATC
CTCGACGAGC GCGTCGTGCT GGCGGGCGGC GACGGCGACG ACACGCTTTC CGGCAGCAGC
GCGGCCGAGG AACTGCTCGG CGGGGCCGGC AACGACAGCC TGGACGGCGG CGCCGGCAAC
GACATCCTCG ACGGCGGGGC GGGGCGCGAC ACCCTGAGCG GCGGCAGCGG CAGCGACATC
TTCCGCTTCG GCGGCGCGCT CGACAGCTTC CGCAACTACG CCAGCGGGAC GAACGGCACC
GACAGCATCG TCGACTTCAC CCACGGCACC GACCTGATCG ACCTCTCCGC GCTCGGCTAT
ACCGGGCTGG GCGACGGCTA CAACGGTACC CTGGCGATAG TGCTGAACGA CGCCGGCACC
AAGACCTACC TGAAAAACCG TGAGAGCGAC GCCGAGGGCA ACCAGTTCGA GATCGCCCTG
GAGGGCAACC ACGCCGACCA GCTCGATGCG AGCGACTTCA TCTTCGCCAC GGCGGCCGCG
ACCACCGCGA TCGAGGTGGT CGGCGGCAGC GGCACCCAGA CCGATCAGCT CGCCTGA
 
Protein sequence
MDYNVKDFGA LGDGVSDDTA AIQAAIDAAY AAGGGTVYLP AGEYRVSGGE EPSDGCLTIK 
SNVHIVGAGM GETVIKLVDG WDQDVTGIVR SAYGEETSNF GMSDLTLDGN RDNTSGKVDG
WFNGYIPGED GADRDVTLER VEIREMSGYG FDPHEQTVNL TIRDSVAHDN GLDGFVADFQ
IGGVFENNVS YNNDRHGFNV VTSTNDFVLS NNVAYGNGGA GLVVQRGSSD VAHPYDILID
GGAYYDNGLE GVQIKMAHDV TLQNAEIYGN GLYGVRVYGA EDVQILDNYI HDNSQNGSYA
EILLQSYDDT AGVSGNFYTT TGTWIEGNTI VGSANSTYGI QERDDGTDYS SLYANSVSNV
QNGSVRLYGA NSVVSDLPGT GQQATLEGTA GNDTLGGSDA HETLLGLDGN DRLNGGAGND
ILDGGAGRDN LTGGAGADLF RVSARTDSYR TDSASFNDLI TDFDASQDRI DLSALGFTGL
GDGYDGTLLL QVSADGSRTY LKSLEADAEG RRFEIALDGN FAGLLGAGNL LFERTAIEGD
AGDNALLGTS AAETLLGHAG NDTLDGGAGD DILVGGAGRD SLTGGAGADV FRFDALSDSQ
RNYDIGDNQG DRIADFAVGE DKLDVSALGF TGLGDGYNGT LALVLNSAGD RTYVKSYENG
ADGYRFEFSL DGNYLELLGN EDFIFATPSG QQLLEGSAGN DSLQGTAADE VIHGGGGRDT
LAGGAGADVF RFSELTDSYR TDSASYADLI TDFDASEDRI DLSGLGFSGL GNGYGGTLAL
QVNSAGTRTY LKSFETNAAG ERFEIALDGD LSALGGANLI LDERVVLAGG DGDDTLSGSS
AAEELLGGAG NDSLDGGAGN DILDGGAGRD TLSGGSGSDI FRFGGALDSF RNYASGTNGT
DSIVDFTHGT DLIDLSALGY TGLGDGYNGT LAIVLNDAGT KTYLKNRESD AEGNQFEIAL
EGNHADQLDA SDFIFATAAA TTAIEVVGGS GTQTDQLA