Gene Avin_45500 details

Gene Information

Locus tag: Avin_45500
Symbol: mdoH2
ID: 7763418
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Azotobacter vinelandii DJ
Kingdom: Bacteria
Replicon accession: NC_012560
Strand:
Start bp: 4616360
End bp: 4618960
Gene Length: 2601 bp
Protein Length: 866 aa
Translation table: 11
GC content: 70%
IMG OID: 643807398
Product: glucosyltransferase MdoH
Protein accession: YP_002801639
Protein GI: 226946566
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG2943] Membrane glycosyltransferase
TIGRFAM ID:
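
The coordinate and length fields above are related by simple arithmetic, which the following minimal Python sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable and function names are illustrative only and are not part of the record.

```python
# Values copied from the record above.
START_BP = 4616360
END_BP = 4618960


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming one trailing stop codon is excluded."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 2601
print(protein_length(length_bp))  # 866
```

Under those assumptions the computed values agree with the reported Gene Length (2601 bp) and Protein Length (866 aa).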


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00174116
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCAAAGA ATAATCCGAC CTTTCCTCAG GCTCCGCTGG ATGAGTATTT GCGCCATCTG 
TCCCTGGCCG CGGCGGAGCG CGGGCGTCTG ACCGATGCCG ACTCGCTCGC CGAACTGCAC
TGCCGACTGG CCGGCGGCGC CCGGGCGGGG CTCGGCGATG CCGCCCTGGC CTCGGCCTCC
CGGCGCCTGA GCGAAGGCTA CGGCGACGAA CCGGCCGCCA CCGGGCGGCT TGCCAGCGAT
GCCGCCGGCC GTGCCTGCCT GGCGGCCGCG CCGCCGATCC GCCGTACCCC GGTCGTCCCC
GAGCCTTGGG ACTGCAATCC GCTGCTACGC CTCTGGCGCC GCTTCACCGG CCGCCGCAAC
CCGTCGCCGC CGCGCGATCT GCCGCCGGTG AGCTGGCAGC GGGGCGCCTC GCGGCGCAGG
TTGCTTCTGT TGCTGCTGAT GCTGGGACCG ACCGCGCTGG CCACCTGGTA CATGCGCGAC
ATCCTGCCAT GCTGCGACTG GTCCTATTCC ACGATCGCCC AGTTCGTCGA AGGTACCCCG
CCGAGCCCTT GGCAGAGCCT GGGCACGGCG CTGCCGTACC TGCTCGAGGG CAGCGTGCTC
GGTCTCTTCG CGCTGCTCTT CGGCTGGGTC TCGGCCGGTT TCTGGACGGC GGTGATGGGC
TTCTGGGAGT TGTTGCGCGG GAGTGGGGGC CGGCACATTT CCGCCGCCGA GGCGGGGAGC
GGGCCGATTC CCGCCGAGGT ACGCACGGCC ATCGTCATGC CGATCTGCAA CGAGGACGTG
GCGCGGGCCT TCGCCGGCCT GCGGGCGACC TACGAGTCGC TGGCCGCCGG CAGCGAGCTG
GCGCGCTTCG ACTTCTTCAT CCTCAGCGAC AGCAACTCGG CCGAAATCGC CGCCGCCGAG
CAGCAGGCCT GGTTCGACCT GTGCCGCGAG ACCGGCGGCT CCGGGCGCAT CTTCTATCGC
CGGCGCCGGC GCCGGGTGAA ACGCAAGAGC GGCAACATCG ACGACTTCTG CCGGCGCTGG
GGCAGTCAGT ACCGCTACAT GGTGGTGATG GATGCCGACA GCGTGATGAG CGGCGACTGC
CTGGCCACGC TGGTACGGCT GATGGAGGCC AATCCCGAGG CCGGGATCAT CCAGACCGCG
CCCAAGGCTT CCGGCCGCGA CACCCTCTAT GCGCGCATGC AGCAGTTCGC CACCCGCGTC
TACGGCCCGC TGCTGACCGC CGGCATGCAT TTCTGGCAGC TCGGCGAGTC GCACTACTGG
GGGCACAACG CGATCATCCG CATCGAGCCG TTCATCGCGC ACTGCGCGCT GGCTCCGTTG
CCGGGGCGCG GGGTGTTCGC CGGGGCGATC CTTTCCCACG ACTTCGTCGA GGCCGCCCTG
ATGCGGCGCG CCGGCTGGGG CGTGTGGGTC GCCCACGACC TGCCGGGCAG CTACGAGGAG
CTGCCGCCGA ACCTGCTCGA CGAGCTCAAG CGCGACCGCC GCTGGTGCCA CGGCAACCTG
ATGAACTTCC GCCTGTTCCT GGTCCGCGGC ATACATCCGG TACACCGCGC GGTGTTCCTC
ACCGGGGTGA TGTCCTATCT GTCGGCGCCG CTGTGGTTCG CCTTCCTGCT GCTCTCCACC
GCGCTGATGG CCGCGCACCA GTTGCTGATG CCCCAGGATG CCGCCAGCGC GGCAGGGCCG
TTCCCGGCCT GGATGCCCTG GTATCCCCGG GAGGCGATGC TGCTGTTCTA CAGCACCCTG
ACCATGCTGG TGCTGCCCAA GCTGCTCAGC CTGGTACTGG TCTGGATCAA GGGTGTCGGC
CCCTATGGCG GGACGCTCCG GGTGACCTTG AGCGTACTGC TGGAGATGTT CTGCTCCGTC
CTGCTGACGC CGGTGCGGAT GTTCTTCCAT AGCTGCTTCG TGATCGGCGC CTTCCTCGGC
CGTTCGATCC AGTGGAAGTC GCCGCGCCGC GACGACGGCT CCACCTCCTG GGGCGAGGCG
CTGCGCCGCC ACGGCGTGCA GACCCTGATC GGCATCGTCT GGGCCCTGCT GGTGGCCTGG
CTGAGCCCGC ACTTTCTCTG GTGGCTGGCG CCGATCGTCG GTTCCCTGAT GCTGTCCATC
CCGGTGTCGG TGCTTTCCAG CCGGGTCGAC CTGGGCCGGC GCCTGCGCGC GCTGAAGCTG
TTCCTCATCC CCGAGGAGCA CGATGCTCCG CCCGAGCTGC GCGCCACCGA GCGCTATACC
CGGGAAAACC GCCAGCGGGC GCTGGGCGGC GCCTTCGTCC GGGCGGTGGT CGATCCGCTG
GACAACGCCC TGGCCTGCGC CATGGCCACG GCACGCCATG GCCGGTCGCT GGCGATCGAA
CGGCTGCGCG CGGAACGCGT GACCCATGCG CTGGCCGTCG GTCCCGAGCG GCTCGACGAC
CGGGCGCGAC TGGCCCTGCT CGGCGACCCG GTGGTCCTGG CGCGTCTGCA CCTGCAGCTC
TGGGAGGAGG GCTGGGAGAA CTGGCTGGCG CCCTGGCGGC GCTCGCTGGG CAGCCTGTAC
CGGAGCAGCG ACCGCCCGCT GGTCGTTCCT CCGGGCAGCC AGACGAAGCC GGCGGTCGAC
GGGGTGCGGA TGGCCGGGTG A
 
Protein sequence
MPKNNPTFPQ APLDEYLRHL SLAAAERGRL TDADSLAELH CRLAGGARAG LGDAALASAS 
RRLSEGYGDE PAATGRLASD AAGRACLAAA PPIRRTPVVP EPWDCNPLLR LWRRFTGRRN
PSPPRDLPPV SWQRGASRRR LLLLLLMLGP TALATWYMRD ILPCCDWSYS TIAQFVEGTP
PSPWQSLGTA LPYLLEGSVL GLFALLFGWV SAGFWTAVMG FWELLRGSGG RHISAAEAGS
GPIPAEVRTA IVMPICNEDV ARAFAGLRAT YESLAAGSEL ARFDFFILSD SNSAEIAAAE
QQAWFDLCRE TGGSGRIFYR RRRRRVKRKS GNIDDFCRRW GSQYRYMVVM DADSVMSGDC
LATLVRLMEA NPEAGIIQTA PKASGRDTLY ARMQQFATRV YGPLLTAGMH FWQLGESHYW
GHNAIIRIEP FIAHCALAPL PGRGVFAGAI LSHDFVEAAL MRRAGWGVWV AHDLPGSYEE
LPPNLLDELK RDRRWCHGNL MNFRLFLVRG IHPVHRAVFL TGVMSYLSAP LWFAFLLLST
ALMAAHQLLM PQDAASAAGP FPAWMPWYPR EAMLLFYSTL TMLVLPKLLS LVLVWIKGVG
PYGGTLRVTL SVLLEMFCSV LLTPVRMFFH SCFVIGAFLG RSIQWKSPRR DDGSTSWGEA
LRRHGVQTLI GIVWALLVAW LSPHFLWWLA PIVGSLMLSI PVSVLSSRVD LGRRLRALKL
FLIPEEHDAP PELRATERYT RENRQRALGG AFVRAVVDPL DNALACAMAT ARHGRSLAIE
RLRAERVTHA LAVGPERLDD RARLALLGDP VVLARLHLQL WEEGWENWLA PWRRSLGSLY
RSSDRPLVVP PGSQTKPAVD GVRMAG
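
Both sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before use. The sketch below shows one possible way to strip that layout, compute a GC fraction, and translate a nucleotide string. The helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical. The codon table is built from the standard amino-acid assignments, which translation table 11 shares with table 1 (the two differ in their permitted start codons, which this sketch does not model).

```python
from itertools import product

# Codon table in TCAG order; amino-acid assignments of the standard code,
# which are the same in translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence, as printed in the record.
first_block = clean_sequence("ATGCCAAAGA ATAATCCGAC CTTTCCTCAG")
print(translate(first_block))  # MPKNNPTFPQ
```

Applied to the full gene sequence, `translate(clean_sequence(...))` can be compared with the listed protein sequence, and `gc_fraction` with the reported GC content.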