Gene Avin_20650 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_20650 
Symbol 
ID7760991 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp2055211 
End bp2056371 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content62% 
IMG OID643804962 
ProductGlycosyl transferase, family 2 
Protein accessionYP_002799243 
Protein GI226944170 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1215] Glycosyltransferases, probably involved in cell wall biogenesis 
TIGRFAM ID[TIGR03472] hopanoid biosynthesis associated glycosyl transferase protein HpnI 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCGACCT ATCATTTTGT CAGCACCCTC CTGTCCTGGC TGGGTACCGG ACTGGCAGTC 
GTCACTGCCG GCTACGCCGT CGTTACCCTG GGCGCCGCGC TCAGAGGGAT TCGCGACCGG
GCCCCTGCGC TCGGTGCTGC CGATCGTACT CGGCCGGTCA GCATGCTCAA GCCCCTGCAT
GGCGCGGAGC CGCGGTTGTA CGAGAATTTG CGCGACTTCT GTCGGCAGAC CCATCCGGAC
TACCAGTTGA TATTCGGCGT ACGTGAAGCC GATGATCACG CCATCGCCGT GGTGCACAGA
CTGTGCGCGG AGTTCCCGCA CCTGGACATC GATCTGGTCA TCGATCCGCG TGTACACGGC
GCCAACCTGA AAGTCAGCAA CTTGCTGAAC ATGCTGCCGC TGGCCCGCCA TGACTGGCTG
GTGCTGGCCG ACAGCGACAT CAGCGTGCCG GCGGATTACC TGGTGCGGGT GACGGCGCCG
CTGGCAGATC CTGGCGTGGG TATCGTCACC TGTCTTTACT ACGGCGTGCC GCAGGAAAGC
TTCTGGTCGC GCCTGGGCGC TCTGTTCATC GACGATTGGT TTGCGCCCTC GGTCCGCTTG
TCGCATGTTT TCGGCTCCAC CCGTTTCGCC TTCGGTTCGA CCATCGCGCT GCGCCGCGAG
GTATTGCAGG CTATTGGTGG CTTCGAGGTC TTGCGTGATA CTCTGGCCGA CGATTTCTGG
TTGGGGGAAC TGACCCGGCG GGCCGGGTTG CGCACCGTGC TGTCGGATCT GCTGGTCGGT
ACCGAAGTGA GCGAAACCCG CCTGATCGAG CTGTGGACGC ATGAGTTGCG CTGGTTGCGC
ACGATCCGCG CGGTCGCGCC AACTGGTTTT GCGCTGAGCT TCGTCTGTTT CACTTGGCCG
GTGTCCCTGC TCGGCCTGGC GCTGAACCCT TCCATGTTGA ATGCCTGGCT CGTCGCGGTA
GCGGGTGGCG CGCGTGTTGC CCGCTTTTTC TTCGGCCAGA AAATCAGGCG TTCATCTGTG
TCCTGGTACG AGGTTCTGCT GACTCCGTTT CGTGACCTGC TGCTGTTGTT GGAGTGGGCC
ATGGCCCTGA CCAGTTGGCG AGTGGAGTGG CGTGGTCGGG TTTTGCATGC GTGCAAGGAT
GGGCCCATGC GTTATCTTTG A
 
Protein sequence
MPTYHFVSTL LSWLGTGLAV VTAGYAVVTL GAALRGIRDR APALGAADRT RPVSMLKPLH 
GAEPRLYENL RDFCRQTHPD YQLIFGVREA DDHAIAVVHR LCAEFPHLDI DLVIDPRVHG
ANLKVSNLLN MLPLARHDWL VLADSDISVP ADYLVRVTAP LADPGVGIVT CLYYGVPQES
FWSRLGALFI DDWFAPSVRL SHVFGSTRFA FGSTIALRRE VLQAIGGFEV LRDTLADDFW
LGELTRRAGL RTVLSDLLVG TEVSETRLIE LWTHELRWLR TIRAVAPTGF ALSFVCFTWP
VSLLGLALNP SMLNAWLVAV AGGARVARFF FGQKIRRSSV SWYEVLLTPF RDLLLLLEWA
MALTSWRVEW RGRVLHACKD GPMRYL