Gene Avin_20620 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_20620 
Symbol 
ID7760988 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp2052086 
End bp2053192 
Gene Length1107 bp 
Protein Length368 aa 
Translation table11 
GC content63% 
IMG OID643804959 
ProductGlycosyl transferase, family 2 
Protein accessionYP_002799240 
Protein GI226944167 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0463] Glycosyltransferases involved in cell wall biogenesis 
TIGRFAM ID[TIGR03469] hopene-associated glycosyltransferase HpnB 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.268109 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTCACCG TATTGTTCGC CGCCCTCCCC TTCATGATCT GGATGGGATT GTTGCTGGCC 
CCTTGGCGGC CATGGAGCAC CCGCGAACGG CTCGAGGTCG ACTCGTCGCC CATGCCGGCA
GCCGATCTCA GCGGGATTAC CGTACTGATT CCGGCACGCA ACGAAGCCGA AACGATCGGC
ACCATCCTGG CCACCCTGCA GAAGCAGGGA AACGGCTTGC AGGTGGTGGT CGTGGACGAT
CAATCCAGTG ACGCCACGGC GAGTATCGCC GCCGCCTACC CCCATACCCG CGTCGTAAGC
GGCCGCCCCT TGCCTGAGGG CTGGGCTGGC AAGCTGTGGG CGCTGGAACA GGGAAAGTCG
CAGGTGCACA CGGCAATGAC GCTTTTGCTC GATGCCGACA TCCAGCTTCG CCCCGGCCTG
CTGCCGGCAT TGCTGGAGCT CAAGCGGCGC GAAGGCCTAC ACTTCGTCTC GTTGATGGCG
GACTTGCGCC GTACCAGCTT TTGGGATCGC CTGCTGCTGC CAACGTTCGT CTATTACTTC
AAGCTGCTGT ATCCGTTTGC CCTGTCCAAT TCGCGTAGCA GACATGTCGC CGCGGCAGCG
GGCGGTTGTG TGCTGGTGGA TACTGAAGTC CTGCGGCATA TAGGTGCCTT CGCCAGCCTG
CGCAACGCCC TGATCGATGA CTGTACCTTG GCGAGGCAGG TCAAGCAGGC CGGTTACCGC
ATCTGGCTGG GCCTGAGCCG CGGCGTGGTG AGCCTGCGCC CTTACGGCAC CCTGGCATCC
ATCCACGACA TGGTGGCGCG CTCGGCCTTC ACTCAACTCG GCTATTCCGC ATGGTTGCTG
CTGGCCGTGA CGGTGATCTT CATCGTCGCC TATGGCGGGC CGTTCGCTCT GCTGGGCCTG
TCGCTCGCCC GACCATGGGC GCTGGCCGCC TGGGCAGCCA TGACGCTGAG CTACCTGCCG
ATCTTGCGCT ATTACCGCAT GTCCCCGCTC TGGGCCTTGT TACTGCCCAT CAGCGCAGCG
TTTTACCTTG GCATGACATG GAGCTCGGCC ATCCGTTATT GGCGCGGCGT ACGCTCACGT
TGGAAAGGAC GAGTCTACAG CCATTGA
 
Protein sequence
MLTVLFAALP FMIWMGLLLA PWRPWSTRER LEVDSSPMPA ADLSGITVLI PARNEAETIG 
TILATLQKQG NGLQVVVVDD QSSDATASIA AAYPHTRVVS GRPLPEGWAG KLWALEQGKS
QVHTAMTLLL DADIQLRPGL LPALLELKRR EGLHFVSLMA DLRRTSFWDR LLLPTFVYYF
KLLYPFALSN SRSRHVAAAA GGCVLVDTEV LRHIGAFASL RNALIDDCTL ARQVKQAGYR
IWLGLSRGVV SLRPYGTLAS IHDMVARSAF TQLGYSAWLL LAVTVIFIVA YGGPFALLGL
SLARPWALAA WAAMTLSYLP ILRYYRMSPL WALLLPISAA FYLGMTWSSA IRYWRGVRSR
WKGRVYSH