Gene Avin_05090 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_05090 
SymbolbcsAB 
ID7759465 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp485252 
End bp489619 
Gene Length4368 bp 
Protein Length1455 aa 
Translation table11 
GC content68% 
IMG OID643803429 
ProductCellulose synthase subunit AB 
Protein accessionYP_002797737 
Protein GI226942664 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1215] Glycosyltransferases, probably involved in cell wall biogenesis 
TIGRFAM ID[TIGR03030] cellulose synthase catalytic subunit (UDP-forming) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCTCCCC CCTCCCGGAA AGCAGGGCTG TCGGCCTTCA TTCTGATACT CGCCGCCAGT 
CCGGCCCTGC TGCTGTTCGT CACCGTGCCG CTCGACATTT CCACGCAGGT GGGGATGGGC
GTGTGCATGA TCCTGCTCAT GCTGCTGCTG GGCCGGCTCC AGTCCTATAC CGTCACGCTG
ATCCTGATCG TCGTCTCGGT CGCCGTCTCC ACCCGCTACC TGTACTGGCG GACCACCCAG
ACACTGGTAT TCGACAATAG CCTGGAGGCC TTCCTGGGCA CGGGGCTCTA TCTGGCCGAA
GTCTACGTCT GGATGATCCT GACGCTGGGC TATTTCCAGA CCGTCATGCC GCTGAAACGG
GCGATCCGGC CGCTGCCGGA AGACCTCTCG CAATGGCCGA CGGTCGATGT CTACATTCCG
ACCTACAACG AAAGCCTCGA CGTCGTCCAG GACACCGTGC TGGCCGCGCA GAACATCGAC
TACCCGGCGG ACAAGCTGCG CGTCTACCTG CTCGACGACG GCCGGCGCCC CGAGTTCGGC
GCCTTCGCCG CGGCGGCGGG CGTCGGCTAC ATCACGAGGT CCGACAACCG GCACGCCAAG
GCCGGCAACC TGAACAACGC CCTGAGCCTG ACGGATGGCG AGCTGATCTG CATCTTCGAC
TGCGACCACG TCAGCACCCG CGCCTTCCTG CAGGCCACGG TCGGGGCGTT CCTGCAGGAC
GCCAGGCTGG CCCTGGTACA GACCCCGCAC CACTTCCACT CCCCCGACCC CTTCGAGCGC
AACCTCGCCA CCGGGCGGGA GGTTCCCAAC GAGGGCGAAC TGTTCTACGG TCCCGTACAA
CAGGGCAACG ACTTCTGGAA CGCGGCGTTC TTCTGCGGCT CCTGCGCGGT GATCCGCCGC
AGCGCCCTGG AGGAAATCGG CGGCTTCGCC GTGGAAACCG TCACCGAAGA CTCCCATACC
GCGCTCAAGC TGCAGCGCAG GGGCTGGAAC ACTGCCTTCC TGCCCGTTCC GCTGGCCGCC
GACCTGGCCA CCGAGCGCCT GGCCCTGCAT ATCGGCCAGC GGATGCGCTG GGCGCGCGGC
ATGACGCAGA TCTTCCGCCT GGACAACCCC CTGCTGGGCC GCGGTCTGCG CCTGCCGCAG
CGGCTCTGCT ATCTCAACGC CTTGCTGCAT TATCAGTTCG CCCTGCCGCG CATCGTCTTC
CTGACGGCGC CGACCGCCTA CCTGCTGCTC GGGCAGAACA TCATCGCCGC CCCCGCCACC
CTGATCTTCG CCTACATGCT CCCGCACATG GCCCACTCGA TCATGGCCAA CGCCAGGATC
CAGGGCCGGC ACCGCCATTC CTTCTGGGGC GAGCTGTACG AGACGGTGCT GGCCTTCCAT
ATCCTCAAGC CGACCCTGGT CACCCTGTTT TTCCCCAAGC GCGGCAAGTT CAACGTGACC
GACAAGGGCG AACTGCTCGA CCGGGGCTTC TTCGATTTCG CCCTGGTCAG GCCGCACCTG
CTCACTGCGG CGCTGCTGGC GACAGCCATC GCCGTGGGTC TGATCCGCTA CTTCTATATC
GATTGGTTTC CCATCGATGC CAAGGTGCTC GCGATGAGCA TCGGCTGGGC ACTGTTCAGC
CTGCTCCTTT TGCTGGCCGC CTCTGCGGTG GCAATGGAAA GCCGGCAACT GCGCAGCACC
ACCCGGCTGA CGCTGAAACT GCCGGTCGTC CTGCATTTCG ACAACGGACG CAGTTGGCAG
GGGGAAACCC TGGATATCTC CATGGGCGGC CTCAAGGTCG TCCCGGCGCA GACAGTGAAG
AAAATGCCTC AGCCGCCGGG CCAGCTCGAA TATATCGAGA TCGCCTGCAA TGGCCACACG
GCACTGTTCC CCGCACACCT CATGGGAGCC GGCAGGCGGG AGTTGCGTCT GAGTTTCAGC
CCGCTCGACA TCGATCAGCG GCGCGAACTG GTACGCATCG TCATGGGCCG GGCGGATGCC
TGGCTCGCCA GTACTCCGAA ACGGCTGGAT CGGCCCCTGC GCTCGCTCTG GAGCGTGGCG
AAAAACGCCC TGACGCTGCT CAATCCCGCC AGACGGCGGA AGGAGCCGGC GCCAGCGACC
GACGCCCGGA CCCCGCAGGA GCGCGCCGGC CAGGCGGTAC CGAACGCCTT CCTGCTGCTT
CTCCTGTGCG TCCTCTGCCT CATCGCGGCA GCGCCCCTTC TGGCCGGGGA ACTGGAACTC
CCGCCCCCCT CGGCAACCGC GCAGCCGCCC GCGGCGGCAG GCCTGATCGA GGTCCTGCGC
TTCGAGCAGT TGGGCATCGG GGACGGAACG ACGCTGCGCG GAGCACGGGC CGAAGTGGAC
ATCCCCTTTT CGATCGGTCG CCAGCAGATC GTCAGCGAAG CCCGGCTCGA CCTGCACGTC
CGTCACTCGG ACAAGCTGCC GGCCGACGCC CGCCTGGAGG TTCTGCTCAA CGGCGAGACG
CTCGACGACC TGCGCCTGAC CGCCGGCGAA CCGCTCGACG ACTTCGTCGA GATCGCCGTC
AACCCCCTGT TGCTGCTGCC CCACAACAGC CTGCGCCTCC GGTTGCGTAG CGGCGAACAG
CGCTGCGAGT CGCCCGAGCG CTCGCCGCTG CAGATCGCCA TCGGCAAGGA CTCCAGCCTC
TCCCTGGAAT TGCGCCGGCT GCCCCTGACG AACGATCTGG CGCTCCTGCC CGCCCCCTTC
TTCGACGAGG CGCAGCCCGG CGAGCTGCGC CTGCCCACGG TCCTGGCCGG GCAGCCCGGC
GCAGAAACCC TGCGCAGCGC CGCGCTGGCG GCCTCGCATT TCGGCGCGCT CGCCCGCTAC
CGGAGCCTGG ACTTTCCGGT GACGATCGGC ACGCTGCCGC TGGGCAACGC CCTGCTGTTC
GCCCTCGTCG AGCAGCGCAT CGACGGCCTG CAACTGCCGC CCATCGAGGG ACCGACGCTG
GCCATGCTGG ACAACCCCCG CGATCCGCAC GCCAAGCTGC TGCTGATCGC CGGGCGCACT
CCGGCCGAGC AGCGCGCGGC GGCGCTCGCC CTGGTGCTCG GCGCCAGCCG TCTGAAAGGC
GCCAGTATGC TGCTGGAAGA ACCATCCGCG CCGCCACGGC GCGCGCCCTA CGATGCGCCG
CGCTGGCTGC CCGCCGAGCG GCCGGTGCGC CTCGCCGAAC TGACCGACCA GCCCCTGGCG
AGCCAGAGCC CCACGCCGGC GGGGGTGCAC CTCAACTTCC ACGCCGCTCC GGACAACTTC
CTGTGGGGCG CGACGAATAT CCCCATGCAC CTGCGCTACC GCTTTCCCGA AGGCGACTGG
CTGGATGCCT CCCGCACCCA TCTGGACATC GCCCTCAACG GTCGCCACCT GGCCTCTCTG
CCAATGCTGA AAGGCGGTCC GCTGGAAAGA CTCAAGCACT ACCTCGGCCG GCAGACCCGC
CAGGAAGAGG CCAGGGTCGA GATACCCGCC TACCTGATCT ACGGCGACAA CCGCCTGGAC
TTCCATTTCA ACCTGCGCAC CAGGGACGAC CCGGACTGCT CGCTGGAGCT GCCCGAACAG
GCCCTGAGCC GGATCGACGG CGATTCCTCC ATCGACCTCA GCGGCACCCG GCACTTCGCC
CGGCTGCCCA ACCTGTCGTT CTTCGTCGGC GCCGGCTTCC CCTTCACGCG AATGGCCGAC
CTTTCCGAAA CGGCGGTACT GCTGCCGGGC ACGCCGCGGG AAGAGGAAAT CGAGGCGATG
CTCGGCCTGC TCGGGCGCTT CGGCGAAGCC ACCGGCTATC CGGCCCTGGG CGTCGAGGTG
CTCGCCGGCC CGGCCCGGCT GTCCTCGGTC GCCGGGCGCG ACCTGCTGGC GATCGGCCGG
CTCGACGGCG ACCTGGCCCT GGCGCCGCTG CTCGCCGGCT CGGACTTCCG CGTGGAGAAA
GGACAACTGC GCATCGCCCC GTGGACGCCG CTCGAACGGG TGCGCCGCTT CGTTCTGGGC
GACTGGGACA GCCAGGCGCC GGAGGCCGCG CGGCAACTCG CCGGCGACCA GCCGTTCCGC
GGGCTGCTCA GCCGCCGTTC GCCCTTCGAC CCGGCGCGGG CGCTGGTGCT GGTCCTGGCC
CGCGAGGCGC AATGGCTGCC GCAGATCGTC GAGAGCCTGC ACGCCCCCGA GGTCAGCGCG
GAAATCCGCG GCGACCTCAC CCATTTCGCG AGCGCCAGAC AGGTGCAGAG CTTCCGCGTC
GGGACGCAGT TCGCCTACGG CACCCTCCCC TGGCACATCC ACGTCCGCTG GCTGTTCAGC
GACCGTCCGG CGCTGCTGGC GACGCTATTG CTGGCCTCGG CGCTGCTCGT CGCGCTCGCC
TTGCAACCCC TCCTGCGTGC GCGTGCGGCA CGACGCCTGA GCGAATAG
 
Protein sequence
MPPPSRKAGL SAFILILAAS PALLLFVTVP LDISTQVGMG VCMILLMLLL GRLQSYTVTL 
ILIVVSVAVS TRYLYWRTTQ TLVFDNSLEA FLGTGLYLAE VYVWMILTLG YFQTVMPLKR
AIRPLPEDLS QWPTVDVYIP TYNESLDVVQ DTVLAAQNID YPADKLRVYL LDDGRRPEFG
AFAAAAGVGY ITRSDNRHAK AGNLNNALSL TDGELICIFD CDHVSTRAFL QATVGAFLQD
ARLALVQTPH HFHSPDPFER NLATGREVPN EGELFYGPVQ QGNDFWNAAF FCGSCAVIRR
SALEEIGGFA VETVTEDSHT ALKLQRRGWN TAFLPVPLAA DLATERLALH IGQRMRWARG
MTQIFRLDNP LLGRGLRLPQ RLCYLNALLH YQFALPRIVF LTAPTAYLLL GQNIIAAPAT
LIFAYMLPHM AHSIMANARI QGRHRHSFWG ELYETVLAFH ILKPTLVTLF FPKRGKFNVT
DKGELLDRGF FDFALVRPHL LTAALLATAI AVGLIRYFYI DWFPIDAKVL AMSIGWALFS
LLLLLAASAV AMESRQLRST TRLTLKLPVV LHFDNGRSWQ GETLDISMGG LKVVPAQTVK
KMPQPPGQLE YIEIACNGHT ALFPAHLMGA GRRELRLSFS PLDIDQRREL VRIVMGRADA
WLASTPKRLD RPLRSLWSVA KNALTLLNPA RRRKEPAPAT DARTPQERAG QAVPNAFLLL
LLCVLCLIAA APLLAGELEL PPPSATAQPP AAAGLIEVLR FEQLGIGDGT TLRGARAEVD
IPFSIGRQQI VSEARLDLHV RHSDKLPADA RLEVLLNGET LDDLRLTAGE PLDDFVEIAV
NPLLLLPHNS LRLRLRSGEQ RCESPERSPL QIAIGKDSSL SLELRRLPLT NDLALLPAPF
FDEAQPGELR LPTVLAGQPG AETLRSAALA ASHFGALARY RSLDFPVTIG TLPLGNALLF
ALVEQRIDGL QLPPIEGPTL AMLDNPRDPH AKLLLIAGRT PAEQRAAALA LVLGASRLKG
ASMLLEEPSA PPRRAPYDAP RWLPAERPVR LAELTDQPLA SQSPTPAGVH LNFHAAPDNF
LWGATNIPMH LRYRFPEGDW LDASRTHLDI ALNGRHLASL PMLKGGPLER LKHYLGRQTR
QEEARVEIPA YLIYGDNRLD FHFNLRTRDD PDCSLELPEQ ALSRIDGDSS IDLSGTRHFA
RLPNLSFFVG AGFPFTRMAD LSETAVLLPG TPREEEIEAM LGLLGRFGEA TGYPALGVEV
LAGPARLSSV AGRDLLAIGR LDGDLALAPL LAGSDFRVEK GQLRIAPWTP LERVRRFVLG
DWDSQAPEAA RQLAGDQPFR GLLSRRSPFD PARALVLVLA REAQWLPQIV ESLHAPEVSA
EIRGDLTHFA SARQVQSFRV GTQFAYGTLP WHIHVRWLFS DRPALLATLL LASALLVALA
LQPLLRARAA RRLSE