Gene Avin_30080 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_30080 
Symbol 
ID7761909 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp3115656 
End bp3117953 
Gene Length2298 bp 
Protein Length765 aa 
Translation table11 
GC content68% 
IMG OID643805881 
ProductPolysaccharide biosynthesis protein 
Protein accessionYP_002800149 
Protein GI226945076 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1086] Predicted nucleoside-diphosphate sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.819591 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAATCGC CACACAAGGA ATCTTCGCAA CCCATCCGCC AGCGCCTGCT GGACCTTTCC 
CATGCCCAGA AGCGGCTCAT CCAGGTCGGC GTCGACCTCC TGCTCGTCTG GCTGGCGCTC
TGGCTGGCGT TCTACATCCG CCTCGAAGAC ATGACCCCGA TCGAACCCCT CGGCGACCAC
GCCTGGCTGT TCACCGCCGC CCCGGCCGTC GCCCTGCCGA TCTTCGTGCG CCTGGGCATG
TACCGGGCGG TCATGCGTTA CCTGGGCAAC GAGGCTTTGC TGAGCATCGC CCGGGCGGTC
ACCCTGTCGG CGCTGCTGCT GGCCCTGGTC ATCCTCCTCC ACGGCAAGAC GAGTCCGCCG
ATCCCGCGCT CGGTGATCTT CAACTACTGG GGGCTGAGCC TGCTGTCGAT CGGCGGCCTG
CGCATCGCCA TGCGCCAGTA TTTCACCGGC GACTGGTTCA ACCTGCGCGA GCTGTCCTTC
CGGCGCCCGG AGAACGGCCC GCCACGGGTG GCCATCTACG GCGCGGGCTC GGCCGGCAAC
CAGCTCGCCC ACGCCCTGCT CATGGGCCAC GCGCTGCGTC CGGTGGCCTT CATCGACGAC
GATGCGCACC TGGCCGGCCG CACGATCGCC GGCCTGCCGG TCTACGCCCC CGACCGGCTG
GAGCGGATGC TCGGGGAAAC CGGCGCCACG GAGATCGTCC TGGCGATCCC CTCGGCCAGC
CGCAGCCGCC GGCGCAGGAT CCTCGAGATG CTGCAGGCCC ACCCGCTGCC GGTGCGCTCG
ATGCCCAGCA TCGCCAAGCT GGCCTGCGGC CGCCTCACGG TGAACGACCT GCAGGAAGTG
GACATCGCCG ACCTGCTCGG CCGCGATGCG CTGCCCCTGC GGCCCGAATT GCAGGAACGC
TGCATCCGCG GCCAGGTGGT GATGGTGACC GGTGCCGGCG GCTCGATCGG CGCGGAACTG
TGCCGGCAAA TCCTCGCCAA CGGACCGGCC ACGCTGATCC TCTTCGAGCA CTCGGAGTAC
AACCTGTACA GCATCCACGG CGAACTGGAA CGGAGAATCC ACCGGGAAGC CCCGGGCCTG
CGCCTGGTCC CCGTCCTCGG CTCGGTCCGC CACGGCAGCC GCCTGCTCGA CACGCTGCGC
CACTGGCGGG TCGACACGCT CTACCACGCC GCGGCCTACA AGCACGTGCC GCTGGTCGAA
GTGAACATCG GCGAAGGCGT GCTCAACAAC ACCTTCGGCA CCCTGCGCGC CGCCCAGGCA
GCGATACGCG CCGGAGTACG CAGCTTCGTG CTGATCTCGA CCGACAAGGC GGTACGCCCC
ACCAATGTGA TGGGCGGCAG CAAGCGCCTG GCCGAGATGG TGCTGCAGGC CTTCTCCGGG
GAGCGCGAGG TGGAGCTGTT CGATGAGCCC GGCCTGTCGC CGCAACCCAA CCGCACCCGT
TTCACCATGG TCCGCTTCGG CAACGTGCTG GGTTCGTCCG GCTCGGTGAT CCCGCTGTTC
CGCGAACAGA TCCGCAACGG CGGGCCGATC ACCGTCACCC ACCCGGAGAT CACCCGCTAC
TTCATGACCA TCCCCGAGGC GGCGCAACTG GTGATCCAGG CCGGCGCCAT GGGCGAAGGC
GGCGACGTCT TCGTGCTGGA CATGGGCGAG CCGGTGAAGA TCCTCGACCT GGCGGAGAAG
ATGGTGCGCC TGTCCGGCCT GTCGCTGCGC ACCAAGACCA GCCCGGACGG CGACATCGAG
ATCCGCTTCG TCGGCCTGCG TCCCGGCGAG AAGCTCTACG AGGAACTGCT GATCGGCGAC
GGCGCGACGC CGACCAGCCA CAGCCGGATC ATGCGCGCCC ACGAGGAGCA CCTGCCCTGG
CGGGAGCTGA AGCCGCGCCT CGAAGCCTTG GCCGGGGCGC TCGAAGCCGA CGATTTCCCG
CGCATCCGCG AGTTGCTGCA ACGCACCGTC AGCGGCTACC GGCCGAGCAG GGAAATCCTC
GGCCGGCCCC TGCAGGAACA TCCGGCCAGT TGTCAGCCCG TTCCGCCTGG ATGCACCTGG
TGGTTCCACA GGATGCCAGG CCGAACGGCG AAAACCCCTG ACGGGGCCTG TCCTGCCGTA
ATGGCCGCTG CGAGTCGACC CGGCCTCGAC AAAACTCTCC CGCGAATCAC GGACGAGGCG
ACATGGAGAC GACTCGATCT GCCCTGCCCT CGTTCGGCGG ACCGCTCTTT TCTCCTGGCC
GAAACCCTCG CCCGCCAGGC CGAGACCGAT CCAGCCGCCA CAAACATATT GACCGGAAAA
GAGCGACAGA ACAGATGA
 
Protein sequence
MKSPHKESSQ PIRQRLLDLS HAQKRLIQVG VDLLLVWLAL WLAFYIRLED MTPIEPLGDH 
AWLFTAAPAV ALPIFVRLGM YRAVMRYLGN EALLSIARAV TLSALLLALV ILLHGKTSPP
IPRSVIFNYW GLSLLSIGGL RIAMRQYFTG DWFNLRELSF RRPENGPPRV AIYGAGSAGN
QLAHALLMGH ALRPVAFIDD DAHLAGRTIA GLPVYAPDRL ERMLGETGAT EIVLAIPSAS
RSRRRRILEM LQAHPLPVRS MPSIAKLACG RLTVNDLQEV DIADLLGRDA LPLRPELQER
CIRGQVVMVT GAGGSIGAEL CRQILANGPA TLILFEHSEY NLYSIHGELE RRIHREAPGL
RLVPVLGSVR HGSRLLDTLR HWRVDTLYHA AAYKHVPLVE VNIGEGVLNN TFGTLRAAQA
AIRAGVRSFV LISTDKAVRP TNVMGGSKRL AEMVLQAFSG EREVELFDEP GLSPQPNRTR
FTMVRFGNVL GSSGSVIPLF REQIRNGGPI TVTHPEITRY FMTIPEAAQL VIQAGAMGEG
GDVFVLDMGE PVKILDLAEK MVRLSGLSLR TKTSPDGDIE IRFVGLRPGE KLYEELLIGD
GATPTSHSRI MRAHEEHLPW RELKPRLEAL AGALEADDFP RIRELLQRTV SGYRPSREIL
GRPLQEHPAS CQPVPPGCTW WFHRMPGRTA KTPDGACPAV MAAASRPGLD KTLPRITDEA
TWRRLDLPCP RSADRSFLLA ETLARQAETD PAATNILTGK ERQNR