Gene Avin_20590 details


Gene Information

Locus tag: Avin_20590
Symbol: dxs-2
ID: 7760985
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Azotobacter vinelandii DJ
Kingdom: Bacteria
Replicon accession: NC_012560
Strand:
Start bp: 2048876
End bp: 2050777
Gene Length: 1902 bp
Protein Length: 633 aa
Translation table: 11
GC content: 71%
IMG OID: 643804956
Product: 1-deoxy-D-xylulose-5-phosphate synthase
Protein accession: YP_002799237
Protein GI: 226944164
COG category: [H] Coenzyme transport and metabolism; [I] Lipid transport and metabolism
COG ID: [COG1154] Deoxyxylulose-5-phosphate synthase
TIGRFAM ID: [TIGR00204] 1-deoxy-D-xylulose-5-phosphate synthase
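
The coordinate and length fields above agree with each other if Start bp and End bp are read as 1-based, inclusive positions and the gene length counts the stop codon. Both readings are assumptions, not something the record states. A minimal Python sketch of that check follows; the helper names are hypothetical and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive start/end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for a CDS, assuming its length includes the stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the stop codon


length = gene_length_bp(2048876, 2050777)
print(length)                     # 1902
print(protein_length_aa(length))  # 633
```

Both values match the Gene Length and Protein Length fields reported for Avin_20590.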


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCCAAGA CGTTCCACGA GATTCCCCGC GAGCGCCCCG CCACCCCCCT GCTGGACCGC 
GCCGCGACGC CGGACCGGCT GCGCCAGCTC GGCGAGGCGG AACTCGAAGT ACTGGCCGAC
GAACTGCGCC AGGAACTGCT CTACACCGTC GGGCAGACCG GCGGGCACTT CGGCGCCGGG
CTCGGCGTGA TCGAGCTGAC CATCGCCCTG CACTACGTCT TCGACACCCC CGGGGACCGC
CTGGTGTGGG ACGTCGGCCA CCAGGCCTAC CCGCACAAGA TTCTCACCGG CCGCCGCGAA
CGCATGCTCA CGTTGCGCCA GAAGGACGGC ATCGCCGCCT TCCCGCGCCG CTCGGAAAGC
CCCTACGACA CCTTCGGCGT CGGCCACTCC AGCACCTCCA TCGGCGCCGC GCTGGGCATG
GCCATCGCCG CGCGGCTGAA GGGCGAGCGG CGCCGCTGCA TCGCCGTGAT CGGCGACGGC
GCGCTGACCG CCGGGATGGC CTTCGAGGCG CTCAACCACG CCTCGGACGT GCAGGCCGAC
ATGCTGGTGG TGCTCAACGA CAACGACATG TCGATCTCCA AGAACGTCGG CGGGCTGTCC
AACTACCTGG CCAAGATCCT CTCCAGCCGC ACCTACGCGA GCATGCGCGA GGGCAGCAAG
AACATCCTCT CGCGCCTGCC CGGCGCCTGG GAGATCGCCC GGCGCACCGA GGAATACGCC
AAGGGCATGC TGGTGCCCGG CACCCTGTTC GAGGAGCTCG GCTGGAACTA CATCGGCCCG
ATCGACGGCC ACGACCTGCC GACCCTGCTC GCCACCCTGC GCAACATGCG CGACCTCAAG
GGGCCGCAGC TCCTCCACGT GGTGACCAAG AAGGGCAAGG GCTTCGCCCC GGCCGAGGCC
GATCCCATCG GCTACCACGC GATCACCAAG CTGGAGCCCG AGGGCAGCGC CCCGCGCCAG
CCCGGCCCTC CCCGATACTC CAGCGTCTTC GGCCGCTGGC TGTGCGACAT GGCCGCCGCC
GATCCGCGCC TGGCCGGCAT CACCCCGGCG ATGAAGGAGG GCTCCGACCT GGTCGCCTTC
AGCCAGCGCT TCCCGGAGCG CTACTTCGAC GTGGCCATCG CCGAGCAGCA CGCCGTGACC
CTCGCCGCCG GCATGGCCTG CGACGGCCTC AAGCCGGTGG TGGCGATCTA CTCGACCTTC
CTGCAGCGCG CCTACGACCA GTTGATCCAC GACGTCGCGG TGCAGAACCT CGACGTGCTG
TTCGCCATCG ACCGCGCCGG CCTGGTCGGC GAGGACGGCC CGACCCACGC CGGCAGCTTC
GACCTCTCCT ACCTGCGCTG CATCCCCGGC ATGCTGGTGA TGACCCCCAG CGACGAGAAC
GAGCTGCGCC GGCTGCTCAC CACCGGCTAC CTGTTCGAAG GCCCGGCGGC GGTGCGCTAC
CCGCGCGGCA GCGGACCGAA CGCGGCCCTC GAGCCGGGCC TGGAGCCCTT GCCGATCGGC
AAGGGCGTGC TGCGCCGCCG GAGCGGGAAG AGCGACGGCC CGCGGGTCGC CCTGCTGGTG
TTCGGCGTGC AGGTGGCCGA GGCGCTGCGG GTGGCCGGGA AGCTGGACGC CACGGTGGCC
GACATGCGCT TCGTCAAGCC GCTGGACGAG GAACTGGTAC GCGAACTGGC CGCCGGCCAC
GAGCTGCTGG TGACCGTCGA GGAGAACAGC ATCATGGGCG GCGCCGGCAG CGCGGTGGCG
GAATACCTCG CCGAGGCCGG CCTCCTCAGG CCGCTCCTCC ACCTGGGCCT GCCGGACTAT
TACGTGGAAC ACGCCCGGCC CGCGGAAATG CTCGCCGAAT GCGGCCTGGA CGCCGCTGGC
ATCGAGGCGG CGGTGTGCGA ACGGCTGAAT ATGCTGAGCT GA
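
The GC content field in the Gene Information section can in principle be recomputed from the gene sequence as the fraction of bases that are G or C. The sketch below assumes the sequence is pasted in the layout shown above, in whitespace-separated blocks of ten bases. `gc_content` is a hypothetical helper, and the short input is an illustrative string, not the gene itself.

```python
def gc_content(sequence: str) -> float:
    """Fraction of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


example = "ATGC GGCC"  # illustrative input only
print(f"{gc_content(example):.0%}")  # 75%
```

Passing the full gene sequence instead of `example` would give the value to compare against the reported GC content.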
 
Protein sequence
MPKTFHEIPR ERPATPLLDR AATPDRLRQL GEAELEVLAD ELRQELLYTV GQTGGHFGAG 
LGVIELTIAL HYVFDTPGDR LVWDVGHQAY PHKILTGRRE RMLTLRQKDG IAAFPRRSES
PYDTFGVGHS STSIGAALGM AIAARLKGER RRCIAVIGDG ALTAGMAFEA LNHASDVQAD
MLVVLNDNDM SISKNVGGLS NYLAKILSSR TYASMREGSK NILSRLPGAW EIARRTEEYA
KGMLVPGTLF EELGWNYIGP IDGHDLPTLL ATLRNMRDLK GPQLLHVVTK KGKGFAPAEA
DPIGYHAITK LEPEGSAPRQ PGPPRYSSVF GRWLCDMAAA DPRLAGITPA MKEGSDLVAF
SQRFPERYFD VAIAEQHAVT LAAGMACDGL KPVVAIYSTF LQRAYDQLIH DVAVQNLDVL
FAIDRAGLVG EDGPTHAGSF DLSYLRCIPG MLVMTPSDEN ELRRLLTTGY LFEGPAAVRY
PRGSGPNAAL EPGLEPLPIG KGVLRRRSGK SDGPRVALLV FGVQVAEALR VAGKLDATVA
DMRFVKPLDE ELVRELAAGH ELLVTVEENS IMGGAGSAVA EYLAEAGLLR PLLHLGLPDY
YVEHARPAEM LAECGLDAAG IEAAVCERLN MLS
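
The protein sequence can be related to the gene sequence by translating it codon by codon with translation table 11, whose amino acid assignments are the same as the standard code. A minimal sketch, assuming an ATG start and a CDS read in frame from its first base; `translate` and `CODON_TABLE` are hypothetical names, and alternative start codons permitted by table 11 are not handled.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G at each position) paired with the
# amino acids of translation table 11; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate an in-frame CDS, dropping one trailing stop codon if present."""
    bases = "".join(sequence.split()).upper()
    if len(bases) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(
        CODON_TABLE[bases[i:i + 3]] for i in range(0, len(bases), 3)
    )
    return protein[:-1] if protein.endswith("*") else protein


# First three blocks of the gene sequence listed above.
print(translate("ATGCCCAAGA CGTTCCACGA GATTCCCCGC"))  # MPKTFHEIPR
```

The output matches the first ten residues of the protein sequence.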