Gene Avin_42390 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_42390 
SymbolotsA 
ID7763115 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp4266209 
End bp4267612 
Gene Length1404 bp 
Protein Length467 aa 
Translation table11 
GC content66% 
IMG OID643807090 
Productalpha,alpha-trehalose-phosphate synthase (UDP-forming) 
Protein accessionYP_002801338 
Protein GI226946265 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0380] Trehalose-6-phosphate synthase 
TIGRFAM ID[TIGR02400] alpha,alpha-trehalose-phosphate synthase [UDP-forming] 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCCGTC TAGTCGTGAT TTCCAATCGG GTCGCTCCCA TCCGCGAAGG CAAGATCGCT 
GCCGGGGGAT TGGCCGTGGG CGTGTACGAT GCGTTGCGGC AGAATGGCGG CATCTGGTTC
GGCTGGAACG GCGAGGTCGG ACAGAAGCCG GAAACTGCCT GCGAGAGCGC CGGCAATATC
ACCTACGTCA CCCTGGGCCT CAGCAAGCCG GACTACAACG AATACTACCG CGGCTTCTCC
AACGCCACGC TCTGGCCGAT CTTCCACTAC CGCATCGATC TGGCGCGCTA CAGCCGCGAG
GAGTACCTGG GCTACCGGCG GGTCAACGCC ATGCTGGCGG AGAAGCTCAA GCCCCTGCTG
CGTCCCGACG ATATCCTCTG GGTCCACGAT TACCACCTGA TTCCCTTCGC CGCCGCCTGT
CGCCAACTGG GAATCGGCAA CCGCATCGGC TTCTTCCTGC ACATTCCCTT CCCGACGACG
GAGGTCCTCA CCGCGGTGCC CCCGCACAGG GACCTGTTCC AGACCCTCTG CGACTACGAC
CTGGTCGGCT TCCAGACCGA GAGCGACCGC ATGGCCTTTC AGGACTACGT CTGCCGCGAA
CTCGACGGGC TCATCGGCAC CGACGGCAGC CTGACCGCCT GCGGGCGGAA CTTCCGCGCC
GGCGTCTATC CCATCGGCGT CATGCCCGAC GATATCCGCC GTTTGGCCGA CTCCTACCGG
GGCCGTCGCC CCATGATCGG GCGGACCGCC GAAGGCAAGC TGCGCAAGAC CCTCATCAGC
GTCGACCGGC TGGACTACTC CAAGGGGCTG GTGGAACGCT TCCTGGCCTA CGAGCAGTTC
CTCGAACATT ACCCCGAGCA CCGGCGCAAC GTGGAACTCA TCCAGATCGC GCCGACCTCG
CGCACCGACG TGAAGACCTA CCGGGCCATC CGCAAGCAAC TGGAAACCGT GGCCGGGCAT
GTCAACGGAC GTCTGGCCGA TCTCGACTGG ATGCCGCTGC ACTATCTCAA CAAGAGCCTC
GAGCGGCGCA CCCTGATGGG CCTGTTCCGC ACCGCCAACG TCGGCCTGGT CACTCCGCTG
CGCGACGGCA TGAACCTGGT GGCCAAGGAA TACGTCGCCG CGCAGAATCC GGCCGATCCG
GGCGTGCTGG TGCTGTCGCG CTTCGCCGGC GCGGCCCACG AGCTGGGCGC GGCGCTGATC
GTCAACCCCT ACGACTGCCT GGGCATGGCC GAGGCCATGG ACCGCGCCCT GCGCATGCCG
CTGGAGGAGC GCAAGGAACG TTACGAGGAC ATGATGCGGG CCTTGCGTGC CGCCGACCTG
AACGCCTGGC GGGACAACTT TCTGCGCGAC CTGCGAACCT TCGGCCGGCA CCGGGTGGTG
ACGGCGAGCC GTGCCGCCGT CTGA
 
Protein sequence
MSRLVVISNR VAPIREGKIA AGGLAVGVYD ALRQNGGIWF GWNGEVGQKP ETACESAGNI 
TYVTLGLSKP DYNEYYRGFS NATLWPIFHY RIDLARYSRE EYLGYRRVNA MLAEKLKPLL
RPDDILWVHD YHLIPFAAAC RQLGIGNRIG FFLHIPFPTT EVLTAVPPHR DLFQTLCDYD
LVGFQTESDR MAFQDYVCRE LDGLIGTDGS LTACGRNFRA GVYPIGVMPD DIRRLADSYR
GRRPMIGRTA EGKLRKTLIS VDRLDYSKGL VERFLAYEQF LEHYPEHRRN VELIQIAPTS
RTDVKTYRAI RKQLETVAGH VNGRLADLDW MPLHYLNKSL ERRTLMGLFR TANVGLVTPL
RDGMNLVAKE YVAAQNPADP GVLVLSRFAG AAHELGAALI VNPYDCLGMA EAMDRALRMP
LEERKERYED MMRALRAADL NAWRDNFLRD LRTFGRHRVV TASRAAV