Gene Ava_0433 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_0433 
SymbolispG 
ID3682594 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp551923 
End bp553149 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content44% 
IMG OID637715762 
Product4-hydroxy-3-methylbut-2-en-1-yl diphosphate synthase 
Protein accessionYP_320954 
Protein GI75906658 
COG category[I] Lipid transport and metabolism 
COG ID[COG0821] Enzyme involved in the deoxyxylulose pathway of isoprenoid biosynthesis 
TIGRFAM ID[TIGR00612] 1-hydroxy-2-methyl-2-(E)-butenyl 4-diphosphate synthase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.000301466 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.208688 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAACTC TGCCGACACC CACAACATCC AGTAATACAG CCAACCAAAG CACATTTGAT 
ACGACAATCA AGCGTCGTAA AACCCGTCCG GTAAAAGTGG GTAATGTCAC CATCGGCGGT
GGCTACCCTG TGGTGGTGCA GTCGATGATT AACGAAGACA CTCTTGATAT CGACGGTTCC
GTAGCCGCTA TTCGGCGCTT GCACGAAATT GGCTGTGAAA TCGTCCGTGT CACAGTGCCA
AGCATAGCTC ACGCCGTAGC GTTGGCAGAA ATTAAACAAA AACTCATTAC AACTTACCAA
GATGTGCCAA TTGTTGCTGA CGTACACCAC AATGGGATGA AAATTGCCCT GGAAGTTGCC
AAACATATTG AAAAAGTACG GATAAACCCC GGCTTGTATG TGTTTGAGAA ACCCAACACC
AATAGAACTG AATATACTCA AGCCGAATTT GAAGAAATTG GCGAAAAAAT CCGCGAAACT
CTCGCACCCT TGGTAATTAC TCTGCGCGAC CAAGGTAAAG CCATGCGTAT TGGTGTCAAT
CATGGTTCTC TCGCTGAGAG AATGTTATTT ACCTACGGCG ATACTCCAGA AGGCATGGTG
GAATCAGCTT TAGAATTTAT TCGCATCTGT GAATCTCTAG ACTTCCGCAA CATAGTCATT
TCCATGAAAG CCTCACGAGT TCCCGTGATG GTAGCCGCCT ATCGCCTCAT GGCTAAACGC
ATGGATGATT TAGGCATGGA TTATCCCTTA CATTTAGGTG TCACCGAAGC TGGTGATGGT
GAATATGGAC GGATTAAATC CACAGCAGGT ATTGCCACAT TATTAGCTGA TGGTATTGGT
GATACTATTC GCGTCTCCCT AACAGAAGCA CCGGAAAAGG AAATTCCAGT CTGTTACAGC
ATTCTGCAAG CTTTAGGTTT GCGGAAAACA ATGGTGGAAT ATGTAGCTTG TCCTTCCTGC
GGTCGTACTT TATTTAACTT AGAAGAAGTA CTACATAAAG TACGTGAATC TACTAAACAC
CTCACAGGAC TAGACATTGC TGTTATGGGT TGCATTGTCA ATGGCCCAGG CGAGATGGCT
GATGCTGACT ACGGTTATGT AGGTAAAACT CCTGGTTACA TTTCTTTATA TCGTGGCAGA
GAAGAAATTA AAAAAGTTCC AGAAGATAAA GGCGTTGAGG AATTAATTAA CCTCATTAAA
GCTGATGGTC GCTGGGTAGA TCCTTAG
 
Protein sequence
MQTLPTPTTS SNTANQSTFD TTIKRRKTRP VKVGNVTIGG GYPVVVQSMI NEDTLDIDGS 
VAAIRRLHEI GCEIVRVTVP SIAHAVALAE IKQKLITTYQ DVPIVADVHH NGMKIALEVA
KHIEKVRINP GLYVFEKPNT NRTEYTQAEF EEIGEKIRET LAPLVITLRD QGKAMRIGVN
HGSLAERMLF TYGDTPEGMV ESALEFIRIC ESLDFRNIVI SMKASRVPVM VAAYRLMAKR
MDDLGMDYPL HLGVTEAGDG EYGRIKSTAG IATLLADGIG DTIRVSLTEA PEKEIPVCYS
ILQALGLRKT MVEYVACPSC GRTLFNLEEV LHKVRESTKH LTGLDIAVMG CIVNGPGEMA
DADYGYVGKT PGYISLYRGR EEIKKVPEDK GVEELINLIK ADGRWVDP