Gene Ava_2639 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_2639 
Symbol 
ID3681941 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp3267459 
End bp3268529 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content44% 
IMG OID637717985 
ProductdTDP-glucose 4,6-dehydratase 
Protein accessionYP_323148 
Protein GI75908852 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1088] dTDP-D-glucose 4,6-dehydratase 
TIGRFAM ID[TIGR01181] dTDP-glucose 4,6-dehydratase 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value0.997652 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATACAAA ATGATACAGA ACTATTGACA ACATCTGGTG CGGCTGTACC TATTCTGATT 
ACGGGTGGTG CAGGCTTTAT TGGCTCCAAT TTCGTCCATC ATTGGTATGA ACAGTACCCA
GGCGATCGCA TAATTGTTTT GGATGCGCTC ACCTATGCAG GGAATCGCCA AAATTTAGCA
GATATAGAAG GAAAAGCAAA TTTAAGATTT GTCAAGGGAG ATATAGGTGA TCGCGCTCTC
ATTGATCAGC TACTAGAGGA AGAAAAGATT CAGGCGATCG CCCACTTTGC AGCTGAATCT
CACGTTGATC GCTCAATTGT CGCGCCAGAT GCTTTCATTC AGACCAATGT TGTAGGTACA
TTTACTTTAT TAGAAGCCTT TCGCCATCAC TGGACAAAAC AAGGCAAACC TGCTAACTAC
CGCTTTCTCC ACGTCTCTAC AGATGAAGTT TACGGCAGCC TTGAACTAGA TGATCCAGCT
TTTACAGAAA CAACTCCTTA CGCCCCCAAC AGTCCCTATT CCGCCTCTAA AGCAGGTAGT
GATCATCTAG CACGAGCTTA TTACCACACC TACGGTTTAC CAACCTTAAT TACAAATTGC
TCCAATAACT ACGGCCCCTA TCACTTCCCC GAAAAATTAA TTCCCCTAAT ATGCCTCAAT
ATTCTCTTAG GTAAACCTCT ACCTATCTAT GGAGATGGGT TAAATATCCG TGATTGGTTA
TATGTTGAAG ACCATTGTCG TGCTTTAGAT ATTGTCATTC ATCAGGGTAA ACCAGGAGAA
ACCTACAACA TTGGCGGTAA TAACGAAATC AAAAACATTG ACCTTGTTCA GATGATCTGT
GAGTTAATGG ACGAATTAGC CCCTGATTTA CCCGTCTCTC CCGCCAGTAA ACTCATTACC
TTCGTCAAAG ACCGCCCCGG ACACGATCGC CGTTATGCGA TCGATGCGAC AAAAATCAAA
ACAGAATTAG GTTGGGAACC CCAACAAACA ATCTCGACTG GATTACGCCA CACCATCCAG
TGGTATCTAA CTCATCGCCA TTGGTGGGAA GCACTTTTAC CAAAGGAGTA G
 
Protein sequence
MIQNDTELLT TSGAAVPILI TGGAGFIGSN FVHHWYEQYP GDRIIVLDAL TYAGNRQNLA 
DIEGKANLRF VKGDIGDRAL IDQLLEEEKI QAIAHFAAES HVDRSIVAPD AFIQTNVVGT
FTLLEAFRHH WTKQGKPANY RFLHVSTDEV YGSLELDDPA FTETTPYAPN SPYSASKAGS
DHLARAYYHT YGLPTLITNC SNNYGPYHFP EKLIPLICLN ILLGKPLPIY GDGLNIRDWL
YVEDHCRALD IVIHQGKPGE TYNIGGNNEI KNIDLVQMIC ELMDELAPDL PVSPASKLIT
FVKDRPGHDR RYAIDATKIK TELGWEPQQT ISTGLRHTIQ WYLTHRHWWE ALLPKE