Gene Ava_0023 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_0023 
Symbol 
ID3678868 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp20934 
End bp22625 
Gene Length1692 bp 
Protein Length563 aa 
Translation table11 
GC content46% 
IMG OID637715350 
Productdihydroxy-acid dehydratase 
Protein accessionYP_320544 
Protein GI75906248 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism 
COG ID[COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase 
TIGRFAM ID[TIGR00110] dihydroxy-acid dehydratase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.393007 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGGAGA ATCTTAGAAG CAAGGCTATC ACACAAGGGG TGCAGCGATC GCCTAACAGA 
GCAATGCTGC GGGCTGTTGG TTTTCAGGAT GCAGATTTTA CCAAAGCCAT TGTCGGTGTT
GCCAATGGTT ACAGCACTAT TACCCCGTGT AATATGGGGA TAAATCAACT AGCACAAAGG
GCAGAAGCTG GTATAAATCG CGCTGGAGCG AAGCCACAAA TATTCGGTAC AATTACGATT
AGTGATGGGA TTTCGATGGG AACCGAAGGG ATGAAATATT CCCTGGTATC ACGAGAGGTA
ATTGCTGACT CCATTGAAAC CGTTTGTAAT GGGCAAAGTT TAGATGGGGT AATTGCCATT
GGTGGCTGTG ATAAAAATAT GCCAGGGGCA ATGATAGCGA TCGCTCGGAT GAACATCCCT
GCTATCTTTG TTTACGGTGG CACAATTAAA CCCGGACACT ACAACGGCAA AGATTTAACT
GTTGTTAGTT CTTTTGAAGC TGTCGGTGAG TACAGCGCTG GCAAAATCGA CGAAAATGAA
CTGTTAGCAG TAGAACGCAA TGCTTGTCCT GGTGCAGGTT CCTGCGGTGG GATGTACACA
GCAAATACTA TGTCCTCTGC TTTTGAAGCA CTGGGAATGA GTTTGCCCTA TTCGTCTACA
ATGGCAGCAG AAGACGACGA AAAAGCTGAT AGTACGGAAG AATCAGCCAA GGTATTAGTA
GAAGCAATTC GTCATCAGCT ATTACCCAGG CAGATTATCA CTCGTAAATC CATAGAGAAT
GCCATAGCAG TAATTATGGC GGTGGGAGGT TCCACCAATG CCGTGTTACA TTTTCTAGCG
ATCGCCCGTG CAGCTGGTGT AGAGTTAAAT CTAGACGACT TTGAAACTAT TCGTGGTCGT
GTCCCCGTTT TGTGCGACTT GAAACCAAGC GGTAGATATG TAGCTACAGA CCTGCACAAA
GCTGGTGGTA TACCCCAAGT CATGAAAATG TTACTTGTGC ATGGTTTACT CCACGGCGAC
TGTATAACCA TCACAGGTAA AACCATTGCC GAAGTTTTAG CAGATATCCC AGAAGAACCA
TCGCCTAATC AAGACGTGAT TCGTCCTTGG AATAAACCCA TGTATGCCCA AGGTCACTTG
GCTATACTCA AAGGTAATTT GGCTACAGAA GGCGCAGTCG CCAAAATTAC AGGTGTGAAA
AATCCTGTGA TTACCGGGCC AGCCAAAGTA TTTGAATCAG AAGAAGATTG TTTAGATGCA
ATTTTGGCAG GTAAGATTAA AGCCGGAGAC GTGATTGTCG TCCGTTACGA AGGCCCCAAA
GGCGGCCCTG GGATGCGAGA AATGTTAGCC CCCACCTCAG CTATTATCGG TGCAGGTTTA
GGTGATTCAG TGGGATTAAT TACCGATGGA CGCTTCTCCG GTGGTACTTA TGGGATGGTA
GTCGGACACG TTGCACCAGA AGCAGCCGTT GGTGGAGCGA TCGCACTGGT ACAAGAAGGT
GATAGCATCA CAATTGATGC CCATACCCGT TCTTTGCAGT TGAACATATC AGACGAAGAA
TTAGCCCATC GTCGTGCCAA CTGGCAACCC CGTCCCCCAC GTTACACTAA AGGCATACTC
GCAAAATACG CCAAGTTAGT AGCTTCTAGT AGTGTAGGTG CGGTCACCGA TTTAGACTTA
TTTAATGAAT AG
 
Protein sequence
MSENLRSKAI TQGVQRSPNR AMLRAVGFQD ADFTKAIVGV ANGYSTITPC NMGINQLAQR 
AEAGINRAGA KPQIFGTITI SDGISMGTEG MKYSLVSREV IADSIETVCN GQSLDGVIAI
GGCDKNMPGA MIAIARMNIP AIFVYGGTIK PGHYNGKDLT VVSSFEAVGE YSAGKIDENE
LLAVERNACP GAGSCGGMYT ANTMSSAFEA LGMSLPYSST MAAEDDEKAD STEESAKVLV
EAIRHQLLPR QIITRKSIEN AIAVIMAVGG STNAVLHFLA IARAAGVELN LDDFETIRGR
VPVLCDLKPS GRYVATDLHK AGGIPQVMKM LLVHGLLHGD CITITGKTIA EVLADIPEEP
SPNQDVIRPW NKPMYAQGHL AILKGNLATE GAVAKITGVK NPVITGPAKV FESEEDCLDA
ILAGKIKAGD VIVVRYEGPK GGPGMREMLA PTSAIIGAGL GDSVGLITDG RFSGGTYGMV
VGHVAPEAAV GGAIALVQEG DSITIDAHTR SLQLNISDEE LAHRRANWQP RPPRYTKGIL
AKYAKLVASS SVGAVTDLDL FNE