Gene Aazo_5028 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_5028 
Symbol 
ID9342836 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp5146633 
End bp5148036 
Gene Length1404 bp 
Protein Length467 aa 
Translation table11 
GC content45% 
IMG OID 
Product3-isopropylmalate dehydratase large subunit 
Protein accessionYP_003723260 
Protein GI298493083 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.44739 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCAAGG GAACTCTGTT TGACAAAGTT TGGGACTTCC ACACCGTTGG GACACTTCCA 
TCAGGACTAA CGCAACTATT TATTGGACTT CATCTCATCC ATGAAGTTAC CAGTCCCCAA
GCCTTTGCTA TGCTCAAAGA AAGGGGTTTA AAAGTTTTAT TTCCACAACG CACAGTAGCG
ACAGTTGATC ATATCGTTCC TACAGAGAAT CAAGCCCGTC CCTTTGTGGA CAGTATGGCC
GAAGAAATGA TCCAGGCTTT AGAAAAGAGT TCTCAAGAAA ATGACATAAC TTTTTACAAT
ATTGGTTCAG GAAATCAAGG TATAGTTCAC GTCATTGCCC CGGAACTGGG ACTAACTCAA
CCGGGAATGA CCATAGCTTG TGGAGATAGC CATACATCGA GTCATGGTGC CTTTGGTGCG
ATCGCATTTG GTATTGGTAC AAGCCAAGTT CGCGATGTTC TAGCCTCCCA AACCTTAGCA
TTATCTAAAC TCAAAGTCCG CAAAATCGAA GTTAACGGCA ACTTAAAACC TGGAGTTTAC
GCCAAAGATG TAATTTTACA CATCATTCGC ACATTAGGCG TAAAAGGTGG TGTAGGCTAC
GCTTACGAAT TTGCAGGAAC AACCCTTGCA AAAATGAACA TGGAAGAACG GATGACCGTT
TGCAACATGG CCATAGAAGG TGGTGCAAGA TGCGGTTACG TCAACCCCGA TCATATTACC
TACGACTATT TAAAAAATAG AGACTTCGCC CCTAAAGATG CCAATTGGGA ACAAGCCGTT
ACTTGGTGGG AATCCCTACG GAGTGATGCC GATGCTGAAT ATGATGATGT AGTACTATTT
AATGGCGAAT ACATTCCCCC CACAATCACA TGGGGAATTA CACCAGGTCA AGGAATTGGC
GTAGATCAAA AAGTTCCCAC AGCCGAAGAA CTCTTAGAAG AAGACCGCTT TGTAGCCCAA
GAAGCATATC GCTACATGGA CTTATACCCC GGTCAACCCA TCCAAGGAAC AAAAATTGAC
GTTTGCTTCA TAGGTAGCTG CACCAACGGA CGGATTAGCG ACTTACGAGA AGCTGCTAAA
ATTGCCCAAG GTCGCAAAGT AGCAGAGCAT GTGAAAGCTT TCGTTGTTCC CGGTTCAGAG
AGAGTCAAAA AAGAAGCCGA AGCCGAAGGA CTAGATAAAA TATTTCTCGC AGCCGGTTTT
GAATGGAGAG AACCAGGATG TTCCATGTGT TTAGCCATGA ACCCCGACAA ACTCCAAGGT
AGACAAATTA GCGCCTCCTC CTCCAACCGC AACTTTAAAG GAAGACAAGG TTCTGCTTCC
GGTCGTACCC TACTCATGAG TCCCGCAATG GTAGCTACAG CCGCTATTAA GGGGGAGGTG
TCCGACGTGC GCGAATTGCT TTAA
 
Protein sequence
MSKGTLFDKV WDFHTVGTLP SGLTQLFIGL HLIHEVTSPQ AFAMLKERGL KVLFPQRTVA 
TVDHIVPTEN QARPFVDSMA EEMIQALEKS SQENDITFYN IGSGNQGIVH VIAPELGLTQ
PGMTIACGDS HTSSHGAFGA IAFGIGTSQV RDVLASQTLA LSKLKVRKIE VNGNLKPGVY
AKDVILHIIR TLGVKGGVGY AYEFAGTTLA KMNMEERMTV CNMAIEGGAR CGYVNPDHIT
YDYLKNRDFA PKDANWEQAV TWWESLRSDA DAEYDDVVLF NGEYIPPTIT WGITPGQGIG
VDQKVPTAEE LLEEDRFVAQ EAYRYMDLYP GQPIQGTKID VCFIGSCTNG RISDLREAAK
IAQGRKVAEH VKAFVVPGSE RVKKEAEAEG LDKIFLAAGF EWREPGCSMC LAMNPDKLQG
RQISASSSNR NFKGRQGSAS GRTLLMSPAM VATAAIKGEV SDVRELL