Gene Aazo_4800 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_4800 
Symbol 
ID9342607 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp4906566 
End bp4908224 
Gene Length1659 bp 
Protein Length552 aa 
Translation table11 
GC content43% 
IMG OID 
Product2-isopropylmalate synthase/homocitrate synthase family protein 
Protein accessionYP_003723093 
Protein GI298492916 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCACAA CTCCCTCAAA TCAACTTTGG CTCTATGACA CTACTCTCCG GGATGGTACG 
CAACGGGAAG GACTATCAGT GTCTATAGAA GATAAGTTAC GCATTGCTCA CAGACTCGAT
GAATTAGGCA TACCCTTTAT TGAAGGTGGT TGGCCAGGAG CTAATCCTAA AGATGTTCAA
TTTTTCTGGC AACTCCAAGA AAATCCCCTC AAACAAGCAG AAATAGTTAC CTTTTGCTCA
ACTCGTCGTC CTCACTCTAC AGCAGCAGAA GAACCAATGC TGCAAGCCAT ACTGTCTGCG
GGTACTCGCT GGGTGACAAT TTTTGGTAAG TCTTGGGATT TCCACGTCAT TGAAGGACTC
AAGACCAGTT TAGAAGAAAA CTTAGCCATG ATCAGCGATA CAATTGCATA TCTCCGTTCT
CAAGGACGGC GTGTGATTTA TGATGCTGAA CATTGGTTTG ATGGCTACAA ACAAAATCCT
GACTATGCTC TCCAGACAAT AAAGGCAGCC GCAACAGTAG GGGCCGAGTG GTTAGTACTA
TGTGATACTA ATGGTGGTAC TTTACCTCAT GAAATTACGC GGATCGTTAA AAATGTTGTG
TTGGCAACTG GGGACTGGGA ACTGGGGACT GGGAATACAG AAAAAACTCT GACCCAATCT
CCACTTCCCC AAATCGGAAT TCACACCCAT AATGATTCAG AGATGGCGGT TGCTAATGCC
CTAGCAGCAG TCATGGCAGG AGCGAAGATG GTGCAGGGTA CAATCAATGG TTATGGGGAA
CGCTGTGGTA ATGCTAACCT GTGTTCTGTG ATTCCCAATT TACAGTTAAA GCTTGATTAT
AGCTGTATCG GTGAACACCA GCTAAATCAA CTTACAGAAA CTAGTCGGTT TGTCAGCGAA
GTTGTCAATC TTGCGCCTGA TGAACACGCG CCTTATGTGG GACGTTCTGC TTTTGCTCAC
AAAGGTGGTA TTCATGTATC TGCGGTAGAA CGTAATCCTT TTACTTATGA ACATATTCAG
CCGGAACAAG TGGGGAATCG TCGCCGCATT GTTATTTCTG AACAGTCTGG TTTAAGTAAT
GTCATAGCTA AAGCTCGGTC TTTGGGAATG GAATTAGATA AAAATGATCC CCAAGCCCGG
CAAATTCTTC AACGTATGAA AGAATTGGAG AGTGAAGGAT ATCAATTTGA AGCTGCAGAA
GCCAGTTTTG CTCTATTAAT GTATGAAGCT TTGGGACTGC GAGAACAGTT TTTTGAAGTT
AAAGGTTTTC AAGTCCACTG TGATTTAGTA GAGGTGAAAG AAACTACTAA TTCTTTAGCA
ACGGTGAAAG TAGGTGTAAA TGGTATAAAT ATTCTTGAAG CAGCAGAAGG TAATGGACCA
GTGGCGGCTT TAGATGATGC TTTACGTAAA GCTTTGGTGA ACTTTTATCC GCAAGTTGCC
GACTTTGAGT TGACAGATTA TAAAGTCAGA ATTCTCAACG GAAATACGGG TACAGCAGCA
AAAACCCGTG CTTTGGTAGA ATCGGGAAAT GGTCAACAAC GTTGGACAAC TGTAGGAGTT
TCTACAAATA TTTTGGAGGC TTCCTATCAA GCTGTAGTTG AAGGTTTGGA ATATGGTTTG
TTGTTGAATT TCCAAGCTGA AAAGGCTCTG AAAGTTTAG
 
Protein sequence
MTTTPSNQLW LYDTTLRDGT QREGLSVSIE DKLRIAHRLD ELGIPFIEGG WPGANPKDVQ 
FFWQLQENPL KQAEIVTFCS TRRPHSTAAE EPMLQAILSA GTRWVTIFGK SWDFHVIEGL
KTSLEENLAM ISDTIAYLRS QGRRVIYDAE HWFDGYKQNP DYALQTIKAA ATVGAEWLVL
CDTNGGTLPH EITRIVKNVV LATGDWELGT GNTEKTLTQS PLPQIGIHTH NDSEMAVANA
LAAVMAGAKM VQGTINGYGE RCGNANLCSV IPNLQLKLDY SCIGEHQLNQ LTETSRFVSE
VVNLAPDEHA PYVGRSAFAH KGGIHVSAVE RNPFTYEHIQ PEQVGNRRRI VISEQSGLSN
VIAKARSLGM ELDKNDPQAR QILQRMKELE SEGYQFEAAE ASFALLMYEA LGLREQFFEV
KGFQVHCDLV EVKETTNSLA TVKVGVNGIN ILEAAEGNGP VAALDDALRK ALVNFYPQVA
DFELTDYKVR ILNGNTGTAA KTRALVESGN GQQRWTTVGV STNILEASYQ AVVEGLEYGL
LLNFQAEKAL KV