Gene Aazo_4126 details

Gene Information

Locus tag: Aazo_4126
Symbol:
ID: 9341931
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: 'Nostoc azollae' 0708
Kingdom: Bacteria
Replicon accession: NC_014248
Strand:
Start bp: 4194873
End bp: 4196789
Gene Length: 1917 bp
Protein Length: 638 aa
Translation table: 11
GC content: 46%
IMG OID:
Product: acetolactate synthase large subunit
Protein accession: YP_003722686
Protein GI: 298492509
COG category:
COG ID:
TIGRFAM ID:
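The length fields above can be cross-checked against the coordinates. The sketch below is only an illustration and assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4194873
end_bp = 4196789

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and yields no residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # 1917 bp
print(protein_length, "aa")  # 638 aa
```

Under these assumptions the computed values agree with the listed Gene Length and Protein Length.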


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGCGTTCAC AAAGTGTCTC CGCAGGAGAA TCGCCTTCTC AAATCAGTCT CCCACAATCT 
GAAAATAACA AAAAGTCTCC TATCTCTAAC TCGCCAATTG TCGCTCCTAC AAGGGCTTCT
GGCGGTTTTG CCTTGTTAGA TAGTCTCCTC CGTCACGGTG TTGAGTACAT TTTTGGTTAT
CCCGGTGGGG CAATTCTACC GATTTATGAT GACCTGTATA AAGTGGAGGC AAGCGGTATA
ATTAAGCATA TTCTTGTCAG ACATGAACAA GGTGCTTCCC ATGCGGCTGA CGGCTACGCC
CGCGCTACAG GTAAAGTAGG AGTGTGTTTT GGGACTTCTG GTCCTGGGGC AACTAACTTG
GTGACAGGTA TTGCCACAGC TTACATGGAT TCCATTCCGA TGATTGTGGT GACAGGACAA
GTACCACGAG CGGCAATTGG TACAGATGCT TTCCAAGAAA CGGATATCTA CGGTATTACT
CTGCCCATTG TCAAGCATTC TTATGTAGTC CGTGACCCCA AAGATATGGC GCGGATTGTG
GCTGAAGCCT TCCATATTGC CAATACAGGC AGACCAGGAC CAGTTTTGAT AGATGTTCCC
AAAGATGTGG CTTTGGAAGA ATTTGATTAT GTGCCTGTAG AACCTGGTTC AGTCAAGTTA
CCTGGATATC GTCCTACAGT TAAAGGTAAT CCCCGACAAA TTAATGCGGC GATTCAGTTG
ATTACGGAAA GTGGTAGACC CTTATTATAT GTTGGTGGGG GTGTGATCGC AGCGAATGCC
CACGCAGAAG TTAAACGTCT GGCAGAATTA TTTAATATCC CCGTCACCAC AACCCTCATG
GGTATCGGTG CATTTGATGA ACATCATCCC CTATCTTTAG GAATGTTGGG GATGCACGGT
ACTGCTTACG CTAATTTTGC GGTTACAGAT TGTGATTTGC TGATTTGTGT TGGTGCAAGA
TTTGATGACC GTGTAACAGG AAGATTAGGT GAATTTGCTT CCCGTGCTAA AGTCATTCAC
ATCGACATTG ACCCGGCAGA AGTTGGTAAA AACCGCGTTC CTGAAGTTCC TATCGTTGGC
GATGTCAAGA GTGTTCTAAC TGATTTACTC CGGCGATGTC AAGACGCAAC GGGAAAAACT
ACACCTAATC AAAATCAAGA ATGGTTAAAT CTAATTAACC GTTGGAAACA AGATTACCCC
TTGGTTGTGC CTCATCATGC TGACAGCATT TCTCCCCAAG AGGTAATTGT GGCAGTTGGG
AGTCAAGCAC CCAATGCTTT TTATACCACC GATGTTGGTC AACATCAAAT GTGGGCAGCA
CAATTCCTCA AAAATGGACC TAGACGCTGG ATTTCTAGCG CCGGTTTAGG AACAATGGGT
TTTGGTGTCC CTGCGGCTAT GGGTGCTAAA GTGGGCTTCC CTGATGAAGA AGTGATCTGT
ATTAGCGGTG ATGCCAGTTT CCAAATGTGC TTACAGGAAC TGGGAACTAT AGCACAGTAT
GGGATAAATA TCAAGACTGT AATTTTAAAT AACGGTTGGC AGGGAATGGT GCGTCAATGG
CAAGAAGCCT TTTATGGTGA ACGTTATTCC TGCTCAAATA TGGAAGTAGG GATGCCAGAT
ATTGAGCTGT TAGCACAGGC TTATGGGATC AAAGGGATGG TGATTAGCAG CCGGGAAGAA
TTGGCAGATA AAATTGCCGA AATGCTGGCA CACAATGGAC CGGTGATTGT CGATGTTCAT
GTTACCAGAG ATGAAAACTG CTATCCGATG GTAGCCCCTG GCAAGAGTAA CGCGCAGATG
TTTGGTTTGC CAAAACCTCC ACCCACAAAT ACAGATGAGC CAGTTGCTTG CAGTCATTGT
GGGACAAAAA ACTCGCCTAA CCATAACTTC TGTTCTGAGT GCGGCACTAA GTTGTAA
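
A GC content figure like the one listed for this gene can be recomputed from the sequence text. The following is a minimal sketch, assuming the sequence is pasted as shown here (blocks of ten bases separated by spaces and line breaks). The function name `gc_percent` is hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a whitespace-formatted sequence."""
    # Drop the spaces and line breaks that separate the ten-base blocks.
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Sample input: only the first line of the gene sequence from this record.
first_line = (
    "GTGCGTTCAC AAAGTGTCTC CGCAGGAGAA TCGCCTTCTC AAATCAGTCT CCCACAATCT"
)
print(round(gc_percent(first_line), 1))
```

Passing the full gene sequence instead of the excerpt would give a value to compare against the listed GC content.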
 
Protein sequence
MRSQSVSAGE SPSQISLPQS ENNKKSPISN SPIVAPTRAS GGFALLDSLL RHGVEYIFGY 
PGGAILPIYD DLYKVEASGI IKHILVRHEQ GASHAADGYA RATGKVGVCF GTSGPGATNL
VTGIATAYMD SIPMIVVTGQ VPRAAIGTDA FQETDIYGIT LPIVKHSYVV RDPKDMARIV
AEAFHIANTG RPGPVLIDVP KDVALEEFDY VPVEPGSVKL PGYRPTVKGN PRQINAAIQL
ITESGRPLLY VGGGVIAANA HAEVKRLAEL FNIPVTTTLM GIGAFDEHHP LSLGMLGMHG
TAYANFAVTD CDLLICVGAR FDDRVTGRLG EFASRAKVIH IDIDPAEVGK NRVPEVPIVG
DVKSVLTDLL RRCQDATGKT TPNQNQEWLN LINRWKQDYP LVVPHHADSI SPQEVIVAVG
SQAPNAFYTT DVGQHQMWAA QFLKNGPRRW ISSAGLGTMG FGVPAAMGAK VGFPDEEVIC
ISGDASFQMC LQELGTIAQY GINIKTVILN NGWQGMVRQW QEAFYGERYS CSNMEVGMPD
IELLAQAYGI KGMVISSREE LADKIAEMLA HNGPVIVDVH VTRDENCYPM VAPGKSNAQM
FGLPKPPPTN TDEPVACSHC GTKNSPNHNF CSECGTKL
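
The gene and protein sequences above can be checked against each other by translating the opening codons. This is a sketch under two assumptions: translation table 11 shares the standard codon assignments, and the initiating codon (GTG in this gene) is read as methionine. The names `CODON_TABLE` and `translate_cds` are hypothetical helpers, not part of any database tool.

```python
# Build the codon table in the usual T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate_cds(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        # Assumption: the first codon is a valid start and is read as M.
        residues.append("M" if pos == 0 else amino_acid)
    return "".join(residues)


# First three blocks of the gene sequence and first block of the protein.
gene_start = "GTGCGTTCAC AAAGTGTCTC CGCAGGAGAA"
protein_start = "MRSQSVSAGE"
print(translate_cds(gene_start) == protein_start)  # True
```

The same function could be run on the full gene sequence and compared with the full protein sequence.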