Gene Aazo_1698 details

Gene Information

Locus tag: Aazo_1698
Symbol:
ID: 9339491
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: 'Nostoc azollae' 0708
Kingdom: Bacteria
Replicon accession: NC_014248
Strand:
Start bp: 1762135
End bp: 1765161
Gene Length: 3027 bp
Protein Length: 1008 aa
Translation table: 11
GC content: 41%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003720972
Protein GI: 298490795
COG category:
COG ID:
TIGRFAM ID:
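
The length fields can be cross-checked against the coordinates. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the unspliced CDS ends in a single stop codon. The function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS ending in one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(1762135, 1765161)
print(length, protein_length(length))
```

For this record the two values come out as 3027 and 1008, which agrees with the Gene Length and Protein Length fields.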


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.967311
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAACAT TATTACTAGA TAGCCAGCAC ACCTTAGTAC AATGGGTAAG CCAAGCAACA 
GGAATCAACA CTTTCGGGGT GAAAGTCCGG TTGTTGGGAA ATGACCTACA TATCCTTTGT
GAAGATACCG ACTGTCCACA ACGTTGGCTA ACTCTGTCTG ACTTGCTACA AGCACTACAG
CAAACAGATT TGGATAGCTT AACCAGTGAA GAACAATCCC CAATATACCA AGTATTGGTT
TACGGGCGAA AAAAAGGAGA ACATCGTCCT CAATGGTGTC ATCGGGTCTA CTTGAATCAG
TTAGATCGGC ATCTAGAGCA ACTACAGCAG GCACTGTTAA CAGAAGCAGC CAAATCTAAA
CCCATTGGGG TAAGTGGAGC TCTGATTGTC TCTAATGAAA GTTTGGCACG TCAGGGCAAC
CCGGATGCTA TTGCCCGCTA TCTTAGCGAA AACCTCAGTC AATTAGGTGT GGCAGTCCAA
GTTAAGAATC GACCCTATGA TTCCAAAGCT AACTCTCAAA AAATAGGTAA TCGTCTGTGG
ATTTTTTGTC AGTCCAGTTA TAGCCCTGAT GCGACCTTAT TAGCAGAACC AATAGCCCAA
CAGTTGAGAC ATCTCAAGCT TTTTGGCTAT GAGGATGCAG TCATTGTTTC TCAAGTCCGG
GGTGAGCATG ATCCTGATTG GCGGTTGCGC GTGGATTTGA CACCACCAGA AACAATGCTC
AAAGAATGGG CGCGTTGGGG AGATGTGCAA GCGATCACTC TATTATTAAC TGAGGTCTTG
TCTGATTTTC AAATTTCCGT CCAAGCTACT CTTAAAGAAT TTACTCTGCA CATTTTTTGC
ACCCCAGCAG TTGACTTGTT AAAAACTGGT CCTGATCCTG ACAAGGAAAC TTGTTTACCA
GTTATTCTGT CTCAGCTAGA AGCGATCGCA CCTCAAGGTA TTCTCGCCGC TGCTATCTAT
GGACAAAAAA CAGGTGATTT ACAACCCGCT TGGGTAGATT GGGCATCTCT ACCTGCTTCA
ATACATCCGG CTTTAGCAAC ATCGACACTA GCATTAGGTA GTTCTGGTGA CGAACCAGCC
CTAATTTTCT TACTAGAACG TTTGCTCAAT CATGACATGA ATTGGCGGTT AAAAACAGGT
GGTATTCGCG TTTTACTACT TCGCAAAGGT GATTTACTCC AGGTCATGTG TGATGCACCC
AATTGTCCCA CACGCCAGCA AGTAGCCACC AAAGTCAGCG AATTTATTCG CCAACTTAAC
ATTATTGGTA TTGTTGGTGT GAGAGTTTAC GGTCGTCGTG CCGGTGATAA TGAACCTGTT
TGGCATCACG GACTCGATTT TGAACAACGC CACCGCTTAG TGCCAGAAGC AACACCAGAA
TTTGCCGCTA CTGCTCAATA TGTTGATGAT TTGCTCACCA ACGAAACCGA CGAACCAATT
GTGCGCCCTG ATTTAACAAC CCAAGAAGTT CAAAGTTTCG TGACTGAAGC TGCACAACAT
TGGGTAGAAA GGGCTCACAA ACAAGTCAAA AATTTGTTAG TGGGAACTCA ACTGTTTGCA
GAAACTAACC AATCAATTGA ACAAAACCCT AATGAACAAG GAGTAGCGGG AAGTTTAATT
TGGGTAGCTT TGGGATTAAT GCTCACACTC CAAAGTGATT GGATGTTGGG TTACGTCATT
ACTAGTATTC AGAATTCACC CAAAGGTGCT AGTATTTTAT CCCGATCATC CTCACAGACC
CAGGTATCTT TGACATTAGG AGCAGAACCG AAAAGCACCG CATTTTTCAC CAGTAGTAAC
AATATAAAGT CTTCTGAGTC CAGTAATTCT GCCTTTAATG CTTCTGGTTT TACCCAAAAT
GATAGGGCTG GAAATTTACC AGATGCACCA TTGAAACAAA AAGCTAATTC CACTGCTATT
CTTTTAGCAG CACGCTCTCA AATGCCAAGC TTTGGAGCAA GGCAACTAGA CGAACAATTA
GCACTTTATA AAAAACGTTT AGCAACAACA GGCAAACCAC CAGATATTTT AATTATTGGT
TCATCTCGTG CCTTGCGAGG AGTTGATCCT GTTGCCCTTT GTAAATCCTT GTCAACTCAA
GGTTATGGCG AAATTGATGT CTTTAACTTT GGTATTAATG GTGCTACAGC CCAAGTTGTA
GACTTGATTA TTCGCCAAGT TTTACAACCA TCAGAATTAC CAAGAATTAT TATTTGGGCA
GATGGTTCTA GGGCTTTTAA CAGTGGTCGT GAAGATATCA CCTTTAACTC TATTGCCGCA
TCAAAAGGCT ATCAAGAACT TTTAAAAAGA GCTACAGAAC CAGAAAATAA CAATAAATTA
TCCCAAGCAA CAACAAACAA TAAAACAGAA GATAAAAAAC TCACCAACAA AACACCAGAT
ATTAATAGCT ACGAAGCTGC TAATGAATGG TTAAATCAAA TTTTAGTCGG TAGATCTGCT
ACCTACAGAA ACCGTGATCA CATTAAAACT CTATTGCAAA AACAATTACA CTATCTTCCA
TTTAGTAAAG ACTTTCAGCC AGTTAAATCA AACAACAAAT TAAGAAACGA TCCACAAGAA
GATAATACTC AATTATCAGT TGATTTTGAT GGCTTTTTAC CCCTAACTAT TCGTTTCAAT
CCGACCACAT ATTATCAAAA ACATCCCAAA GTATCTGGAG GTTACGACAA CGATTATAAA
TCTTTTCAAT TAATAGGAAA CCAAGACGTT GCTTTCAGAT CAGTAATTGA GTTTACAAAG
ACTCAACAAA TACCAATCGT GTTTGTGAAT ATGCCCCTCA CCACAGATTA TCTAGATCCA
GCGCGGACGA AATATGAACA GCAATTTCAA CAATATATGT TAAATTTTTC CACTACTAAT
GCCAAGTTTA TCTACCGAGA CTTAAGCCAA ATCTGGCTCA AAGGACATGA ATACTTTTCT
GATCCTAGTC ATCTTAACCG CTATGGAGCT TATGAAATCT CCAAAAAGCT GGCTATTGAT
CCCATGATTC CCTGGACTCT TAAATAA
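
The gene sequence is printed in blocks of 10 bases separated by spaces, so the whitespace has to be removed before any calculation. The sketch below shows one way to do that and to recompute a GC percentage. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first line of the listing is used here; the full listing would need to be pasted in to compare against the reported GC content.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence only; paste the full listing to check the whole gene.
first_line = "ATGAAAACAT TATTACTAGA TAGCCAGCAC ACCTTAGTAC AATGGGTAAG CCAAGCAACA"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```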
 
Protein sequence
MKTLLLDSQH TLVQWVSQAT GINTFGVKVR LLGNDLHILC EDTDCPQRWL TLSDLLQALQ 
QTDLDSLTSE EQSPIYQVLV YGRKKGEHRP QWCHRVYLNQ LDRHLEQLQQ ALLTEAAKSK
PIGVSGALIV SNESLARQGN PDAIARYLSE NLSQLGVAVQ VKNRPYDSKA NSQKIGNRLW
IFCQSSYSPD ATLLAEPIAQ QLRHLKLFGY EDAVIVSQVR GEHDPDWRLR VDLTPPETML
KEWARWGDVQ AITLLLTEVL SDFQISVQAT LKEFTLHIFC TPAVDLLKTG PDPDKETCLP
VILSQLEAIA PQGILAAAIY GQKTGDLQPA WVDWASLPAS IHPALATSTL ALGSSGDEPA
LIFLLERLLN HDMNWRLKTG GIRVLLLRKG DLLQVMCDAP NCPTRQQVAT KVSEFIRQLN
IIGIVGVRVY GRRAGDNEPV WHHGLDFEQR HRLVPEATPE FAATAQYVDD LLTNETDEPI
VRPDLTTQEV QSFVTEAAQH WVERAHKQVK NLLVGTQLFA ETNQSIEQNP NEQGVAGSLI
WVALGLMLTL QSDWMLGYVI TSIQNSPKGA SILSRSSSQT QVSLTLGAEP KSTAFFTSSN
NIKSSESSNS AFNASGFTQN DRAGNLPDAP LKQKANSTAI LLAARSQMPS FGARQLDEQL
ALYKKRLATT GKPPDILIIG SSRALRGVDP VALCKSLSTQ GYGEIDVFNF GINGATAQVV
DLIIRQVLQP SELPRIIIWA DGSRAFNSGR EDITFNSIAA SKGYQELLKR ATEPENNNKL
SQATTNNKTE DKKLTNKTPD INSYEAANEW LNQILVGRSA TYRNRDHIKT LLQKQLHYLP
FSKDFQPVKS NNKLRNDPQE DNTQLSVDFD GFLPLTIRFN PTTYYQKHPK VSGGYDNDYK
SFQLIGNQDV AFRSVIEFTK TQQIPIVFVN MPLTTDYLDP ARTKYEQQFQ QYMLNFSTTN
AKFIYRDLSQ IWLKGHEYFS DPSHLNRYGA YEISKKLAID PMIPWTLK
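
The protein sequence should be the translation of the gene sequence under translation table 11. A minimal sketch of that check follows, assuming the standard codon assignments, which table 11 shares for elongation (it differs mainly in which start codons are permitted). The `translate` function is illustrative and handles only unambiguous A/C/G/T codons.

```python
from itertools import product

BASES = "TCAG"
# Standard codon assignments in TCAG order; translation table 11 uses the same
# assignments for elongation (it differs in which start codons are allowed).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def translate(cds: str) -> str:
    """Translate a CDS, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
print(translate("ATGAAAACAT TATTACTAGA TAGCCAGCAC"))  # MKTLLLDSQH
```

The ten residues produced match the start of the protein sequence listed above; the same function can be applied to the full gene sequence for a complete comparison.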