Gene Aazo_0103 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_0103 
Symbol 
ID9337887 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp91467 
End bp92615 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content42% 
IMG OID 
Productclass V aminotransferase 
Protein accessionYP_003719875 
Protein GI298489698 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACAATA AGCTAATGTT GATGATTCCT GGACCAACCC CAGTTCCAGA AGCTGCCTTA 
TTGGCATTGG CCAAGCACCC AATTGGACAC CGTACTGCTG AATTCAGCAA TATGATGGCG
GAGGTGACAG AAAACCTCAA ATGGCTGCAC CAAACTGAAA GTGATGTGCT AATGCTGAAT
GTTAGCGGTA CTGGTGCAGT AGAAGCGGGA ATAATTAATT TTCTTTCTCC AGACGATCAC
ATTTTAGTCG GTTCTAATGG TAAATTCGGT GAACGCTGGG TAGAAGTTGG TCAAGCGTTT
GGTTTGAATG TGGAAACTGT CACCGCAGAA TGGGGACAAC CTTTAGACCC AGCTAAGTTT
GCCGAAAAGT TGCAAGCTGA CACAAACAAG GAAATTAAAG CTGTAATTAT TACTCACAGC
GAAACTTCAA CAGGTGTAAT TAATGATTTG GTAGCTATCA ACAGCCATGT AAAAGCACAT
GGTGAAGCCT TAATTATTGT TGATGCTGTC ACCAGCTTGG GTGCATACAA TGTTGCAGTT
GATGCTTTAG GTTTGGATAT AGTCGCTTCT GGTTCCCAAA AAGGCTACAT GATACCACCC
GGTTTAGGAT TTGTGTCTGT GAGTCCTAAA GGTTGGGAAG CTTATAAAAC TGCTAAGTTG
CCAAAATATT ATTTAGATTT AGGTAAATAT CGCAAATCGA CTGCTAAAAA TACAACTCCT
TTTACTCCCC CAGTTAATTT GATTGTGGCA TTACACACCA CCTTGGGGAT GATGAAGAAA
GAGGGTTTGG AGTCAATTTT TGCTCGTCAT GAACGTCAAA AGAATGCTAC CCGGGCAGCA
ATGAAAGCTT TAAACTTACT ATTGTTTGCG GCAGATGAAT GTGCTAGTCC AGCTATTACC
GCTGTATCAG TACCGGGAAT GGAAGCAGAT AAAATTCGGT CGTTGATGAA AAAGCGGTTC
GATATTGCTT TAGCTGGTGG TCAAGACCAT TTGAGCAATA AAATTTTCCG TATTGGTCAC
TTAGGATTTG TGAGCGATCG CGATATTCTT AGCTGTATAG CATCATTGGA AGTCGTACTT
TCAGAACTGG GCTATGAAAA CTTTACCCCT GGTACTGCTA TAGGCGCAGC GGCAAAGGTT
TTCGGATAA
 
Protein sequence
MDNKLMLMIP GPTPVPEAAL LALAKHPIGH RTAEFSNMMA EVTENLKWLH QTESDVLMLN 
VSGTGAVEAG IINFLSPDDH ILVGSNGKFG ERWVEVGQAF GLNVETVTAE WGQPLDPAKF
AEKLQADTNK EIKAVIITHS ETSTGVINDL VAINSHVKAH GEALIIVDAV TSLGAYNVAV
DALGLDIVAS GSQKGYMIPP GLGFVSVSPK GWEAYKTAKL PKYYLDLGKY RKSTAKNTTP
FTPPVNLIVA LHTTLGMMKK EGLESIFARH ERQKNATRAA MKALNLLLFA ADECASPAIT
AVSVPGMEAD KIRSLMKKRF DIALAGGQDH LSNKIFRIGH LGFVSDRDIL SCIASLEVVL
SELGYENFTP GTAIGAAAKV FG