Gene Aazo_0886 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_0886 
Symbol 
ID9338674 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp940978 
End bp942192 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content44% 
IMG OID 
Producthistidyl-tRNA synthetase 2 
Protein accessionYP_003720416 
Protein GI298490239 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.151754 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTGTATC AACCAGCAGC GGGAGCTAGG GATTTACTGC CCTTAGATGT GGCTCAAAAA 
CGCTGGATTG AAGATAGGTT ACAACAAGTT TTTCATCGTT GGGGATATCA CAGGATTATC
ACCTCAACTT TGGAAAGAAT GGATACTTTA ATGGCTGGTG GAGCAATTCA ACGCCATAAG
GTAATACAGT TACAAAATGG GCAAGATGAA GAATTGGGCT TGCGTCCAGA ACTCACAGCT
TCTATTGCCA GGACAGTAGT CACTAGCATG GCAGAGGCTA CTTATCCCCA ACGGTTGTAT
TACAATGCTA ATGTGTTTCG TCGTAGTAAC TGGGAAAAGC GACATAATCG CCAGCAAGAG
TATTATCAGG CTGGAGTAGA GTTGCTAGGA TCAGGTGGGT TACTGGCAAA TGCAGAAGTG
CTGCTGTTGG TAGCCAATTG TTTAGAAGCT TTGGATTTGT GGGGATGGCA TTTAATTTTA
GGTGAAGCGG GAATTACCAA ATTTCTGCTT GATGCTTTCC CGACTCATGT CAGAAGTAAA
GTGCGGAGTG CGATCGCTCA CTTAGATCGA GTAGCCTTAG ATACCTTGCC TCTGAGTGAA
GAACTGCACG AACGTGCCAG AATTATGCTT GATTTGCGTG GTAATAGTGC AGATGTCTTG
GCAAAAATCA GCAGTTTAAA CTTAGATGCA GATCAACAAG AAGCAGTAAA TAATCTCAAA
TCTCTCGTCG AGTTACTAGA ATCAGAAGGT AAATTCCCCT TAATTCTTGA CTTGAGTTTG
ATTCAAACCA TAGACTATTA CACAGGTATA GTGTTTGAAG TAGTTAGTAA TACTGATGGT
CAGGCACAGG TACTAGGGCG CGGTGGTCGT TATGATCAGC TTCTAGGGTT ATATCATCCT
CAAGGAGAAA ACATTCCCGG CATAGGCTTT GAGTTGAGCA TTGACGATTT ATACCAACTT
CTTGCTTCTA CTCAGCAATT ACCGCAAACT ACCCCAGCGA GTAACTGGTT AGTAGTGCCA
GAAAGCAAAA ATGCTGACGC TGCAGCCTTT GCTTACGCCC AACAACTGCG AGATTCTACC
AATTTAGTGA GGGTAGAAAT GGACTTAGGG GGAAGAGATG CAGAAGCAAT TCGGAACTAT
GCAAGTCATC ACTCTATCGC CCAAATCGCC TGGATTAAAG CTGATGGTTC ACCCACAATT
GAAGCAGTCC ATTAA
 
Protein sequence
MVYQPAAGAR DLLPLDVAQK RWIEDRLQQV FHRWGYHRII TSTLERMDTL MAGGAIQRHK 
VIQLQNGQDE ELGLRPELTA SIARTVVTSM AEATYPQRLY YNANVFRRSN WEKRHNRQQE
YYQAGVELLG SGGLLANAEV LLLVANCLEA LDLWGWHLIL GEAGITKFLL DAFPTHVRSK
VRSAIAHLDR VALDTLPLSE ELHERARIML DLRGNSADVL AKISSLNLDA DQQEAVNNLK
SLVELLESEG KFPLILDLSL IQTIDYYTGI VFEVVSNTDG QAQVLGRGGR YDQLLGLYHP
QGENIPGIGF ELSIDDLYQL LASTQQLPQT TPASNWLVVP ESKNADAAAF AYAQQLRDST
NLVRVEMDLG GRDAEAIRNY ASHHSIAQIA WIKADGSPTI EAVH