Gene Aazo_2602 details

Gene Information

Locus tag: Aazo_2602
Symbol:
ID: 9340401
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: 'Nostoc azollae' 0708
Kingdom: Bacteria
Replicon accession: NC_014248
Strand:
Start bp: 2693019
End bp: 2694407
Gene Length: 1389 bp
Protein Length: 462 aa
Translation table: 11
GC content: 37%
IMG OID:
Product: histidyl-tRNA synthetase
Protein accession: YP_003721615
Protein GI: 298491438
COG category:
COG ID:
TIGRFAM ID:
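
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return abs(end_bp - start_bp) + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2693019, 2694407)
print(length, protein_length(length))  # 1389 462
```

Under these assumptions the result agrees with the Gene Length and Protein Length fields listed above.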


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCAAAAA GTGACAAAAT CAACTTTTCT ACCCCTAGCG GGTTTCCCGA ATTTCTGCCT 
AGCGAAAAGC GTCTAGAAGT ATATTTATTA GATATAATCC GTCGTGTTTT TGAAAGCTAT
GGCTTTACAC CGATTGAAAC CCCAGCAGTA GAACGGTTAG AAGTACTGCA AGCTAAGGGC
AATCAGGGGG ATAATATAAT CTATGGTATT GATCCAATTT TACCACCAAA TCGCCAAGCA
GAAAAGGATA AATCTGGCGA AACTGCTTTG GAAGCTAGGG CTTTAAAGTT TGATCAAACT
GTTCCTTTTG CGGCTTATAT TGCACGTCAT TTAAATGAAT TAACTTTTCC TTTTTCTCGT
TATCAAATGG ATATGGTTTT TCGCGGTGAA CGGGCAAAAG ACGGGCGGTT TCGGCAGTTT
CGTCAATGTG ATATTGATGT GGTGGCTCGT GGTAAACTCA GTTTGCTATA TGATGCCCAA
ATGCCGGCAA TTATAACTGA AATATTTGAA GCTATTAATA TTGGTGATTT TGCAATTCGC
ATCAATAACC GCAAAATTCT GACAGGGTTT TTTCAGTCTG TGGAAGTTCC GGAAAACCAA
ATTAAAGCTT GTATTGCCAT TATTGATAAT TTGGAAAAAA TTGGAGAAGC AAAAGTTAAG
TCGGAGTTAG AGAAAGAAGG TATTTCTTCC GAACAAACTG AGAAAATTAT TGAGTTTATT
AAAATTGATG GTAGTGTTGA TGATGTTTTA GATAAACTCA ATCATCTCGC GCAAAGTTTT
CCCGAAGCAG AACAATTTAA CTTAGGAGTT ACTGAATTAG CAACGGTAAT TAATGGAGTT
AGAGATTTAG GAGTTGCTGA CAAACGGTTC TGTATTGATT TATCAATTGC TCGTGGTTTA
AATTACTACA CTGGCACGGT TTACGAAACT ACTTTAATAG GACATGAAGC TTTGGGTAGT
ATTTGTTCTG GAGGCAGATA TGAAGAATTA GTGGGAATGT TTTTAGGCGA AAAAATGCCT
GGAGTGGGTA TTTCTATTGG TTTAACTCGG TTAATTAGTC GGTTATTAAA AGCGGGTATT
CTCAATACTT TATCTGCAAC ACCTACCCAG GTGGTGGTAG TGAATATGCA GGAAGATTTG
ATAGCTGTTT ATTTAAAGGT ATCACAGCAA TTAAGACAAG CAGGGATTAA TGTTGTAACT
AACTTTGAAA AACGTCCTTT GGGTAAACAA TTTCAAGCAG CAGATAAACA AGGAATTCAA
TTTTGTGTTA TTATTGGTGC TGATGAAGCC GCAGCCCAAA AATCATCATT AAAGAATTTG
AAAAGTGGTG AGCAAGTGGA GGTAGCTTTG GCAGATTTAG CGGAAGAAAT TAAAAGAAGG
CTGACGTAA
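
A GC percentage such as the one in the GC content field can be computed from a sequence written in this 10-base block layout. This is a minimal sketch, assuming GC content means the share of G and C bases in the gene sequence; the record does not state how its value was computed or rounded, and `gc_content` is a hypothetical helper name.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    seq = "".join(sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(seq.count(base) for base in "GC")
    return 100.0 * gc / len(seq)


# First two blocks of the gene sequence above: 6 of 20 bases are G or C.
print(round(gc_content("ATGGCAAAAA GTGACAAAAT")))  # 30
```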
 
Protein sequence
MAKSDKINFS TPSGFPEFLP SEKRLEVYLL DIIRRVFESY GFTPIETPAV ERLEVLQAKG 
NQGDNIIYGI DPILPPNRQA EKDKSGETAL EARALKFDQT VPFAAYIARH LNELTFPFSR
YQMDMVFRGE RAKDGRFRQF RQCDIDVVAR GKLSLLYDAQ MPAIITEIFE AINIGDFAIR
INNRKILTGF FQSVEVPENQ IKACIAIIDN LEKIGEAKVK SELEKEGISS EQTEKIIEFI
KIDGSVDDVL DKLNHLAQSF PEAEQFNLGV TELATVINGV RDLGVADKRF CIDLSIARGL
NYYTGTVYET TLIGHEALGS ICSGGRYEEL VGMFLGEKMP GVGISIGLTR LISRLLKAGI
LNTLSATPTQ VVVVNMQEDL IAVYLKVSQQ LRQAGINVVT NFEKRPLGKQ FQAADKQGIQ
FCVIIGADEA AAQKSSLKNL KSGEQVEVAL ADLAEEIKRR LT
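
The protein sequence corresponds to the gene sequence translated codon by codon. The sketch below assumes the codon assignments of translation table 11, which match the standard genetic code for elongation; it does not handle the alternative start codons that table 11 permits (this gene begins with ATG, so that case does not arise here). `translate` and the table constants are hypothetical names.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order: first base varies slowest, bases ordered T, C, A, G.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence above.
print(translate("ATGGCAAAAA GTGACAAAAT CAACTTTTCT"))  # MAKSDKINFS
print(translate("CTGACGTAA"))  # LT
```

The two outputs match the first ten and last two residues of the protein sequence listed above.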