Gene Information |
Locus tag | Aazo_4102 |
Symbol | |
ID | 9341907 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | 'Nostoc azollae' 0708 |
Kingdom | Bacteria |
Replicon accession | NC_014248 |
Strand | - |
Start bp | 4169916 |
End bp | 4172936 |
Gene Length | 3021 bp |
Protein Length | 1006 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | |
Product | valyl-tRNA synthetase |
Protein accession | YP_003722672 |
Protein GI | 298492495 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
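The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; the function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(4169916, 4172936)
print(length)                           # 3021
print(expected_protein_length(length))  # 1006
```

Under those assumptions, both results agree with the Gene Length and Protein Length rows of the record.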
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGCAA CTATTCCCAA TCTCCCCAGT CTCTACGAAG CCTTCTCCAC AGAAGCCAAA TGGCAAAAAT TCTGGGAAGA AAACCAAGTC TACAAGCCAG ACCCGAATCA CAAAGGTGAA CCCTTCTGCG TCGTTATCCC ACCACCAAAC GTTACTGGCA GTTTACATAT AGGTCACGCC TTTGAAAGTG CGTTGATTGA TACCCTTGTG CGCTACCATC GGATGAAGGG ACATAATACT CTATGGGTAC CGGGAACAGA CCACGCGAGT ATAGCAGTCC AAACAATTTT AGAAAACCAA CTCAAGGCTG AAGGTAAAAC TCGCCATGAT GTAGGTCGAG AAAAATTCCT GGAACGCGCC TGGCAATGGA AAAATGAATC TGGCGGGACA ATTGTTAATC AATTACGCCG TTTGGGTGTT TCTGTGGACT GGTCACGGGA ACGCTTCACC TTGGATGAAG GTTTATCTAA AGCTGTTTTA GAAGCATTTA TCCGCCTCTA TGAAGAAGGA TTAATTTACC GCAGTAACTA TCTGGTTAAC TGGTGTCCCG CTTCTCAGTC TGCGGTATCT GATTTAGAAG TCGAACCAAA AGAAGTAAAT GGTAATCTTT GGCACTTCCG TTATCCCCTT AGCGATGGTT CTCGTTTTGT AGAAGTTGCG ACAACAAGAC CAGAAACCAT GTTGGGTGAT ACAGCAGTTG TGGTTAACCC TGGTGATGAA CGGTATAAAG ATTTAATTGG TAAAACCCTC ATCGTACCCA TCATAAATCG GGAAATCCCC ATTATTGGTG ATGAGTTAGT TGACCCTGCT TTCGGTACAG GTTGTGTGAA AGTGACTCCC GCCCATGACC CCAATGATTT TGAAATGGGT AAGCGTCACA ACTTGCCGTT TATCAATATT ATGAACAAAG ACGGCACGCT GAACGAGAAC GCCGGTGAGT TTCAAGGTCA AGACCGCTTC GTTGCTAGAA AAAATGTCAT TGCCCGTTTA GAAGCTGATG GTGTACTGGT TAAAATAGAA GATTATAAAC ATACAGTACC TTATAGCGAT CGCGGTAAAG TCCCCGTTGA ACCCCTGCTT TCAACTCAGT GGTTTGTGAA AGTTCGTCCC CTCGCTGACA AAACCCTGGA ATTCCTTGAC CAGCAAAATT CCCCTGAGTT CGTCCCCCAA CGCTGGACTA AGGTTTATCG TGACTGGTTA GTAAATCTGC GTGACTGGTG TATTTCCCGT CAACTTTGGT GGGGTCATCA AATCCCTGCT TGGTATGCTG TTAGTGAAAC AGGCGGCGAA ATTACCGACA CCACACCCCA TTTCGTCGCA CGGAATGAAG CAGAAGCTTT AGAAAAAGCT AAATCACAAT TTGGGGAAGA AGTTAAATTA GAACAAGACC CCGATGTTTT AGATACTTGG TTTTCCTCTG GTTTATGGCC ATTTTCAACT TTAGGTTGGC CAGAACAAAC ACAGGATGTA GAAACTTACT ATCCCACCAC TACCTTAGTT ACAGGCTTTG ACATCATCTT TTTCTGGGTA GCAAGAATGA CGATGATGGC AGGACATTTT ACAGGAGAAA TGCCTTTCAA AACTGTTTAT ATTCATGGTT TAGTGAGGGA TGAAAATAAT AAGAAAATGT CCAAATCAGC AAATAATGGA ATTGACCCAT TATTGTTGAT GGATAAATAT GGAACTGATG CCCTCCGTTA TACTTTAGTG AAAGAAGTAG CCGGTGCTGG TCAAGATATT CGCTTAGAAT ATGACCGCAA AAAAGATGAA TCAATATCTG TCGAAGCATC ACGTAATTTT 
GCTAATAAAT TATGGAATGC CGCCAGATTT GTAATGATGA ATTTGGATGG ACAAACTCCA GTACAATTGG GTAAACCTGC ACTTACAGAA CTTAGTGATA AATGGATTAT TTCCCGTTAT CATCAAGTAG TTAGACAGAC AAATAATTAC ATTGATAATT ACGGTTTAGG AGAAGCAGCT AAAGGACTTT ATGAGTTCAT TTGGGGTGAT TTCTGTGATT GGTATATTGA ATTAGTCAAA TCCAGATTAC AAAAAGATGC AGATCCGGCA TCTCGTAAAG TTGCCCAACA AATTCTTGCT GAAATATTGG AAGGAGTATT AAAGTTATTA CATCCTTTCA TGCCCCATAT TACCGAAGAG ATTTGGCAAA CTCTCACCCA ACAAATAGCC GAAAGTCCTC AGACTTTAGC TTTACAAAGC TATCCAGAAA CGGATACAAA TTTAATTGAT TCTTCTTTAG AAGAACAGTT TGATTTGTTA ATTGATACAA TCCGCACTAT TCGCAATTTA CGCGCGGAAG CTGATGTTAA ACCAGGAGTA AAAATTACTG CTAATTTGCA AAGTGAAAGC GATAAAGAAA GGGTAATTCT TACAACTGGA CAGTATTACA TCAAAGATTT AGCTAAGGTA GAAACCTTAA CTATTAATGC TCCCAAAACT ACTGTAGAAG AACCAAGAAT TAACCAAATT TTTACCAGTC CTTATTGGCG AACTTTCAAA ACTATTGCTT TAATTATTAT TGTCTTAGTT TCCATCAGAT TCGCTATCTT TGTCGGAAAT ACAACTCTAC GTCTACCAAT TTTTGGGATG TTCTTTGAAA CTTTGGGTTT GGGTTACGCT GGTTGCTTTT TTGTTCGCTA TTTACTAAAT GCTAAAGCGA GACAAGAATT ATTTACTAAA TACTTCCCAG TCAAAGAAAC GTCAACAACA GCAGAAATAA CACCAACACC AGAAATAGAA AATTCTATTT CTGGTGTTGT TGGTACAGTT CAGGTTGTAA TTCCCTTGAC TGGAGTAGTT GATATTGAAG TCTTACGTGC CAAATTAGAG AAAAGCTTGA ATAAAGTTGA AACGGAAGCT AAATCTTTAA GTGGAAGATT GAGTAATTCC AGCTTTGTAG ATAAAGCACC TATAGATGTG GTACAAGCTA CAAGAAATGC TTTCACAGAA GCCGAAAAAC AAGCAGAAAT TTTACGCGCT CGCTTGCGTA GTCTAGTATA A
|
Protein sequence | MTATIPNLPS LYEAFSTEAK WQKFWEENQV YKPDPNHKGE PFCVVIPPPN VTGSLHIGHA FESALIDTLV RYHRMKGHNT LWVPGTDHAS IAVQTILENQ LKAEGKTRHD VGREKFLERA WQWKNESGGT IVNQLRRLGV SVDWSRERFT LDEGLSKAVL EAFIRLYEEG LIYRSNYLVN WCPASQSAVS DLEVEPKEVN GNLWHFRYPL SDGSRFVEVA TTRPETMLGD TAVVVNPGDE RYKDLIGKTL IVPIINREIP IIGDELVDPA FGTGCVKVTP AHDPNDFEMG KRHNLPFINI MNKDGTLNEN AGEFQGQDRF VARKNVIARL EADGVLVKIE DYKHTVPYSD RGKVPVEPLL STQWFVKVRP LADKTLEFLD QQNSPEFVPQ RWTKVYRDWL VNLRDWCISR QLWWGHQIPA WYAVSETGGE ITDTTPHFVA RNEAEALEKA KSQFGEEVKL EQDPDVLDTW FSSGLWPFST LGWPEQTQDV ETYYPTTTLV TGFDIIFFWV ARMTMMAGHF TGEMPFKTVY IHGLVRDENN KKMSKSANNG IDPLLLMDKY GTDALRYTLV KEVAGAGQDI RLEYDRKKDE SISVEASRNF ANKLWNAARF VMMNLDGQTP VQLGKPALTE LSDKWIISRY HQVVRQTNNY IDNYGLGEAA KGLYEFIWGD FCDWYIELVK SRLQKDADPA SRKVAQQILA EILEGVLKLL HPFMPHITEE IWQTLTQQIA ESPQTLALQS YPETDTNLID SSLEEQFDLL IDTIRTIRNL RAEADVKPGV KITANLQSES DKERVILTTG QYYIKDLAKV ETLTINAPKT TVEEPRINQI FTSPYWRTFK TIALIIIVLV SIRFAIFVGN TTLRLPIFGM FFETLGLGYA GCFFVRYLLN AKARQELFTK YFPVKETSTT AEITPTPEIE NSISGVVGTV QVVIPLTGVV DIEVLRAKLE KSLNKVETEA KSLSGRLSNS SFVDKAPIDV VQATRNAFTE AEKQAEILRA RLRSLV
|
| |
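The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any computation. The following is a minimal sketch of how the gene sequence could be normalised, its GC fraction computed, and its translation checked against the listed protein. It assumes the gene sequence is given in coding orientation (it begins with ATG and ends with TAA, although the gene lies on the minus strand), and it relies on translation table 11 sharing the standard codon-to-amino-acid assignments. The helper names `clean`, `gc_fraction` and `translate` are hypothetical.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
# Table 11 uses the same assignments and differs only in its
# permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE.get(seq[pos:pos + 3], "X")  # X marks an unknown codon
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence as printed in the record.
first_30 = "ATGACCGCAA CTATTCCCAA TCTCCCCAGT"
print(translate(first_30))    # MTATIPNLPS
print(gc_fraction(first_30))  # 0.5
```

The translated fragment matches the first ten residues of the listed protein sequence. An alternative start codon such as GTG or TTG would need special handling to be read as M, but this gene starts with ATG, so the sketch ignores that case.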