Gene Aazo_4102 details

Gene Information

Locus tag: Aazo_4102
Symbol:
ID: 9341907
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: 'Nostoc azollae' 0708
Kingdom: Bacteria
Replicon accession: NC_014248
Strand:
Start bp: 4169916
End bp: 4172936
Gene Length: 3021 bp
Protein Length: 1006 aa
Translation table: 11
GC content: 40%
IMG OID:
Product: valyl-tRNA synthetase
Protein accession: YP_003722672
Protein GI: 298492495
COG category:
COG ID:
TIGRFAM ID:
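
The length fields above can be checked against the coordinates. The following is a minimal Python sketch, assuming that the start and end positions are 1-based and inclusive (the record does not state this, although the listed values are consistent with it) and that the final codon of the CDS is a stop codon. The function names are hypothetical and chosen for illustration only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(4169916, 4172936)
print(length)                     # 3021
print(protein_length_aa(length))  # 1006
```

Both results agree with the Gene Length and Protein Length fields of this record.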


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCGCAA CTATTCCCAA TCTCCCCAGT CTCTACGAAG CCTTCTCCAC AGAAGCCAAA 
TGGCAAAAAT TCTGGGAAGA AAACCAAGTC TACAAGCCAG ACCCGAATCA CAAAGGTGAA
CCCTTCTGCG TCGTTATCCC ACCACCAAAC GTTACTGGCA GTTTACATAT AGGTCACGCC
TTTGAAAGTG CGTTGATTGA TACCCTTGTG CGCTACCATC GGATGAAGGG ACATAATACT
CTATGGGTAC CGGGAACAGA CCACGCGAGT ATAGCAGTCC AAACAATTTT AGAAAACCAA
CTCAAGGCTG AAGGTAAAAC TCGCCATGAT GTAGGTCGAG AAAAATTCCT GGAACGCGCC
TGGCAATGGA AAAATGAATC TGGCGGGACA ATTGTTAATC AATTACGCCG TTTGGGTGTT
TCTGTGGACT GGTCACGGGA ACGCTTCACC TTGGATGAAG GTTTATCTAA AGCTGTTTTA
GAAGCATTTA TCCGCCTCTA TGAAGAAGGA TTAATTTACC GCAGTAACTA TCTGGTTAAC
TGGTGTCCCG CTTCTCAGTC TGCGGTATCT GATTTAGAAG TCGAACCAAA AGAAGTAAAT
GGTAATCTTT GGCACTTCCG TTATCCCCTT AGCGATGGTT CTCGTTTTGT AGAAGTTGCG
ACAACAAGAC CAGAAACCAT GTTGGGTGAT ACAGCAGTTG TGGTTAACCC TGGTGATGAA
CGGTATAAAG ATTTAATTGG TAAAACCCTC ATCGTACCCA TCATAAATCG GGAAATCCCC
ATTATTGGTG ATGAGTTAGT TGACCCTGCT TTCGGTACAG GTTGTGTGAA AGTGACTCCC
GCCCATGACC CCAATGATTT TGAAATGGGT AAGCGTCACA ACTTGCCGTT TATCAATATT
ATGAACAAAG ACGGCACGCT GAACGAGAAC GCCGGTGAGT TTCAAGGTCA AGACCGCTTC
GTTGCTAGAA AAAATGTCAT TGCCCGTTTA GAAGCTGATG GTGTACTGGT TAAAATAGAA
GATTATAAAC ATACAGTACC TTATAGCGAT CGCGGTAAAG TCCCCGTTGA ACCCCTGCTT
TCAACTCAGT GGTTTGTGAA AGTTCGTCCC CTCGCTGACA AAACCCTGGA ATTCCTTGAC
CAGCAAAATT CCCCTGAGTT CGTCCCCCAA CGCTGGACTA AGGTTTATCG TGACTGGTTA
GTAAATCTGC GTGACTGGTG TATTTCCCGT CAACTTTGGT GGGGTCATCA AATCCCTGCT
TGGTATGCTG TTAGTGAAAC AGGCGGCGAA ATTACCGACA CCACACCCCA TTTCGTCGCA
CGGAATGAAG CAGAAGCTTT AGAAAAAGCT AAATCACAAT TTGGGGAAGA AGTTAAATTA
GAACAAGACC CCGATGTTTT AGATACTTGG TTTTCCTCTG GTTTATGGCC ATTTTCAACT
TTAGGTTGGC CAGAACAAAC ACAGGATGTA GAAACTTACT ATCCCACCAC TACCTTAGTT
ACAGGCTTTG ACATCATCTT TTTCTGGGTA GCAAGAATGA CGATGATGGC AGGACATTTT
ACAGGAGAAA TGCCTTTCAA AACTGTTTAT ATTCATGGTT TAGTGAGGGA TGAAAATAAT
AAGAAAATGT CCAAATCAGC AAATAATGGA ATTGACCCAT TATTGTTGAT GGATAAATAT
GGAACTGATG CCCTCCGTTA TACTTTAGTG AAAGAAGTAG CCGGTGCTGG TCAAGATATT
CGCTTAGAAT ATGACCGCAA AAAAGATGAA TCAATATCTG TCGAAGCATC ACGTAATTTT
GCTAATAAAT TATGGAATGC CGCCAGATTT GTAATGATGA ATTTGGATGG ACAAACTCCA
GTACAATTGG GTAAACCTGC ACTTACAGAA CTTAGTGATA AATGGATTAT TTCCCGTTAT
CATCAAGTAG TTAGACAGAC AAATAATTAC ATTGATAATT ACGGTTTAGG AGAAGCAGCT
AAAGGACTTT ATGAGTTCAT TTGGGGTGAT TTCTGTGATT GGTATATTGA ATTAGTCAAA
TCCAGATTAC AAAAAGATGC AGATCCGGCA TCTCGTAAAG TTGCCCAACA AATTCTTGCT
GAAATATTGG AAGGAGTATT AAAGTTATTA CATCCTTTCA TGCCCCATAT TACCGAAGAG
ATTTGGCAAA CTCTCACCCA ACAAATAGCC GAAAGTCCTC AGACTTTAGC TTTACAAAGC
TATCCAGAAA CGGATACAAA TTTAATTGAT TCTTCTTTAG AAGAACAGTT TGATTTGTTA
ATTGATACAA TCCGCACTAT TCGCAATTTA CGCGCGGAAG CTGATGTTAA ACCAGGAGTA
AAAATTACTG CTAATTTGCA AAGTGAAAGC GATAAAGAAA GGGTAATTCT TACAACTGGA
CAGTATTACA TCAAAGATTT AGCTAAGGTA GAAACCTTAA CTATTAATGC TCCCAAAACT
ACTGTAGAAG AACCAAGAAT TAACCAAATT TTTACCAGTC CTTATTGGCG AACTTTCAAA
ACTATTGCTT TAATTATTAT TGTCTTAGTT TCCATCAGAT TCGCTATCTT TGTCGGAAAT
ACAACTCTAC GTCTACCAAT TTTTGGGATG TTCTTTGAAA CTTTGGGTTT GGGTTACGCT
GGTTGCTTTT TTGTTCGCTA TTTACTAAAT GCTAAAGCGA GACAAGAATT ATTTACTAAA
TACTTCCCAG TCAAAGAAAC GTCAACAACA GCAGAAATAA CACCAACACC AGAAATAGAA
AATTCTATTT CTGGTGTTGT TGGTACAGTT CAGGTTGTAA TTCCCTTGAC TGGAGTAGTT
GATATTGAAG TCTTACGTGC CAAATTAGAG AAAAGCTTGA ATAAAGTTGA AACGGAAGCT
AAATCTTTAA GTGGAAGATT GAGTAATTCC AGCTTTGTAG ATAAAGCACC TATAGATGTG
GTACAAGCTA CAAGAAATGC TTTCACAGAA GCCGAAAAAC AAGCAGAAAT TTTACGCGCT
CGCTTGCGTA GTCTAGTATA A
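
The gene sequence above is printed in blocks of 10 bases, so it needs to be joined before any computation. The sketch below shows one way to do that and to compute a GC percentage such as the one in the GC content field. The helper names are hypothetical, and the fragment used here is only the first two lines of the sequence, so its GC percentage is not expected to equal the value reported for the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two lines of the gene sequence, used as an illustration only.
fragment = """
ATGACCGCAA CTATTCCCAA TCTCCCCAGT CTCTACGAAG CCTTCTCCAC AGAAGCCAAA
TGGCAAAAAT TCTGGGAAGA AAACCAAGTC TACAAGCCAG ACCCGAATCA CAAAGGTGAA
"""
seq = clean_sequence(fragment)
print(len(seq))                   # 120
print(round(gc_percent(seq), 1))
```

Applying `gc_percent` to the full cleaned sequence would give the figure to compare with the GC content field.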
 
Protein sequence
MTATIPNLPS LYEAFSTEAK WQKFWEENQV YKPDPNHKGE PFCVVIPPPN VTGSLHIGHA 
FESALIDTLV RYHRMKGHNT LWVPGTDHAS IAVQTILENQ LKAEGKTRHD VGREKFLERA
WQWKNESGGT IVNQLRRLGV SVDWSRERFT LDEGLSKAVL EAFIRLYEEG LIYRSNYLVN
WCPASQSAVS DLEVEPKEVN GNLWHFRYPL SDGSRFVEVA TTRPETMLGD TAVVVNPGDE
RYKDLIGKTL IVPIINREIP IIGDELVDPA FGTGCVKVTP AHDPNDFEMG KRHNLPFINI
MNKDGTLNEN AGEFQGQDRF VARKNVIARL EADGVLVKIE DYKHTVPYSD RGKVPVEPLL
STQWFVKVRP LADKTLEFLD QQNSPEFVPQ RWTKVYRDWL VNLRDWCISR QLWWGHQIPA
WYAVSETGGE ITDTTPHFVA RNEAEALEKA KSQFGEEVKL EQDPDVLDTW FSSGLWPFST
LGWPEQTQDV ETYYPTTTLV TGFDIIFFWV ARMTMMAGHF TGEMPFKTVY IHGLVRDENN
KKMSKSANNG IDPLLLMDKY GTDALRYTLV KEVAGAGQDI RLEYDRKKDE SISVEASRNF
ANKLWNAARF VMMNLDGQTP VQLGKPALTE LSDKWIISRY HQVVRQTNNY IDNYGLGEAA
KGLYEFIWGD FCDWYIELVK SRLQKDADPA SRKVAQQILA EILEGVLKLL HPFMPHITEE
IWQTLTQQIA ESPQTLALQS YPETDTNLID SSLEEQFDLL IDTIRTIRNL RAEADVKPGV
KITANLQSES DKERVILTTG QYYIKDLAKV ETLTINAPKT TVEEPRINQI FTSPYWRTFK
TIALIIIVLV SIRFAIFVGN TTLRLPIFGM FFETLGLGYA GCFFVRYLLN AKARQELFTK
YFPVKETSTT AEITPTPEIE NSISGVVGTV QVVIPLTGVV DIEVLRAKLE KSLNKVETEA
KSLSGRLSNS SFVDKAPIDV VQATRNAFTE AEKQAEILRA RLRSLV
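
The protein sequence can be cross-checked against the gene sequence by translating it. The sketch below is a minimal, standard-library-only translation, not the tool used to produce this record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs in its permitted start codons. This gene starts with ATG, so no special handling of alternative start codons is included. The names `CODON_TABLE` and `translate` are hypothetical, and ambiguous bases such as N are not handled.

```python
BASES = "TCAG"
# Amino acids for the 64 codons, enumerated in TCAG order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a CDS, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


first_line = "ATGACCGCAA CTATTCCCAA TCTCCCCAGT CTCTACGAAG CCTTCTCCAC AGAAGCCAAA"
print(translate(first_line))  # MTATIPNLPSLYEAFSTEAK
```

The output matches the first 20 residues of the protein sequence above. Translating the last 21 bases of the gene (CGCTTGCGTA GTCTAGTATA A) in the same way gives RLRSLV, which matches the end of the protein.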