Gene Aazo_3394 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_3394 
Symbol 
ID9341199 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp3458182 
End bp3460590 
Gene Length2409 bp 
Protein Length802 aa 
Translation table11 
GC content43% 
IMG OID 
Productphosphoribosylformylglycinamidine synthase II 
Protein accessionYP_003722167 
Protein GI298491990 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCACCA CCGAACAAAA TCCATTTTCT CCGCAACAAA TAGCAGAAGA AGGTATCAAG 
CCTGAAGAAT ATGCAGAAAT AGTCCGTCGC TTAGGTCGTC ATCCCAATAA AGCTGAATTA
GGAATGTTTG GGGTAATGTG GTCTGAACAT TGCTGTTATA AAAATTCTCG TCCCTTACTG
AAACAGTTTC CCACCACAGG CCCCCGCATC CTCGTGGGAC CAGGTGAAAA TGCTGGAGTT
GTTGATATTG GGAATGGACT GAGATTAGCT TTTAAAATAG AATCCCATAA CCACCCTTCA
GCCGTTGAAC CATTCCAAGG TGCAGCAACG GGTGTGGGTG GTATATTAAG AGATATTTTT
ACAATGGGTG CGCGTCCTAT TGCTTTATTA AATTCTCTCA GATTTGGTGA TTTAGATGAT
GCGAAAACTC AAAGGTTATT TACGGGCGCT GTAGCTGGTA TAAGCTACTA TGGTAACTGC
GTCGGTGTTC CCACTGTTGG TGGTGAAGTT TATTTTGACT CAGCCTATTC TGGTAATCCC
TTAGTTAATG TCATGGCACT GGGATTAATG GAAACAGAAG ATATCGTAAA ATCTGGTGCT
TCTGGTTTTG GTAATCCTGT ATTATATGTT GGTTCTACCA CTGGTCGGGA TGGTATGGGT
GGTGCAAGTT TTGCCAGTAC AGAATTAACT GATCAATCAA TGGATGACCG CCCTGCAGTG
CAAGTAGGCG ACCCATTTTT AGAAAAATCT CTAATTGAAG CTTGTTTGGA AGCGTTCAAA
ACTGGGGCTG TTGTTGCAGC ACAAGATATG GGTGCGGCTG GTATAACTTG TTCTACTTCG
GAAATGGCTG CGAAAGGCAG TGTGGGTATT GAGTTTGACT TGGATAAAAT CCCGGCTCGT
GAGTTAGGAA TGATACCTTA TGAATATCTG TTGTCGGAAT CTCAAGAAAG AATGCTGTTT
GTTGCCCATA AAGGACGGGA AAAGGAATTA ATTGATATTT TCCACCGTTG GGGACTACAA
GCAGTGGTTG CTGGTACGGT GATTGCTGAA CCCATTGTGA GGATATTATT TCAGGGTAAA
GTAGCTGCTG AAATTCCAGC TGATGCTTTG GCAGAAAATA CACCTTTATA TGAACGGGAA
CTATTGACAA ACCCGCCACA ATATGCGCTC GAAGCTTGGG CTTGGACGGT TGATAGTTTA
CCTGCTTGTG ATGCTACAGG AATTGAAACT GGCAGAATTT GGAAAAGTTG GAATGAGATT
TTATTAATCT TGCTGAATAC GCCAACTATT GCTTCTAAAA GCTGGGTTTA TCGCCAATAT
GACCATCAAG TACAGAATAA TACGGTGTTG TTACCTGGTG GTGCTGATGC TGCGGTGCTG
CGGTTACGTC CTTTGGAAGA TATCCTCAAT CTCACAACTA AAGCTGAAAA TCTCAAATCT
GGGGTGGCTG CTACGGTAGA TTGTAATTCG CGTTATGTGT ATCTTGATCC TTATGAGGGT
GCTAAGGCTG TGGTAGCGGA AGCAGCGCGG AATCTTAGCT GTGTGGGTGC TGAACCCTTG
GCTGTGACTG ATAATCTCAA TTTTGGTAGT CCTGAAAAAC CAATTGGTTA TTGGCAATTA
GCTTCTGCTT GTCAGGGTTT GAGTGAGGGG TGTCAGGAGT TCTCTACACC TGTGACTGGC
GGAAATGTTT CTTTGTATAA TGAGACACTA GATAGTCAGG GAAATCCCCA ACCAATTTAT
CCGACTCCAG TGGTGGGGAT GGTTGGTTTA ATAGAGGATT TGACTAAAAT ATGTGGACAA
GGTTTACAAA GCCCAGGTGA TTTGATTTAC CTTCTTGGTA TAGATGTGAA ATCTCGATCC
CCCCAACCCC CCTTAGAAAG GGGGGCAGAA ATATCTAAAA TTGAATTGGG TGCTTCGGAA
TATTTGGCAA CGATCCATAA TACAATTGCT GGTAAGCCGC CCAGGGTAGA TTTTGATTTG
GAACGTCGTG TACAAAAAGT TTGTCGTGAG GGTATTCGCG CTGGATGGGT GCGTTCCGCT
CATGATACTG CTGAAGGTGG TTTAACGGTA GCTTTGGCTG AATGTTGTCT TTCTGGTCAT
CTTGGCGCAG AGATTAATTT GGGTATTTCT GCAACTCATG ATTTGAGATT TGATGAGGTG
CTATTTGGTG AAGGTGGTGC AAGGATTTTG GTTTCTGTGA GTGAGGAACA TCTCGAAGTT
TGGGAATCCT ATTTACAGGA GTATTTGCCA GAAAATTGGC AGAAATTAGG TGTGGTTAGT
AATTCCGAAT TAGGTTTGCA GATTTGCACC AATGACAACC ATGATTTAAT CAAGGTTAGC
ATTCAAGAAA TGAGCGAGCG CTATGATCTA GCAATATCTA ATCGGCTCGC TATCCATGTT
AATACCTAA
 
Protein sequence
MTTTEQNPFS PQQIAEEGIK PEEYAEIVRR LGRHPNKAEL GMFGVMWSEH CCYKNSRPLL 
KQFPTTGPRI LVGPGENAGV VDIGNGLRLA FKIESHNHPS AVEPFQGAAT GVGGILRDIF
TMGARPIALL NSLRFGDLDD AKTQRLFTGA VAGISYYGNC VGVPTVGGEV YFDSAYSGNP
LVNVMALGLM ETEDIVKSGA SGFGNPVLYV GSTTGRDGMG GASFASTELT DQSMDDRPAV
QVGDPFLEKS LIEACLEAFK TGAVVAAQDM GAAGITCSTS EMAAKGSVGI EFDLDKIPAR
ELGMIPYEYL LSESQERMLF VAHKGREKEL IDIFHRWGLQ AVVAGTVIAE PIVRILFQGK
VAAEIPADAL AENTPLYERE LLTNPPQYAL EAWAWTVDSL PACDATGIET GRIWKSWNEI
LLILLNTPTI ASKSWVYRQY DHQVQNNTVL LPGGADAAVL RLRPLEDILN LTTKAENLKS
GVAATVDCNS RYVYLDPYEG AKAVVAEAAR NLSCVGAEPL AVTDNLNFGS PEKPIGYWQL
ASACQGLSEG CQEFSTPVTG GNVSLYNETL DSQGNPQPIY PTPVVGMVGL IEDLTKICGQ
GLQSPGDLIY LLGIDVKSRS PQPPLERGAE ISKIELGASE YLATIHNTIA GKPPRVDFDL
ERRVQKVCRE GIRAGWVRSA HDTAEGGLTV ALAECCLSGH LGAEINLGIS ATHDLRFDEV
LFGEGGARIL VSVSEEHLEV WESYLQEYLP ENWQKLGVVS NSELGLQICT NDNHDLIKVS
IQEMSERYDL AISNRLAIHV NT