Gene Aazo_3594 details

Gene Information

Locus tag: Aazo_3594
Symbol:
ID: 9341400
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: 'Nostoc azollae' 0708
Kingdom: Bacteria
Replicon accession: NC_014248
Strand:
Start bp: 3661939
End bp: 3663450
Gene Length: 1512 bp
Protein Length: 503 aa
Translation table: 11
GC content: 41%
IMG OID:
Product: anthranilate synthase component I
Protein accession: YP_003722305
Protein GI: 298492128
COG category:
COG ID:
TIGRFAM ID:
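
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that contributes no residue to the protein. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 3661939
end_bp = 3663450
gene_length_bp = 1512
protein_length_aa = 503

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and encodes no amino acid.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp, codons, expected_protein_aa)  # 1512 504 503
```

Under these assumptions the span matches the listed gene length (1512 bp), and 504 codons minus one stop codon matches the listed protein length (503 aa).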


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATATTCC CCGATTTCCA GCAATTTACA GAACTAGCAA AACAAGGTAA TTTTGTCCCT 
GTATATCAAG AATGGGTCGC TGATTTAGAT ACCCCTGTTT CTGCTTGGTA CAAGGTTTGT
GCAGGTCAAC CTTATAGCTT TTTGCTGGAG TCGGTGGAAG GTGGAGAAAA GGTAGGACGT
TATAGTTTAC TTGGTTGTGA TCCGCTGTGG ATTTTGGAAG CGCGGGGAGA TAAAACTACT
CAAACACACC GCGATGGTTC CCAGGAAGTT TTTACAGGTG ATCCTTTTAC TGTTTTAGCG
GATTGTTTAG CACCTTATCA CCCAGTAAAA TTACCACAGT TACCTTCAGG AATCGGCGGA
CTGTTCGGGT TTTGGGGTTA TGAATTGATT AACTGGATTG AACCGTGTGT ACCAATTCAT
CCTCAAGATG AGCGTAATAT CCCTGATGGG TTATGGATGC AAGTAGACCA ACTGTTAATT
TTTGACCAGG TGAAGCGAAA AATCTGGGCG ATCGCCTACG CTGATTTAAG GAATACTGAT
AATTTAGCAG CAGCATATCA AAAAGCGAGC GATCACATCC AACAAATGGT GAGTAAGTTA
TCTTTACCTT TATCACCACA AAATACCCAA CTTCCTTGGA CATCTCCCCA AAATAAACCC
AAAGCGGGAA TGGAAGAATA TATCAGCAAT TTTACCCGTC CCGATTTTTG TGCTAGTGTG
GAAAAAGCTA AAGAATATAT CAAAGCAGGT GATATTTTCC AAGTCGTGAT TTCTCAACGT
CTATCCACAG AATATACAGA AAATCCCTTC GCTTTATATC GTTCCCTACG CCAAATTAAC
CCTTCACCTT ACATGGCGTA TTTTAACTTC CAAGACTGGC AAATTATCGG TTCTAGTCCT
GAAGTTATGG TGAAAGCAGA ACGAGATGAA GAAGGGGGAA TAATCGCCAC TGTCCGACCG
ATTGCGGGAA CTAGACCCAG AGGTAAAACC ACCCAGGAAG ATGAGGCTTT TGCAGCAGAT
TTACTTCAAG ACCCTAAAGA AGTTGCAGAA CATATCATGT TAGTTGATTT AGGACGCAAT
GATTTAGGAC GAGTATGTAA AAATGGCACT GTTAAAGTTG ACGAATTAAT GATAATTGAA
CGCTATTCTC ATGTTATGCA CATTGTCAGT AATGTGGTAG GGAAATTAGC GAAAAATAAA
ACGGCATGGG ATTTATTAAA AGCTTGTTTT CCTGCGGGTA CAGTTAGCGG TGCGCCAAAA
ATTCGCGCTA TGGAAATTAT CAATGAATTA GAACTAACCC GCAGAGGTGT ATATTCTGGT
GTCTATGGAT ATTATGACTT TGAGGGACAA TTAAATAGTG CGCTCGCTAT CAGAACTATG
GTTTTACATA ATCAAACTGT TACTGTCCAA GCTGGTGCAG GTTTAGTCGC TGATTCTGAA
CCAGAAAAGG AATACGAAGA AACTCTCAAT AAAGCCAGAG GTTTATTAGA AGCAATTCGA
TGTTTGCGGT AA
 
Protein sequence
MIFPDFQQFT ELAKQGNFVP VYQEWVADLD TPVSAWYKVC AGQPYSFLLE SVEGGEKVGR 
YSLLGCDPLW ILEARGDKTT QTHRDGSQEV FTGDPFTVLA DCLAPYHPVK LPQLPSGIGG
LFGFWGYELI NWIEPCVPIH PQDERNIPDG LWMQVDQLLI FDQVKRKIWA IAYADLRNTD
NLAAAYQKAS DHIQQMVSKL SLPLSPQNTQ LPWTSPQNKP KAGMEEYISN FTRPDFCASV
EKAKEYIKAG DIFQVVISQR LSTEYTENPF ALYRSLRQIN PSPYMAYFNF QDWQIIGSSP
EVMVKAERDE EGGIIATVRP IAGTRPRGKT TQEDEAFAAD LLQDPKEVAE HIMLVDLGRN
DLGRVCKNGT VKVDELMIIE RYSHVMHIVS NVVGKLAKNK TAWDLLKACF PAGTVSGAPK
IRAMEIINEL ELTRRGVYSG VYGYYDFEGQ LNSALAIRTM VLHNQTVTVQ AGAGLVADSE
PEKEYEETLN KARGLLEAIR CLR
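
The record lists translation table 11 and a GC content of 41%, so the protein sequence and the GC figure can in principle be recomputed from the gene sequence. The following is a minimal sketch, not part of the original record. It assumes that translation table 11 uses the same amino acid assignments as the standard genetic code (the two differ in which start codons are permitted), and the helper names `clean`, `gc_content` and `translate` are hypothetical. Only the first 30 and last 12 bases of the gene sequence are used here; the last 12 bases begin at position 1501, which falls on a codon boundary.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
# Assumption: table 11 shares these amino acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_bases = "ATGATATTCC CCGATTTCCA GCAATTTACA"  # first 30 bases of the gene sequence
last_bases = "TGTTTGCGGT AA"                      # last 12 bases of the gene sequence

print(translate(first_bases))  # MIFPDFQQFT
print(translate(last_bases))   # CLR
```

The two printed fragments agree with the start (MIFPDFQQFT) and end (CLR) of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_content` would extend the same check to the whole record.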