Gene Aazo_0310 details


Gene Information

Locus tag: Aazo_0310
Symbol:
ID: 9338094
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: 'Nostoc azollae' 0708
Kingdom: Bacteria
Replicon accession: NC_014248
Strand:
Start bp: 308762
End bp: 310957
Gene Length: 2196 bp
Protein Length: 731 aa
Translation table: 11
GC content: 38%
IMG OID:
Product: para-aminobenzoate synthase subunit I
Protein accession: YP_003720017
Protein GI: 298489840
COG category:
COG ID:
TIGRFAM ID:
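
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon. The constant and function names are made up for this example and are not part of the database.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 308762
END_BP = 310957
GENE_LENGTH_BP = 2196
PROTEIN_LENGTH_AA = 731


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one for the terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # 2196
    print(expected_protein_length(GENE_LENGTH_BP))  # 731
```

Under those assumptions both derived values agree with the listed Gene Length and Protein Length.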


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.764542
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAAACAT TAATTGTTGA TAATTATGAT TCTTATACCT TCAATTTGTA TCAGATGATT 
GCTGAGGTAA ATGGAAACTA TCCCTTAGTA ATTCGTAATA ATGAAGTAAC CTGGGATGAA
CTAAAAAAAA TTGCCTTTGA CAATATTGTG ATTTCTCCTG GTCCAGGCCG CCCAGAAAAG
TCGGAAGATT TTGGAGTTTG TCGGCAAATT ATTGAAAATA ATGTAGATGT TCCTTTATTG
GGAGTATGTC TGGGACATCA AGGGATTGGC TACCTACATG GTGCTAAAGT GATTCATGCA
CCGGAAGTTA GGCACGGGAG AATAAGTAAA ATTCATCACA ATGAATCTGA ATTATTCCAA
GGTATTCCTA GTCCATTTTC TGTGGTGCGA TATCACTCTT TGTTAGTTGC AGATGAATTA
CCTGAGTGTT TAGAAAAAAT TGCTTGGACT GAAGACGGAT TAGTAATGGG ATTACGCCAT
CGGTATCTAC CCTTTTGGGG TGTACAGTTT CATCCCGAAT CTATTTGTAC AGAATATGGA
ACAACATTAT TAAACAATTT TAGAGATATC ACGCTGCTAT TTACAAAGAA ATTATCAATA
AGTTCAGAAT CAAAATATTA TTTTCCAGGA TATCAGTCTC TTTATTCTTC TGTTGAGGAC
ATGTACCAAC AGGAAGAGAG GTTTGAACTT TATACCAGAA AGTTAGATAT TTGCCCGAAT
ACAGAACAGA TATTTGTCCA TTTATTTCAG GAAGAACCAA ATAGCTTTTG GTTAGATAGT
AGTCGAGTAG AACCGGGTCT GTCTCGCTTC TCATTTATGG GTGATGGCAA AGGGAAAAAC
AGTTTATTGG TGCGTTACCA TACTCAAACT CAAGAACTGA TCATCACACA ATCAGATAAG
GTCACGCGTC GCACCGAAAG TATTTTTGAA TTTCTCAAAC GGGAAATTGG ACTCCGAAGT
TGTCAAAGTG ATGAGTTACC TTTTGATTTT AACTGCGGAT TTGTGGGTTA TTTTGGTTAT
GAACTCAATG CAGAGTGTGG AGCAAAATTG GTACATTCTT CACCATTACC AGATGCAATT
TTTTTACTAG CTGATCGCAT GATTGCTATA GATCACGAAG AACAGTGTCT TTATTTGCTG
GAATTAGTCA AAACAGGACA AACACAACAG GCAGGAACTT GGTTTGATAC GATCCAACAA
CAATTAAAAA TTTTGAATCC TCTTCCTCCA CTTGTAGCAC AGGGAAATGG TGAACCAGTA
AGGTTATATT TAAGTCGTTC TTACCAAAAT TATATTAATG ACATTCACCA TTGTTTAGAG
GAAATTCATG AAGGAGAAAC TTATCAAGTT TGTTTGACTA ATCAACTTCA CGCTGATATC
AGCCCTGATC CCTTGACATT TTACCGCCGA TTACGTCAGA TTAATCCTGC GCCATACTCT
GCGTTTTTGC GGTTTGGAGA GATTGCGATC GCCTGTTCTT CTCCAGAACG ATTTATGCAA
ATTGATCGCC AGGGGTGGGT GGAAACCAAG CCCATAAAAG GTACGCTACG ACGAGGAGAT
AACCCCGAAG AAGATTTCAT TTTACAGGAA CGACTGCGAA ACAGCGAAAA AGACCAGGCT
GAAAATCTTA TGATAGTCGA TTTATTACGT AATGATTTAG GACGGGTTTG TGAAGTAGGT
AGCGTTCATG TACCAAAATT AATGGATGTA GAAACCTATG CCACAGTGCA TCAATTGGTA
ACAACTATCC GTGGACATTT GCCATCAAAC CTCCAAGCCG TAGATTGTAT TCATGCAGCA
TTTCCAGGGG GCTCGATGAC AGGAGCACCC AAGATTAGAA CCATGCAAAT TATTGACAGA
CTCGAGCAAG AAGCACGGGG AGTATATTCA GGAGCAATCG GCTTTTTGGG ATTGAATGGT
GCAAGCGATT TGAACATAGT TATTCGCACT GCTATTTTCA CTCCTGATGG AACTTCTATT
GGTGTCGGTG GTGGTATTGT AGCACTTTCT GACCCAGAAA TGGAATTTCA AGAAATACTA
CTAAAAGCTA AAGCCTTAAT CCAGACTATG GTTATGACAA TACACGGTAA ATTTGAAGAA
GACCTATATA GTATCTTAGG AGTAGAATCA AAAACTTCCC AGTGTTTTAT GATGTTTTCC
AATCTGATTT TGAAACCATA CAATGGGTTC TTTTAA
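
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined before any computation. As a rough sketch of how a GC content figure like the one listed under Gene Information could be recomputed, the helpers below strip the whitespace and count G and C bases. The function names are hypothetical, and the rounding the database applied to its 38% value is not known.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_content(sequence: str) -> float:
    """Percentage of bases that are G or C (0.0 for an empty sequence)."""
    seq = clean_sequence(sequence)
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


if __name__ == "__main__":
    # First line of the gene sequence above, used only as a short demo.
    first_line = (
        "ATGCAAACAT TAATTGTTGA TAATTATGAT "
        "TCTTATACCT TCAATTTGTA TCAGATGATT"
    )
    print(len(clean_sequence(first_line)))
    print(round(gc_content(first_line), 1))
```

Passing the full gene sequence to `gc_content` gives the value to compare against the listed GC content.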
 
Protein sequence
MQTLIVDNYD SYTFNLYQMI AEVNGNYPLV IRNNEVTWDE LKKIAFDNIV ISPGPGRPEK 
SEDFGVCRQI IENNVDVPLL GVCLGHQGIG YLHGAKVIHA PEVRHGRISK IHHNESELFQ
GIPSPFSVVR YHSLLVADEL PECLEKIAWT EDGLVMGLRH RYLPFWGVQF HPESICTEYG
TTLLNNFRDI TLLFTKKLSI SSESKYYFPG YQSLYSSVED MYQQEERFEL YTRKLDICPN
TEQIFVHLFQ EEPNSFWLDS SRVEPGLSRF SFMGDGKGKN SLLVRYHTQT QELIITQSDK
VTRRTESIFE FLKREIGLRS CQSDELPFDF NCGFVGYFGY ELNAECGAKL VHSSPLPDAI
FLLADRMIAI DHEEQCLYLL ELVKTGQTQQ AGTWFDTIQQ QLKILNPLPP LVAQGNGEPV
RLYLSRSYQN YINDIHHCLE EIHEGETYQV CLTNQLHADI SPDPLTFYRR LRQINPAPYS
AFLRFGEIAI ACSSPERFMQ IDRQGWVETK PIKGTLRRGD NPEEDFILQE RLRNSEKDQA
ENLMIVDLLR NDLGRVCEVG SVHVPKLMDV ETYATVHQLV TTIRGHLPSN LQAVDCIHAA
FPGGSMTGAP KIRTMQIIDR LEQEARGVYS GAIGFLGLNG ASDLNIVIRT AIFTPDGTSI
GVGGGIVALS DPEMEFQEIL LKAKALIQTM VMTIHGKFEE DLYSILGVES KTSQCFMMFS
NLILKPYNGF F
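
The protein sequence can be reproduced from the gene sequence by translating codon by codon. The sketch below is a minimal illustration, not the database's own method. It relies on the fact that NCBI translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in its alternative start codons, which do not matter here because this gene starts with ATG. The names `CODON_TABLE` and `translate` are made up for this example.

```python
from itertools import product

# Standard codon assignments in TCAG order (also used by table 11).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Whitespace is ignored so block-formatted sequences can be pasted in.
    Unknown codons become 'X'. Alternative start codons are NOT
    converted to M; this sketch only handles an ATG start.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First line of the gene sequence in this record.
    first_line = (
        "ATGCAAACAT TAATTGTTGA TAATTATGAT "
        "TCTTATACCT TCAATTTGTA TCAGATGATT"
    )
    print(translate(first_line))  # MQTLIVDNYDSYTFNLYQMI
```

The output matches the first 20 residues of the protein sequence listed above.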