Gene Aazo_3338 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_3338 
Symbol 
ID9341142 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp3407487 
End bp3409007 
Gene Length1521 bp 
Protein Length506 aa 
Translation table11 
GC content46% 
IMG OID 
ProductATP synthase F1 subunit alpha 
Protein accessionYP_003722129 
Protein GI298491952 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTATTT CCATTAGACC TGACGAAATC AGCAACATTA TCCAACAGCA AATCGAGCAA 
TACAACCAAG AGGTTAAAGT TGCTAACGTT GGTACTGTGC TACAAGTAGG TGACGGTATT
GCCCGGATAT ATGGTCTAGA AAAGGCTATG GCTGGGGAAC TCCTAGAATT TGAAGATGGT
ACTATTGGTA TCGCCCAAAA CTTAGAAGAA GATAACGTGG GCGCGGTACT GATGGGTGAA
GGTAGAAACA TTCAAGAAGG TAGCTCCGTA ACTGCTACTG GTAGAATTGC TCAAGTAGGG
GTAGGCGAAG TCCTCATTGG TCGTGTCCTT GATGCTTTGG GTCGCGCCAT TGATGGTAAA
GGTGATCCTA AGACTACCGA AACTCGGTTA ATTGAATCCC CAGCACCTGG TATTATTGCC
CGTCGGTCTG TACACGAACC TATGCAAACA GGTATCACCG CAATTGACTC CATGATTCCC
ATCGGCCGTG GTCAACGGGA ATTAATCATT GGAGACCGTC AAACTGGTAA AACTGCGATT
GCAATTGACA CCATCATCAA CCAAAAAGGT GAAGATGTAG TTTGCGTTTA CGTGGCGATC
GGTCAAAAAG CTTCCACAGT TGCTAACGTA GTCCAAACCT TACAAGAAAA AGGCGCAATG
GACTACACCG TAGTTGTAGC AGCTAACGCC AGTGACCCAG CAACCTTACA ATTCCTCGCA
CCCTACACAG GCGCTACCAT TGCTGAATAC TTCATGTATA AAGGCAAAGC AACCTTAGTA
ATTTACGATG ACCTTTCCAA GCAAGCACAG GCATATCGCC AAATGTCCTT GCTACTACGT
CGTCCACCCG GACGGGAAGC GTATCCTGGA GACGTATTCT ACATTCACTC CCGCTTGTTG
GAACGTGCTG CTAAACTCAG CGACGAACTA GGTAAAGGTA GTATGACTGC CCTACCTATC
ATCGAAACCC AAGCTGGTGA CGTATCTGCA TACATTCCTA CCAACGTAAT TTCCATCACA
GACGGTCAGA TTTTCTTGTC TTCCGACTTG TTTAACTCTG GTATCCGTCC CGCTGTAAAC
CCTGGTATCT CCGTATCCCG TGTAGGTTCT GCGGCACAAA CCAAGGCAAT GAAAAAAGTT
GCTGGTAAGA TTAAGTTAGA ATTGGCACAG TTTGATGACC TTCAAGCTTT CGCACAATTT
GCTTCTGACT TAGATAAAGC CACCCAAGAC CAGTTAGCAC GTGGTGTCCG CTTACGGGAA
CTCTTGAAGC AGCCCCAAAA CGACCCCCTC TCCGTAGCTG AACAAGTAGC AGTTCTTTAC
GCTGGTATTA ACGGTTATTT GGATGACATT GCTGTAAATA AAGTAACCAG CTTTGCTCAA
GGCCTACGCG ATTACTTGAA GACAGGAAAT ACAGCTTATT ACCAAGCAGT ACAAGATAGG
AAAGTCCTTG GTGATCCAGA AGAAGCAGCA TTGAAAGCCG CTATCTCTGA GTTCAAAAAG
ACCTTCCAAG CAGCAGCGTA A
 
Protein sequence
MSISIRPDEI SNIIQQQIEQ YNQEVKVANV GTVLQVGDGI ARIYGLEKAM AGELLEFEDG 
TIGIAQNLEE DNVGAVLMGE GRNIQEGSSV TATGRIAQVG VGEVLIGRVL DALGRAIDGK
GDPKTTETRL IESPAPGIIA RRSVHEPMQT GITAIDSMIP IGRGQRELII GDRQTGKTAI
AIDTIINQKG EDVVCVYVAI GQKASTVANV VQTLQEKGAM DYTVVVAANA SDPATLQFLA
PYTGATIAEY FMYKGKATLV IYDDLSKQAQ AYRQMSLLLR RPPGREAYPG DVFYIHSRLL
ERAAKLSDEL GKGSMTALPI IETQAGDVSA YIPTNVISIT DGQIFLSSDL FNSGIRPAVN
PGISVSRVGS AAQTKAMKKV AGKIKLELAQ FDDLQAFAQF ASDLDKATQD QLARGVRLRE
LLKQPQNDPL SVAEQVAVLY AGINGYLDDI AVNKVTSFAQ GLRDYLKTGN TAYYQAVQDR
KVLGDPEEAA LKAAISEFKK TFQAAA