Gene Aazo_1000 details


Gene Information

Locus tag: Aazo_1000
Symbol:
ID: 9338795
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: 'Nostoc azollae' 0708
Kingdom: Bacteria
Replicon accession: NC_014248
Strand:
Start bp: 1058042
End bp: 1060489
Gene Length: 2448 bp
Protein Length: 815 aa
Translation table: 11
GC content: 44%
IMG OID:
Product: ATPase AAA-2 domain-containing protein
Protein accession: YP_003720495
Protein GI: 298490318
COG category:
COG ID:
TIGRFAM ID:
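
The coordinate and length fields above are linked by simple arithmetic. The sketch below is only an illustration: it assumes the coordinates are 1-based and inclusive (the record does not state the convention) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1058042, 1060489)
print(length, protein_length(length))  # prints: 2448 815
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.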


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0499037
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTTGAAC ACTTCACGTC CGAAGCCATT AGGGTAATTA TGTTAGCTCA GGAGGAAGCA 
CGCCGCCTGG GACACAATTT CGTAGGCACT GAACAAATTC TCCTGGGTTT AATGGGAGAA
GGAACCGGAG TGGCTGCGAA AGTGCTAGCT GAGTTGGGTG TTACCCTGAA AGATGCGCGT
CGGGAAGTAG AGAAAATTAT TGGTCGGGGT TCTGGTTTTG TTCCGCCGGA AATTCCTTTT
ACACCAAAAG TAAAAAGTCT GTTTGAGCAA TCTTTTAGAG AAGCTCATAG TCTTGGACAC
AACTACATAA ACACTGAACA TTTATTGTTA GGTTTAACTG AAGCCGGGGA AGGAGTGGCG
GCCAAAGTTC TGCAAAATTT GGGAGTTGAG TTGCAAGGTA TCCGTGCTGC TGTTATTAGT
CGTTTGGGTG AAGATGTAAC TGTTTTCGCA GGCACTGTAA GCGGGTCTAA GCGTAATCAA
AACCTAAGTA TAGAAGAGTT TGGTAGAAAT CTGACCAAAA TGGCTCAGGA TGGCAAGCTT
GATCCTGTTG TTGGTCGTCA ACGAGAAATT GAGCGCACGG TGCAAATTTT GGGTCGTCGC
ACCAAAAATA ACCCGGTTTT AATTGGAGAA CCAGGTGTTG GTAAAACTGC TATCGCAGAA
GGTTTAGCCC AACGTATCAT TAACCAAGAT GTACCAGAAG TGCTGTTGAA CAAGCAAGTC
ATCAGTCTGG ACATGGGTTT ACTAGTAGCT GGAACTCGTT TCCGTGGCGA CTTTGAGGAA
CGCCTGAAAA AAATCATGGA TGAAATTCGA TCAGAAGGCA ATATCATCCT GGTGATTGAT
GAAATTCACA CCTTAGTCGG TGCAGGTGGT ACAGAAGGCG GTTTAGATGC AGCTAACATC
CTGAAACCAG CTTTAGCAAG AGGTGAACTC CAATGTATTG GGGCAACCAC CTTGGATGAA
TACCGTAAAC ACATTGAGCG TGATGCGGCT TTAGAACGGC GTTTTCAACC AATTTTGGTG
GGAGAACCTT CTGTAGGAGA AACCATTGAG ATTCTCTATG GGTTGCGTAG TGCTTATGAA
CAACATCATA AAGTCACCAT CTCTGATGCA GCTGTAGTAG TAGCAGCACA GTTATCCGAT
AGATATATTA GTGATCGCTT CCTACCGGAC AAAGCTATAG ACTTAATTGA TGAAGCTGGT
TCTCGTGTAC GTTTACGTCA CTCCCGCATC ATCAACAATA AAGAAATCAA ACTGCAACTC
AAAAACATCA GCAAAGACAA AGCAGAAGCT ATCAGAGTTC AGGATTTTGG TAAAGCTAGT
AAACTCAATC AAGAAGAACT AGAACTTCAG GCCAAAATAG ACTTAGAAGA TAACCTGCAA
ACAGTTAAAG CGATCGTTGA CGAAGAAGAC ATCGCCCAAA TCGTTGCCTC TTGGACAGGT
GTCCCAGTTA ACAAACTCAC CGAATCAGAA TCAGAGTTAC TACTGCACCT AGAAGACACC
CTGCACAAAC GCCTCATCGG TCAAGAACAA GCAGTTGCAG CCGTTTCTCG TTCCATCCGT
CGCGCCCGTG TCGGCTTAAA GAATCCTAAG CGTCCCATCG CCAGCTTTAT CTTCTCTGGT
CCGACAGGAG TAGGGAAAAC CGAACTAGCC AAAGCCCTAG CCGCTTACTT CTTCGGTGCA
GGAGATTCCA TGATTCGCTT GGATATGTCC GAATACATGG AAAGCCATAA CGTTTCCAAA
CTTATCGGTT CACCTCCAGG TTACGTAGGC TACGACGAAG GCGGACAACT TACAGAAGCA
GTAAGACGTA AACCATACAC GGTGCTACTT TTCGACGAAA TTGAAAAAGC GCACTCTGAT
GTATTTAATA TGCTGCTACA AATCTTGGAT GAAGGACACC TCACCGATGC TAAAGGTCGT
AAAGTAGACT TCAAGAACAC CTTAATCATC TTAACTTCCA ATATTGGTTC TAAGGTAATT
GAGAAAGGCG GTATCAGTTT AGGCTTTGAA TTTGATAATC AAGCCGACGC TAGTTATAAC
GGTATCCGTA AATTGGTAAA TGAAGAACTG AAAGCTTATT TCCGTCCTGA ATTCCTCAAC
CGTGTTGATG ATATTATCGT CTTCACCCAG TTGAATAAAG AAGAAGTTAA GCAAATCGCC
GAAATCATGC TGCATGATGT TGCTAACCGA TTAAAAGACC GAGGAATTAA ACTCGAAGTC
ACAGAAAGCT TCAAAGAACT GGTTGTCAGA GAAGGTTATG ACCCAAGCTA CGGTGCTAGA
CCTTTACGTC GAGCTATTAT GCGTCTGTTA GAAGATTCTT TAGCTGAGGC TATCTTATCT
AGTCACATCC TTGAAGGTGA TACAGCCATT GTCGATGTTG ATGATGATGG TCAGGTAACA
GTCAGAAAAG CAGAAACCCG CGAATTCCTG TTAGCTAATG TTGGCTAA
 
Protein sequence
MFEHFTSEAI RVIMLAQEEA RRLGHNFVGT EQILLGLMGE GTGVAAKVLA ELGVTLKDAR 
REVEKIIGRG SGFVPPEIPF TPKVKSLFEQ SFREAHSLGH NYINTEHLLL GLTEAGEGVA
AKVLQNLGVE LQGIRAAVIS RLGEDVTVFA GTVSGSKRNQ NLSIEEFGRN LTKMAQDGKL
DPVVGRQREI ERTVQILGRR TKNNPVLIGE PGVGKTAIAE GLAQRIINQD VPEVLLNKQV
ISLDMGLLVA GTRFRGDFEE RLKKIMDEIR SEGNIILVID EIHTLVGAGG TEGGLDAANI
LKPALARGEL QCIGATTLDE YRKHIERDAA LERRFQPILV GEPSVGETIE ILYGLRSAYE
QHHKVTISDA AVVVAAQLSD RYISDRFLPD KAIDLIDEAG SRVRLRHSRI INNKEIKLQL
KNISKDKAEA IRVQDFGKAS KLNQEELELQ AKIDLEDNLQ TVKAIVDEED IAQIVASWTG
VPVNKLTESE SELLLHLEDT LHKRLIGQEQ AVAAVSRSIR RARVGLKNPK RPIASFIFSG
PTGVGKTELA KALAAYFFGA GDSMIRLDMS EYMESHNVSK LIGSPPGYVG YDEGGQLTEA
VRRKPYTVLL FDEIEKAHSD VFNMLLQILD EGHLTDAKGR KVDFKNTLII LTSNIGSKVI
EKGGISLGFE FDNQADASYN GIRKLVNEEL KAYFRPEFLN RVDDIIVFTQ LNKEEVKQIA
EIMLHDVANR LKDRGIKLEV TESFKELVVR EGYDPSYGAR PLRRAIMRLL EDSLAEAILS
SHILEGDTAI VDVDDDGQVT VRKAETREFL LANVG
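
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. A minimal sketch of that clean-up follows, with a GC-fraction helper of the kind that could be compared against the GC content field. The helper names are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGTTTGAAC ACTTCACGTC CGAAGCCATT AGGGTAATTA TGTTAGCTCA GGAGGAAGCA"
seq = clean_sequence(first_line)
print(len(seq))  # prints: 60
```

Applying `clean_sequence` to the full gene sequence and passing the result to `gc_fraction` would give a value to compare with the reported 44%.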