Gene Aazo_4738 details


Gene Information

Locus tag: Aazo_4738
Symbol:
ID: 9342545
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: 'Nostoc azollae' 0708
Kingdom: Bacteria
Replicon accession: NC_014248
Strand:
Start bp: 4843528
End bp: 4846002
Gene Length: 2475 bp
Protein Length: 824 aa
Translation table: 11
GC content: 47%
IMG OID:
Product: ATPase AAA-2 domain-containing protein
Protein accession: YP_003723055
Protein GI: 298492878
COG category:
COG ID:
TIGRFAM ID:
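
The start, end, and length fields above are mutually consistent. The following is a minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon; the function names are hypothetical and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(4843528, 4846002)
print(length, protein_length_aa(length))  # prints "2475 824"
```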


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTTGAAC GCTTCACAGA AAAAGCCATT AAGGTAATCA TGCTGGCCCA AGAAGAGGCC 
CGCCGTTTAG GTCACAACTT TGTTGGAACC GAGCAGATCC TCTTGGGTCT CATAGGGGAA
GGGACAGGAG TGGCCGCCAA GGTGCTGAAA TCAATGGGCG TTAATCTTAA AGATGCCCGC
ATTGAAGTTG AAAAAATCAT AGGTCGGGGT TCAGGCTTTG TAGCCGTGGA AATTCCGTTT
ACGCCACGGG CAAAGCGAGT TCTAGAACTA TCCTTGGAAG AAGCACGCCA ACTGGGGCAT
AACTACATAG GCACCGAGCA TCTGCTGTTG GGCCTCATCC GCGAAGGGGA AGGTGTAGCA
GCCAGGGTGT TAGAAAACCT CGGTGTGGAT TTATCTAAGG TGAGAACCCA GGTAATCCGT
ATGTTGGGAG AAACAGCCGA AGTTTCACCA GGGGGAGGTT CATCTGGTCG CACGAAAACC
CCGACTTTGG ATGAATTTGG TTCCAACCTG ACCCAAATGG CCACAGACAA TAAACTTGAT
CCTGTGGTGG GACGTGCTAA AGAAATTGAG CGCGTGATTC AAATTTTGGG TCGCCGGACT
AAGAATAATC CAGTGCTAAT TGGTGAACCA GGGGTAGGTA AAACGGCTAT TGCTGAAGGT
CTAGCGAGCC GGATAGCTAA CAAAGATGTC CCCGACATTC TCGAAGATAA GCGTGTTGTT
ACTCTTGATA TTGGCTTGCT GGTAGCAGGA ACGAAATACC GGGGTGAATT TGAAGAACGC
CTAAAAAAAA TCATGGATGA AATCCGTTCT GCGGGTAATG TCATCCTTGT TATTGACGAG
GTTCACACTT TAATCGGTGC AGGTGCGGCT GAAGGGGCGA TTGATGCAGC GAATATCCTC
AAGCCAGCTT TGGCACGGGG TGAGTTGCAG TGTATCGGTG CGACCACTTT AGATGAGTAT
CGTAAACACA TTGAACGGGA TGCAGCGTTG GAAAGACGCT TCCAGCCTGT GATGGTAGGT
GAGCCTTCTG TTGATGAAAC AATTGAAATT TTATATGGTT TGCGCGATCG CTACGAGCAA
CACCACAAGT TAAAAATCTC CGATGAAGCT TTAGTCGCGG CGGCGAAGTT GTCTGATCGA
TATATTAGCG ATCGCTACCT CCCAGATAAA GCCATCGACT TGGTTGATGA AGCAGGCTCA
CGGGTGCGCT TGATTAACTC CCAGCTACCC CCAGCAGCTA AAGAGTTAGA CAAGGAATTG
CGGCAAATCT TAAAAGAAAA AGATGATGCT GTCCGTTCTC AGGACTTTGA CCGAGCCGGA
GAACTGCGGG ACAGAGAAAT GGAAATCAAA GCGGAAATAC GCACCATTGC TCAAACCAAA
ACCAACGCGG CTGGTGGTGA CGGTGTTGAA CCTGTAGTCA CAGAAGAAGA CATCGCTCAC
ATTGTTGCTT CTTGGACAGG TGTACCAGTG AACAAACTCA CTGAATCTGA ATCCGAGAAG
TTGCTACACA TGGAAGACAC CTTACATCAG CGCCTCATCG GTCAAGATGA TGCTGTGAAG
GCTGTTTCCC GCGCTATTCG TCGCGCTCGT GTTGGTTTGA AGAATCCTAA TCGGCCAATT
GCCAGTTTTG TCTTCTCTGG TCCAACTGGG GTTGGTAAAA CTGAGTTGGC TAAATCCTTG
GCTTCTTACT TCTTCGGTTC TGAAGAAGCA ATGATTCGCT TGGATATGTC CGAGTACATG
GAACGTCACA CCGTTAGTAA ACTGATTGGT TCACCTCCAG GTTACGTTGG TTACAACGAA
GGTGGTCAAT TAACAGAAGC TGTACGCCGT CGTCCTTATA CAGTAGTGCT GTTTGACGAA
ATCGAAAAAG CACACCCCGA TGTGTTCAAC ATGCTGCTGC AAATTTTGGA AGATGGTCGT
TTAACTGACG CGAAGGGTCG CACCGTGGAC TTTAAGAACA CCTTGCTGAT TTTGACTTCC
AACATTGGTT CTAAGGTAAT TGAAAAAGGT GGTAGCGGTA TTGGTTTCGA GTTTGCTGAG
GATGCGACCG AATCTCAATA CAACCGGATT CGTTCTTTGG TGAACGAGGA ACTAAAGCAA
TACTTCCGTC CTGAGTTCTT AAACCGTTTG GATGAAATTA TTGTCTTCCG TCAGTTGAAC
AAGCTTGAAG TTACCCAAAT CGCCGAAATC ATGCTGAAGG AAGTGTTCGG TCGGTTAACA
GAAAAAGGCA TTACTTTAGA AGTGAGTGAT CGCTTCAAAG AGCGCCTAGT TCAAGAGGGT
TACAGTCCCA GCTACGGTGC AAGGCCATTA CGTCGGGCAA TTATGCGCTT GTTAGAAGAT
AGTTTAGCGG AAGAAATTCT ATCTGGACGC ATCAAAGACG GTGATACTGC TCTTGTTGAT
GTTGATGAAA ATGGCATTGT TCAAGTTAGT TCTCAACCGC GTCGGGAGTT GTTACCCCAG
GGTGTTGAGT CATAG
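
The gene sequence above is printed in blocks of 10 bases, so it has to be cleaned before its length or GC content can be checked against the Gene Information fields. A minimal sketch, with hypothetical helper names; the GC value is computed as a simple fraction of G and C bases, which may differ from how the database rounds its reported figure.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First two 10-base blocks of the gene sequence, as formatted on this page.
example = clean_sequence("ATGTTTGAAC GCTTCACAGA")
print(len(example), gc_fraction(example))
```

Applied to the full sequence, `len(clean_sequence(...))` can be compared with the Gene Length field and `gc_fraction(...)` with the GC content field.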
 
Protein sequence
MFERFTEKAI KVIMLAQEEA RRLGHNFVGT EQILLGLIGE GTGVAAKVLK SMGVNLKDAR 
IEVEKIIGRG SGFVAVEIPF TPRAKRVLEL SLEEARQLGH NYIGTEHLLL GLIREGEGVA
ARVLENLGVD LSKVRTQVIR MLGETAEVSP GGGSSGRTKT PTLDEFGSNL TQMATDNKLD
PVVGRAKEIE RVIQILGRRT KNNPVLIGEP GVGKTAIAEG LASRIANKDV PDILEDKRVV
TLDIGLLVAG TKYRGEFEER LKKIMDEIRS AGNVILVIDE VHTLIGAGAA EGAIDAANIL
KPALARGELQ CIGATTLDEY RKHIERDAAL ERRFQPVMVG EPSVDETIEI LYGLRDRYEQ
HHKLKISDEA LVAAAKLSDR YISDRYLPDK AIDLVDEAGS RVRLINSQLP PAAKELDKEL
RQILKEKDDA VRSQDFDRAG ELRDREMEIK AEIRTIAQTK TNAAGGDGVE PVVTEEDIAH
IVASWTGVPV NKLTESESEK LLHMEDTLHQ RLIGQDDAVK AVSRAIRRAR VGLKNPNRPI
ASFVFSGPTG VGKTELAKSL ASYFFGSEEA MIRLDMSEYM ERHTVSKLIG SPPGYVGYNE
GGQLTEAVRR RPYTVVLFDE IEKAHPDVFN MLLQILEDGR LTDAKGRTVD FKNTLLILTS
NIGSKVIEKG GSGIGFEFAE DATESQYNRI RSLVNEELKQ YFRPEFLNRL DEIIVFRQLN
KLEVTQIAEI MLKEVFGRLT EKGITLEVSD RFKERLVQEG YSPSYGARPL RRAIMRLLED
SLAEEILSGR IKDGDTALVD VDENGIVQVS SQPRRELLPQ GVES
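
The protein sequence above can be reproduced from the gene sequence by codon-by-codon translation. The sketch below is one way to do that, not the database's own method. It assumes the usual TCAG-ordered codon layout and the amino acid assignments of translation table 11, which for sense and stop codons match the standard code; `translate` is a hypothetical helper name.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first base varies slowest).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # tolerate the 10-base block layout
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence on this page.
print(translate("ATGTTTGAAC GCTTCACAGA AAAAGCCATT"))  # prints "MFERFTEKAI"
print(translate("GGTGTTGAGT CATAG"))                   # prints "GVES"
```

Both outputs agree with the first ten and last four residues of the protein sequence listed above.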