Gene Aazo_4141 details


Gene Information

Locus tag: Aazo_4141
Symbol:
ID: 9341946
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: 'Nostoc azollae' 0708
Kingdom: Bacteria
Replicon accession: NC_014248
Strand:
Start bp: 4212926
End bp: 4214599
Gene Length: 1674 bp
Protein Length: 557 aa
Translation table: 11
GC content: 40%
IMG OID:
Product: Ppx/GppA phosphatase
Protein accession: YP_003722700
Protein GI: 298492523
COG category:
COG ID:
TIGRFAM ID:
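
The coordinate and length fields in this record agree with one another, which a short sketch can confirm. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(4212926, 4214599)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1674 557
```

The result matches the Gene Length (1674 bp) and Protein Length (557 aa) fields.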


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTGAATG CAGTTTCAGC TAACTGGGAG AGTACACCTA CTCAACCAGT CAAGCAAAAC 
CCGATTATTG CGGCTATTGA TATCGGTACT AATTCCTTAC ATATCGTCAT AGTAAGAATT
GAACCGACGC TACCAGCTTT TACGATGATC GCCAGAGAAA AAGAAACGGT AAGATTAGGC
GAGCGCAACT TGGAAACTGG AGAACTCAAA CCAGAAGTGA TCAGAAAAGC GATCGCTTGT
TTGGGACGTT TCCAAAAACT TGCTAAAAGC CTAGAAGCAG AAAGCATTAT TGCAGTAGCA
ACCAGCGCCG TCCGCGAAGC CCCTAATGGG CAGGATTTTT TACAGAACAT AGAAAGCGAA
ATAGGCTTAA GCGTAGACTT GATTTCTGGT CAAGAAGAAG CCCGACGCAT CTATTTAGGT
GTTTTATCAG GGATGGAATT TAATCACGAA CCACACATCA TTATTGACAT TGGTGGTGGT
TCCACAGAAT TAATTTTAGG TGACTCTCAA GACCCCCGCA GCCTTACCAG CACGAAAGTA
GGTGCAGTGC GACTAACTGG AGAGTTAATT AACACCGACC CAATCAGCCA TTGTGAGTTT
CAATACTTAC AAGCTTATGC AAAAGGGATG TTAGAACGTT CTGTAGAAGA TGTACTTTTT
AAACTCAAAC CTGGTGAATC TCCCAAATTG GTGGGAACAT CAGGCACCAT TGAAACCTTA
GCAACTATTC ATGCTAAAGA AAAAATGGGT GTTGTTCCTT CTACTCTCAA CGGTTATCAA
TTTAGTCTTC AAGACTTGCG GACTTGGGTA ACTCGCTTAC GACGGATGAC CAATGTAGAA
AGGGCTGCAA TTTCAGGAAT GCCAGAAAAG CGGTCAGAAG TGATACTAGC TGGGGCGGTG
ATATTACAGG AAGCCATGAC CCTGTTAGAT GTGGATTCAG TTTCACTCTG TGAACGATCT
CTGCGAGAAG GTGTAATTGT CGATTGGATG CTGACACATG GTTTTATTGA CAACAAACTA
CGCTATCAAA GTTCGATTAG AGAACGTAAT GTTCTAAAAA TTGCTAAGAA ATACCATATT
AACTTAGAAA ATAGCAATGC TTGTGGCGAC CATAGCGATC ACATAGCTAA ATTTGCATTG
AGTTTATTTG ATCAAACTCA AAGTCAACTA CATAATTGGG GTCAACAAGA AAGACAATTG
CTTTGGGCTG CTGCCATTTT ACACAATTGT GGTCACTACA TCAGCCATTC TTCACACCAC
AAGCATTCAT ACTATTTGAT TAGAAATGGT GAATTACTTG GTTATAACGA AACTGAAATA
GAAATCATAG CTAATTTAGC CCGTTATCAC CGCAAATCAC CCCCTAAGAA AAAACACGAT
AACTACCGTA ATTTATTGCA TAAAGAACAT CGGCTCATAG TTTCTCAACT GAGTGCAATT
TTAAGATTGT CAGTAGCCTT AGATAGAAGA CAAATCGGTG CTATCTCTCA AGTGCAGTGT
GAATATATTC CCCAGAAACA TGAATTTAAA ATCTTGTTAT TCCCCAGAAT TTTAGGTGAT
GATTGTGCTT TAGAACTGTG GAGTTTAGAT TATAAGAAAG GTGTGTTTGA AGAAGAATTT
GGTTTAAAAT TAGACGCAAA TTTAGTTAAT ACTTGCAGCG TGAATTTTCC TTAG
 
Protein sequence
MLNAVSANWE STPTQPVKQN PIIAAIDIGT NSLHIVIVRI EPTLPAFTMI AREKETVRLG 
ERNLETGELK PEVIRKAIAC LGRFQKLAKS LEAESIIAVA TSAVREAPNG QDFLQNIESE
IGLSVDLISG QEEARRIYLG VLSGMEFNHE PHIIIDIGGG STELILGDSQ DPRSLTSTKV
GAVRLTGELI NTDPISHCEF QYLQAYAKGM LERSVEDVLF KLKPGESPKL VGTSGTIETL
ATIHAKEKMG VVPSTLNGYQ FSLQDLRTWV TRLRRMTNVE RAAISGMPEK RSEVILAGAV
ILQEAMTLLD VDSVSLCERS LREGVIVDWM LTHGFIDNKL RYQSSIRERN VLKIAKKYHI
NLENSNACGD HSDHIAKFAL SLFDQTQSQL HNWGQQERQL LWAAAILHNC GHYISHSSHH
KHSYYLIRNG ELLGYNETEI EIIANLARYH RKSPPKKKHD NYRNLLHKEH RLIVSQLSAI
LRLSVALDRR QIGAISQVQC EYIPQKHEFK ILLFPRILGD DCALELWSLD YKKGVFEEEF
GLKLDANLVN TCSVNFP
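
The relationship between the gene sequence and the protein sequence, and the GC content field, can be reproduced with a minimal sketch. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (it differs in which codons may act as start codons), and that the spaces in the sequence blocks are layout only. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order ("*" marks a stop codon).
# Assumption: table 11 shares these codon assignments with the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases and last 15 bases of the gene sequence above.
print(translate("ATGCTGAATG CAGTTTCAGC TAACTGGGAG"))  # MLNAVSANWE
print(translate("GTGAATTTTCCTTAG"))  # VNFP
```

Passing the full gene sequence to `translate` and `gc_percent` gives values that can be compared against the protein sequence and the GC content field of this record.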