Gene Aazo_1991 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_1991 
Symbol 
ID9339784 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp2069050 
End bp2070372 
Gene Length1323 bp 
Protein Length440 aa 
Translation table11 
GC content39% 
IMG OID 
Producttype III effector Hrp-dependent outers 
Protein accessionYP_003721185 
Protein GI298491008 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.28954 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATAACC AACCCAAAAT AATCGTTTTA GATGATGATC CAACAGGTTG TCAAACCGTC 
CACAGTTGTT TACTGTTGAT GCGTTGGCAT GTGGAAACTT TGCGTTTAGG ATTAAAAGAT
GATGCTCCCA TTTTCTTTAT TTTAACTAAT ACGCGATCAC TCACACCAGA AGCAGCTGCT
ACAGTCACTA GAGAAGTTTG TCATAATCTA AAAATCGCCT TAGCAGCAGA AAACATAGAT
GATTTTCTCG TAGTTAGTCG TTCTGACTCC ACCTTACGCG GACATTATCC CATCGAAACT
GATGTCATCG CTGAAGAACT CGGAGTATTT GATGCTCATT TTCTCGTCCC GGCATTTTTC
GAAGGTGGAA GAATTACCCG CGATAGCATC CATTATCTAA CTATTGATGG TATTCCTACC
CCAGTCCATG AAACAGAATT TGCCCGTGAT TCTGTCTTTG GTTACAATTA CAGCTACCTG
CCTAAATATG TAGAAGAGAA AACCAAAGGA CGCATCAAAG AAGATACTGT TACTAAATTT
TTACTGGCAG ATATTCGTAG TGGTAGTTTA GAGAAATTAT TGCAACTCCA TGATAATCAG
TGTGCAGTTG TAGACGGAGA AACCCAAATA GACCTTAACA CTTTTGCTGT TGATATATTA
ACAGCTGCAG CTCAAGGAAA ACGCTTTTTA TTCCGTTCGG CTGCTAGTAT TTTAACAGCT
TTAGCTGCCT TACCTCCCCA ACCCATAGCC GCTGAAAATA TGGGGAAATA TGTGCGTGGT
GGTAAACCAG GTGCAGTCAT AGTTGGTTCT CACGTTAGAA AAACGACTCA ACAATTAGAG
TCTTTATTAC AAACCCAGGG AACAGTAGGA ATTGAAATCA ATGTATCTAG ATTACTTGAT
CATGGAGAAG ATCAAACTGC CCAATTGCTG CAAGAAAGTT TAGCAGAAAT TAAGATAGTA
CATAATTCAG GAAAAACACC AGTAATTTAT ACGAGTCGTC AAGAATTGAT CTTTCAAGAT
GTCAAAACTA GATTAGACTT TGGGGCAACA GTTTCAGCTT TATTAATGGA TATTGTTCAG
GGTTTACCAG CAGATATAGG ATTTTTAATT AGCAAAGGTG GGATTACCTC CAACGATGTT
TTGAGTACGG GATTAGCGTT AACTTCAGCC AGATTACTGG GTCAAATTTT AGCAGGTTGT
TCAATGGTAA TTACACCATC TGATCATCCT CAGTTTCCTA ACTTGCCAGT GGTACTTTTT
CCTGGTAATG TCGGTAACGC CGATGCATTA GGTACAATTT ATGAAAGATT GACGGCAAAA
TAA
 
Protein sequence
MNNQPKIIVL DDDPTGCQTV HSCLLLMRWH VETLRLGLKD DAPIFFILTN TRSLTPEAAA 
TVTREVCHNL KIALAAENID DFLVVSRSDS TLRGHYPIET DVIAEELGVF DAHFLVPAFF
EGGRITRDSI HYLTIDGIPT PVHETEFARD SVFGYNYSYL PKYVEEKTKG RIKEDTVTKF
LLADIRSGSL EKLLQLHDNQ CAVVDGETQI DLNTFAVDIL TAAAQGKRFL FRSAASILTA
LAALPPQPIA AENMGKYVRG GKPGAVIVGS HVRKTTQQLE SLLQTQGTVG IEINVSRLLD
HGEDQTAQLL QESLAEIKIV HNSGKTPVIY TSRQELIFQD VKTRLDFGAT VSALLMDIVQ
GLPADIGFLI SKGGITSNDV LSTGLALTSA RLLGQILAGC SMVITPSDHP QFPNLPVVLF
PGNVGNADAL GTIYERLTAK