Gene Aazo_5105 details


Gene Information

Locus tag: Aazo_5105
Symbol:
ID: 9342913
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: 'Nostoc azollae' 0708
Kingdom: Bacteria
Replicon accession: NC_014248
Strand:
Start bp: 5236794
End bp: 5238290
Gene Length: 1497 bp
Protein Length: 498 aa
Translation table: 11
GC content: 37%
IMG OID:
Product: secretion protein HlyD family protein
Protein accession: YP_003723308
Protein GI: 298493131
COG category:
COG ID:
TIGRFAM ID:
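
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the variable names are hypothetical, and it assumes that the start and end positions are 1-based and inclusive and that the reported protein length excludes the terminal stop codon. Neither assumption is stated in the record itself.

```python
# Values copied from the Gene Information fields above.
start_bp = 5236794
end_bp = 5238290
gene_length_bp = 1497
protein_length_aa = 498

# Assumption: coordinates are 1-based and inclusive,
# so the span length is end - start + 1.
computed_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not
# counted in the reported protein length.
codon_count = computed_length // 3
computed_protein_length = codon_count - 1

print(computed_length)            # span length in bp
print(computed_length % 3)        # 0 if the CDS is a whole number of codons
print(computed_protein_length)    # residues excluding the stop codon
```

Under those assumptions the computed values agree with the reported Gene Length (1497 bp) and Protein Length (498 aa).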


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAATATT CCCTCGCTGC AAATGCTGTT CAAGCACGTC AAACAAAAGA GAGATTCGCA 
AAACCAGAGG AACAATTATC TTATGAATTA GGTAAAGCTG TACAGGAATT ACCACCGCTA
TATGCAAGAT TATTAGCGGG AACAATTAGC GTGATTATAT TTGGCACAAT TTCCTGGGCG
CATTTTTCAG AAATTGATGA AGTAGCGACA GCACCAGGAG AATTAATTGC TTCTACTCAA
GTTAGACCAG TGACATCTTT GGGTAGTGGA TATATTTTAG CAGTGAAAGT CAAAGAGGGC
GATCACGTTA CCAAAGATCA AATTTTAATA GAACGTGACC CCAACTTACA ACAAACCGAT
GTTAACCGAC TGGCTAAAGC TATTAAATTA ATTGAGGATG ACTTGCAGCG TTTAGAAGCA
GAACGTATCC GCGAAAAAAC TGCCGGAACA AAACTGCAAG ACGAACTTTT AAATTCTCGT
TTATTAGACT ACAAAGCCAA ACAAGCAGCA GCAGAAGCAG AAGCACAACG CCAACTTTCA
ATTATCAATC AAGCAAAAGT TCGTTTAAGT CGCTTACAAG AAAATTTAGC CAACGCCCAA
ACCAGCTTTA CTAATGCTCA AACTAACCTA GTGAATGCTG AAGGCATCCG TGCTAAAGTT
GATAATAATT TAACCATAGC TCAAAAAAGA GAAGAAAATC TCCGCGCTTT ATTAAATCCC
GGTGCAGTTC CCAGAGTTGA TTATTTAGAA GCACAGGAAA GATTAAATCG TGCCAGTACA
GACATTATTA AAAGCTCAGA TGAAGTAACT AATACCAAAA ATAGACTGAC AGAAGCAAAA
GATAAAGTTA GATCTTTAGA AAAAGATATT GCTGCTCAAG ACCAAGAAAT TCGCCAAGCA
GAACAAGCTT ATCAAGCAGC ACGTAATCAA GGCTTACGTT TAGCATCAGA ACGCCAAAGT
GAAATTTTAA CCCAAATCAA TAAACGCAAA GAAGAATTAA CTAATGTTGC GGGTCAATTA
GAACAAGCAA AAATGCAGAA AGATAGGGAA ACTATTAAAG CACCTGTCGC GGGAACAATT
TACAAAATTA AAGCTACTAA AGGTCCCATT CAATCTGGTG AAGAATTGCT ATCAATTGCA
CCAGAAGGTG AAGAAATGCT TTTAGAAGTA AAAGTTCTTA ACCGCGATAT TGGCTTTATT
CGTCAGAATA TGAAAGCAAA AGTTAAATTA GAAGCTTTTC CTTTTCAAGA ATTTGGAGTT
GTTGATGGTG AGGTTTTACA AATTAGTCCC AATGCAGTAG TTGATAAAGA CTTGGGTTTA
GTTTTCTCAA CCAGAATTAA ATTGACTCAA CATTCAATGA ATCTCCGAGG ACAAGAAGTG
GAATTTACTC CAGGAATGGC TGCGAATGCA GAGATTATCA CTCGTGAGAA ATCAATTCTG
ACCTTCATAG TTGAGCCAAT TACCCGCAGG TTTAGTGATG CTTTTTCTGT TAGATAA
 
Protein sequence
MKYSLAANAV QARQTKERFA KPEEQLSYEL GKAVQELPPL YARLLAGTIS VIIFGTISWA 
HFSEIDEVAT APGELIASTQ VRPVTSLGSG YILAVKVKEG DHVTKDQILI ERDPNLQQTD
VNRLAKAIKL IEDDLQRLEA ERIREKTAGT KLQDELLNSR LLDYKAKQAA AEAEAQRQLS
IINQAKVRLS RLQENLANAQ TSFTNAQTNL VNAEGIRAKV DNNLTIAQKR EENLRALLNP
GAVPRVDYLE AQERLNRAST DIIKSSDEVT NTKNRLTEAK DKVRSLEKDI AAQDQEIRQA
EQAYQAARNQ GLRLASERQS EILTQINKRK EELTNVAGQL EQAKMQKDRE TIKAPVAGTI
YKIKATKGPI QSGEELLSIA PEGEEMLLEV KVLNRDIGFI RQNMKAKVKL EAFPFQEFGV
VDGEVLQISP NAVVDKDLGL VFSTRIKLTQ HSMNLRGQEV EFTPGMAANA EIITREKSIL
TFIVEPITRR FSDAFSVR
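
The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons. The sketch below is a minimal illustration, not the database's own method: `translate` is a hypothetical helper, and the codon table is built by hand on the assumption that translation table 11 assigns amino acids to codons in the same way as the standard genetic code (the two are understood to differ in which start codons are permitted). Only the first 30 and last 27 nucleotides of the gene sequence are used.

```python
from itertools import product

# Codons enumerated with bases in the order T, C, A, G (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate DNA in frame 1; whitespace is ignored, '*' marks a stop codon."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # drop any incomplete trailing codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

# First 30 nt and last 27 nt of the gene sequence listed above.
first_30 = "ATGAAATATT CCCTCGCTGC AAATGCTGTT"
last_27 = "TTTAGTGATG CTTTTTCTGT TAGATAA"

print(translate(first_30))  # compare with the start of the protein sequence
print(translate(last_27))   # compare with the end of the protein sequence
```

The first fragment translates to MKYSLAANAV and the second to FSDAFSVR followed by a stop codon, matching the first and last residues of the protein sequence.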