Gene Aazo_5078 details


Gene Information

Locus tag: Aazo_5078
Symbol:
ID: 9342887
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: 'Nostoc azollae' 0708
Kingdom: Bacteria
Replicon accession: NC_014248
Strand:
Start bp: 5205197
End bp: 5207104
Gene Length: 1908 bp
Protein Length: 635 aa
Translation table: 11
GC content: 45%
IMG OID:
Product: outer membrane efflux protein
Protein accession: YP_003723296
Protein GI: 298493119
COG category:
COG ID:
TIGRFAM ID:
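
The start and end coordinates, gene length and protein length listed above are mutually consistent, and a short sketch can check that. It assumes 1-based inclusive coordinates (the page does not state its convention) and a CDS that ends in a stop codon; the function name is illustrative only.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon
    that is not counted in the protein length.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


print(cds_lengths(5205197, 5207104))  # (1908, 635)
```

The result matches the Gene Length (1908 bp) and Protein Length (635 aa) fields of this record.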


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.851054
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAAAGGAC AGCACTTATT TCATAGTTTC TTGCCTGGTG TAACAGCAGC AGTATTAACG 
ACTCAATCGG CTTGGGCTAA TACCCATAAA GTTGGTGACG TCAAAGTGGT GTCTTCTCCT
AGTATCTTGA CTGCAACCAA TGGGAAACCC TCAGTTGTGG ACGACATCAA CAGACAGCAG
CGCAATACCG CAGTTGATAA TTCTCCAGCT TTAGTAGCTA CTCTGGATTT TAGTAACCTG
AGTGTGACGT CTGTGGCTAG TCACGATGTT AGAGAAGCCT CTTTGGTTCA TAATGGCAGC
GTGACAAAAG AAAACAGCCC GAAAACACTT TTGCCAAATG CAGTGACTAG TGCAAAATTA
GCACAGTTAC GGGAAGGGGA AAAATGTTTA CAGGCAAGGC AAAAAAGCCA AGCTGCTATG
CTTCTGGCTT TAAAGGCTTG TTCACAAAAA AATGGGGCGT CCAAGAAGGT AGCTCAAGTT
ACTAATCCTG GTGATTCACA AACGCCTAAT ACTTCAGAGC CATCTGTTCC AAACGTAGAA
ACTGCCACAC CTGCACCGAC AGACTCTAAA GGGTCACCTC TGGAAAACCT GAATTCTAAT
CCTAATCCTC TGCTATTTCC CACAAAACCA GAGGAAGTCA GGCTTCAGAA AAATCAGCCC
ATTACTTTGG CCGAGGCGCT GGAACTGGCA AAGCAGAATA ACAACGAATT ACAAGTGTCG
ATATTGCAGT TAGAGCGCAG TAAATCTGCT CTCCGAGAAG CCCAGGCTGC CTTGTTGCCT
ACTTTGGGGG TGAATGGTAA TGTCACTAAT AGCCGTAGTT CTAGCAACAC ACTGCAAGCT
AAACAGCAAG AGAGGTTAAA TCCTCTGTCA CCTGATGCTG AATCGAATAC TAGCTTTGGT
GGTCAAGCAG AACTTACGTA TAATCTTTAT ACGTCTGGGA GACGAAATGC AGCAATTACA
GAGGCTGAGG AACAGGTACG ATTTCAGGAA TTAGATGTAG AAAGACAATC TGAAGAAATT
CGCCTGAATG TTGCCACAGA TTACTATAAT CTGCAACAGG CAGATGAAAA TGTACGTATT
TCTCAGTCGG CGGTACAGAA TTCTGAGGCT AGTTTGCGAG ATGCACAGGC TTTGGAAAGG
GCTGGAGTGG GTACAAGATT TGATATGTTG CGATCGCAGG TGAATTTAGC CAATGCCGAA
CAAGAGTTAA CTGATGCTCG TTCTCAGCAA GCCATCGCTC AGCGGCGGTT AGCTACTCGC
CTAAATCTGC CACAGTCGAT AAATATCACT GCTGCTGACC CTGTGCAGTT AGCTGGTCTT
TGGGAACGTG GTTTAGAAGA TAGCATTATC TTAGCTTATC AAAACCGCCC CGAACTACAA
CAGCAGTTGG CACAGCGGAA TATTAGCGAA CAACAAAGAA GACAAGCTCT AGCATCTTTA
GGACCGCAAA TTAGTTTAGT TGCCAGCTAT GACCTGCTAG ATGTGTTTAA TGATAGTATC
AACGTTAGCG ATGGTTATTC TGTGGGAGTG CGAGCCACCC TGAATTTATA TGATGGTGGT
GCAGCAAAAG CAAGAGCAGC CCAGGCTAAA ACTAATATTG CGATCGCTGA AACTAATTTT
GCAGAACAAC GTAACCAAAT CCGTTTTCAA GTAGAACAGG CTTATTCTAC CCAGATCGCT
AACTTGGAAA ACGTCCAAAC TTCCAACGCG GCTCTAGAGC AAGCCAAAGA GTCCCTACGG
TTAGCTCGTT TGCGATTCCA AGCTGGTGTA GGTACTCAAA CAGATGTGAT TAACGCGGAA
AATGAACTTA CCAGATCCGA AGGTAATCGC GTTCGCGCCA TCTTGAACTA CAACCGTGCT
TTAACCGAGT TACAACGCTA TGTAACATCT AGGGCTTTTA AGAAGTAA
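
The GC content field in the Gene Information section (45%) is the fraction of G and C bases in the gene sequence. A minimal sketch of that calculation follows. The function name is illustrative, and the input is assumed to be formatted as on this page, with spaces and line breaks between 10-base blocks. The example runs on the first 60 bases only, so its result need not equal the whole-gene figure.

```python
def gc_fraction(seq):
    """Return the fraction of G/C bases, ignoring whitespace between blocks."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First line of the gene sequence, copied as printed on this page.
first_line = "GTGAAAGGAC AGCACTTATT TCATAGTTTC TTGCCTGGTG TAACAGCAGC AGTATTAACG"
print(f"{gc_fraction(first_line):.0%}")
```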
 
Protein sequence
MKGQHLFHSF LPGVTAAVLT TQSAWANTHK VGDVKVVSSP SILTATNGKP SVVDDINRQQ 
RNTAVDNSPA LVATLDFSNL SVTSVASHDV REASLVHNGS VTKENSPKTL LPNAVTSAKL
AQLREGEKCL QARQKSQAAM LLALKACSQK NGASKKVAQV TNPGDSQTPN TSEPSVPNVE
TATPAPTDSK GSPLENLNSN PNPLLFPTKP EEVRLQKNQP ITLAEALELA KQNNNELQVS
ILQLERSKSA LREAQAALLP TLGVNGNVTN SRSSSNTLQA KQQERLNPLS PDAESNTSFG
GQAELTYNLY TSGRRNAAIT EAEEQVRFQE LDVERQSEEI RLNVATDYYN LQQADENVRI
SQSAVQNSEA SLRDAQALER AGVGTRFDML RSQVNLANAE QELTDARSQQ AIAQRRLATR
LNLPQSINIT AADPVQLAGL WERGLEDSII LAYQNRPELQ QQLAQRNISE QQRRQALASL
GPQISLVASY DLLDVFNDSI NVSDGYSVGV RATLNLYDGG AAKARAAQAK TNIAIAETNF
AEQRNQIRFQ VEQAYSTQIA NLENVQTSNA ALEQAKESLR LARLRFQAGV GTQTDVINAE
NELTRSEGNR VRAILNYNRA LTELQRYVTS RAFKK