Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Aazo_1047 |
Symbol | |
ID | 9338843 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | 'Nostoc azollae' 0708 |
Kingdom | Bacteria |
Replicon accession | NC_014248 |
Strand | - |
Start bp | 1119990 |
End bp | 1121375 |
Gene Length | 1386 bp |
Protein Length | 461 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | |
Product | RND family efflux transporter subunit MFP |
Protein accession | YP_003720532 |
Protein GI | 298490355 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0423833 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTTTTGG ACACAAATAA GTCACAAAAA CAGGTGAAAA TATCAACTCC TATAGTTACC AAACCAGTTA TCAATAACTA TAGTTTATTA ATGTTCTGTA TGCTGTTACT GGCATTACTA ACAGCAAGTT GTGGTTCATT ACCAAAAGAA TCAGCCGGAG CCCAATCGAG GCGACCTGAT GGTAGAGAAA GAGGTAATAG TGAAGCATCT GTAGATGTAG CGATCGCCCG AACTAACTTA TTACGTCCTC CAGCAGGTTA TATCGGTAAC ACCACCCCAT TTAGGACAGT TTCTGTGCGA TCGCAAGTAG AAGGAAGACT CATCGCCCTC AATTTAGATG TTGGAGATAC AGTCAACCGT GGAGAAATTA TTAGCCAGTT AGACGACGTT CTACTGATGA CGGCATCACA GCAAGCAGAA GCAGAACTAG CAACCAGTCA ATCAGAAGTC GCTAGGGCGA TGACACAAGT AAGTAATGCT CAAGCAGAAG TCGAAAAAGC CAGATTAGAA GTTATCCAAG CCCAAGCAGA CTCCCAAAGA CAACAACAAT TATTCAAAGC GGGAGCAATT TCCGAACAAG CCGCCCAACA AGCCAACACC AAAGCCCAAA CAGTCCAAAA AGCCCTACAA GCCACCATTG AACAAGTCCG TACAGAAAAA CAAGCTGTAG CCGCCGCGCA AGGTAGAGTA TTTGCCCAGC AAGCAGTAGT TAAAGCCGCC AAAGAACGTC GTTCCTATTC CCGCTTAATC TCCCCCATCA CCGGCGTAGT CACAGAAAAA GTCACAGAAC CTGGTAATCT TCTACAACCA GGAAGCGAAG TCTTAAAAAT TGCTGACTTG AGTCGGATTA AAGTAGTAGT CCAAGTTTCC GAATTAGAAC TAGCAAAAAT ACAGGTCGGG CAATCTGTAC AAGTGCGTTT AGATGCCTTC CCAGATCAAA CCATCATTGG TAGAGTAGCG CGTATTTCTC CAACTGCTGA TAGCACAGCC AGGTTAGTAC CTATAGAAGT AGTCATTCCC AATAGTGGCG GAAAAACTGG TAGCGGACTA CTAGCACGAG TCAATTTTAC GACCCAAACA CCACAGCGAG TCGTGGTGTC ACAAACAGCA ATTAATGTAA CAGATCAACA AACAAAACCA GAAAATACCA CAGGTACGAT ATTTATTCTT CAAGAAACCG ACGGTAAAGC CAAAGTAAAA GAACAATCTG TAACTTTAGG GAAAAAAGCT AACGGTAGTG TAGAAATTCT CTCTGGCTTA CAACCAGGAG AAAGTTATGT TGTTCGTAGT AGTAAACATT TAAAAGACAA TGAAGTTGTC AAGTTATCAA TTTTGTCGGA AAAAGATTTG AAAGAACCAC AAAAAACACA AAAAAAGAAT TTTTAG
|
Protein sequence | MVLDTNKSQK QVKISTPIVT KPVINNYSLL MFCMLLLALL TASCGSLPKE SAGAQSRRPD GRERGNSEAS VDVAIARTNL LRPPAGYIGN TTPFRTVSVR SQVEGRLIAL NLDVGDTVNR GEIISQLDDV LLMTASQQAE AELATSQSEV ARAMTQVSNA QAEVEKARLE VIQAQADSQR QQQLFKAGAI SEQAAQQANT KAQTVQKALQ ATIEQVRTEK QAVAAAQGRV FAQQAVVKAA KERRSYSRLI SPITGVVTEK VTEPGNLLQP GSEVLKIADL SRIKVVVQVS ELELAKIQVG QSVQVRLDAF PDQTIIGRVA RISPTADSTA RLVPIEVVIP NSGGKTGSGL LARVNFTTQT PQRVVVSQTA INVTDQQTKP ENTTGTIFIL QETDGKAKVK EQSVTLGKKA NGSVEILSGL QPGESYVVRS SKHLKDNEVV KLSILSEKDL KEPQKTQKKN F
|
| |