Gene Aazo_1047 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_1047 
Symbol 
ID9338843 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp1119990 
End bp1121375 
Gene Length1386 bp 
Protein Length461 aa 
Translation table11 
GC content42% 
IMG OID 
ProductRND family efflux transporter subunit MFP 
Protein accessionYP_003720532 
Protein GI298490355 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0423833 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTTTTGG ACACAAATAA GTCACAAAAA CAGGTGAAAA TATCAACTCC TATAGTTACC 
AAACCAGTTA TCAATAACTA TAGTTTATTA ATGTTCTGTA TGCTGTTACT GGCATTACTA
ACAGCAAGTT GTGGTTCATT ACCAAAAGAA TCAGCCGGAG CCCAATCGAG GCGACCTGAT
GGTAGAGAAA GAGGTAATAG TGAAGCATCT GTAGATGTAG CGATCGCCCG AACTAACTTA
TTACGTCCTC CAGCAGGTTA TATCGGTAAC ACCACCCCAT TTAGGACAGT TTCTGTGCGA
TCGCAAGTAG AAGGAAGACT CATCGCCCTC AATTTAGATG TTGGAGATAC AGTCAACCGT
GGAGAAATTA TTAGCCAGTT AGACGACGTT CTACTGATGA CGGCATCACA GCAAGCAGAA
GCAGAACTAG CAACCAGTCA ATCAGAAGTC GCTAGGGCGA TGACACAAGT AAGTAATGCT
CAAGCAGAAG TCGAAAAAGC CAGATTAGAA GTTATCCAAG CCCAAGCAGA CTCCCAAAGA
CAACAACAAT TATTCAAAGC GGGAGCAATT TCCGAACAAG CCGCCCAACA AGCCAACACC
AAAGCCCAAA CAGTCCAAAA AGCCCTACAA GCCACCATTG AACAAGTCCG TACAGAAAAA
CAAGCTGTAG CCGCCGCGCA AGGTAGAGTA TTTGCCCAGC AAGCAGTAGT TAAAGCCGCC
AAAGAACGTC GTTCCTATTC CCGCTTAATC TCCCCCATCA CCGGCGTAGT CACAGAAAAA
GTCACAGAAC CTGGTAATCT TCTACAACCA GGAAGCGAAG TCTTAAAAAT TGCTGACTTG
AGTCGGATTA AAGTAGTAGT CCAAGTTTCC GAATTAGAAC TAGCAAAAAT ACAGGTCGGG
CAATCTGTAC AAGTGCGTTT AGATGCCTTC CCAGATCAAA CCATCATTGG TAGAGTAGCG
CGTATTTCTC CAACTGCTGA TAGCACAGCC AGGTTAGTAC CTATAGAAGT AGTCATTCCC
AATAGTGGCG GAAAAACTGG TAGCGGACTA CTAGCACGAG TCAATTTTAC GACCCAAACA
CCACAGCGAG TCGTGGTGTC ACAAACAGCA ATTAATGTAA CAGATCAACA AACAAAACCA
GAAAATACCA CAGGTACGAT ATTTATTCTT CAAGAAACCG ACGGTAAAGC CAAAGTAAAA
GAACAATCTG TAACTTTAGG GAAAAAAGCT AACGGTAGTG TAGAAATTCT CTCTGGCTTA
CAACCAGGAG AAAGTTATGT TGTTCGTAGT AGTAAACATT TAAAAGACAA TGAAGTTGTC
AAGTTATCAA TTTTGTCGGA AAAAGATTTG AAAGAACCAC AAAAAACACA AAAAAAGAAT
TTTTAG
 
Protein sequence
MVLDTNKSQK QVKISTPIVT KPVINNYSLL MFCMLLLALL TASCGSLPKE SAGAQSRRPD 
GRERGNSEAS VDVAIARTNL LRPPAGYIGN TTPFRTVSVR SQVEGRLIAL NLDVGDTVNR
GEIISQLDDV LLMTASQQAE AELATSQSEV ARAMTQVSNA QAEVEKARLE VIQAQADSQR
QQQLFKAGAI SEQAAQQANT KAQTVQKALQ ATIEQVRTEK QAVAAAQGRV FAQQAVVKAA
KERRSYSRLI SPITGVVTEK VTEPGNLLQP GSEVLKIADL SRIKVVVQVS ELELAKIQVG
QSVQVRLDAF PDQTIIGRVA RISPTADSTA RLVPIEVVIP NSGGKTGSGL LARVNFTTQT
PQRVVVSQTA INVTDQQTKP ENTTGTIFIL QETDGKAKVK EQSVTLGKKA NGSVEILSGL
QPGESYVVRS SKHLKDNEVV KLSILSEKDL KEPQKTQKKN F