Gene Aazo_3919 details

Gene Information

Locus tag: Aazo_3919
Symbol:
ID: 9341723
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: 'Nostoc azollae' 0708
Kingdom: Bacteria
Replicon accession: NC_014248
Strand:
Start bp: 3981326
End bp: 3982570
Gene Length: 1245 bp
Protein Length: 414 aa
Translation table: 11
GC content: 39%
IMG OID:
Product: DevB family ABC exporter membrane fusion protein
Protein accession: YP_003722545
Protein GI: 298492368
COG category:
COG ID:
TIGRFAM ID:
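
The length fields in this record agree with one another if the start and end coordinates are 1-based and inclusive, and if the final codon is a stop codon that is not counted in the protein length. Both are assumptions, since the record does not state its conventions. A minimal sketch of that check follows; the function and constant names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3981326
END_BP = 3982570
GENE_LENGTH_BP = 1245
PROTEIN_LENGTH_AA = 414


def gene_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by a CDS, assuming exactly one terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


if __name__ == "__main__":
    print(gene_length(START_BP, END_BP))   # 1245
    print(protein_length(GENE_LENGTH_BP))  # 414
```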


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATAACTA TCTATGAGCA AAAATGGAAT AAAAATATGG CATTAAATAA AGAACGCCGG 
TTATTGAAAA GAGCCGCTCA GAAGTGGAAA ATAATTTTAG CCGCTTCCAT TGCTTTAGCT
ACAGGGTTAC TATCTTTTTA CAGTTTTTCT CAATTAAAAT TGAAACCTAA TTCTCAGATT
CCTTTTTCCT TACGTAGTTC CCCCAAACCA ACTCCTGTAA AACTTGCCGT AACTGCTTTG
GGACGCTTGC AACCACAAGG TAAAATTACT TATTTATCTG CTCCCAATTC CATTAACGGT
GTGCGTGTAG AAAAACTGTT AGTTAGTGAA GGAGACGAGG TAAAACCGGG GCAAGTCTTA
GCTTATTTAG ATGACTATGC TCGTTCTAAG ATAGCGATAA AACAAGCCTT TGATAAGTTG
CTAATCGCTA AAGCTAAACT AGCGCAGGTA AAAGCTGGGG CCAAACCCGG AGATATCAAT
GCTCAAAAGG CAACAATGGC TCGGTTAGAT TCGCAACTAA AAGGAGAAAA TGCGGCTCAA
ACAGCTACAA TTAACCGTAT TCAGGCTGAA GTAGATAATG CTCAAAAAGA GAGCGATCGC
TATCAGCAAT TGTATAAAGA CGGTGCCATT TCTGCTTCCA TTGCAGATAC CAAAGCCTTA
CAACTGAAGA CATCACAACA ACAGTTAACA GAAGCTAAAG CCACTCTTAT CCGTACTCAA
AACACACTGC AAGATCAAAT TAAAGAAGCC ACAGCTAAAC TCAACAGTAT TAAAGAAGTA
CGCGGTGTTG ATGTGGCCTT AGCAGAAGGT GAAGTCAAAA GCATCCACAC TGTTATTCAA
CAAGCCAAAG CAGATCACGA ATTAACCTAC ATCAAATCTC CCATAGATGG CAGAGTTTTG
AAAATTCGTT CTAAAAATGG CGAAGTCATC ACTACATCCG GCTTTGCTGA ACTTGGTAAA
ACAGCAGAAA TGAGTGTACT TGCTGAAGTT TATCAAACCG ATATTCAAAA AGTGCGTGTA
GGTCAAAAAG CCATCATTAC CAGTGCTACC TTTCCTGAAA AAATACATGG AACTGTGAAA
GCAATTGGTT GGCAAATTGA CAAACAAAAT ATCTTTAGTA TTAACCCCAA TTCAGATACA
GACCGCAAAA TAGTTGAGGT CAAAATATCC ATAGATAATC CTGCTGATAG CAAAAAAGTG
TCTCGCTTAA CCAACTTGCA AGTAGATGTC GCTATCCAAA TTTAG
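
The GC content field can in principle be recomputed from the gene sequence above. The sketch below assumes the reported percentage is the fraction of G and C bases over the whole gene, which the record does not confirm. The helper names are hypothetical, and the demo uses only the first line of the sequence.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks that group bases into blocks of 10."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


if __name__ == "__main__":
    # First line of the gene sequence only; paste the full sequence
    # to compare the result against the record's GC content field.
    first_line = "ATGATAACTA TCTATGAGCA AAAATGGAAT AAAAATATGG CATTAAATAA AGAACGCCGG"
    seq = clean_sequence(first_line)
    print(len(seq), round(gc_percent(seq), 1))
```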
 
Protein sequence
MITIYEQKWN KNMALNKERR LLKRAAQKWK IILAASIALA TGLLSFYSFS QLKLKPNSQI 
PFSLRSSPKP TPVKLAVTAL GRLQPQGKIT YLSAPNSING VRVEKLLVSE GDEVKPGQVL
AYLDDYARSK IAIKQAFDKL LIAKAKLAQV KAGAKPGDIN AQKATMARLD SQLKGENAAQ
TATINRIQAE VDNAQKESDR YQQLYKDGAI SASIADTKAL QLKTSQQQLT EAKATLIRTQ
NTLQDQIKEA TAKLNSIKEV RGVDVALAEG EVKSIHTVIQ QAKADHELTY IKSPIDGRVL
KIRSKNGEVI TTSGFAELGK TAEMSVLAEV YQTDIQKVRV GQKAIITSAT FPEKIHGTVK
AIGWQIDKQN IFSINPNSDT DRKIVEVKIS IDNPADSKKV SRLTNLQVDV AIQI
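
The protein sequence above can be reproduced from the gene sequence by translating it codon by codon. The sketch below is a minimal illustration and not the database's own procedure. It relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may serve as start codons. It does not handle alternative start codons, which is sufficient here because this gene begins with ATG. All names are hypothetical.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()  # drop block spacing and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases and last 15 bases of the gene sequence in this record.
    print(translate("ATGATAACTA TCTATGAGCA AAAATGGAAT"))  # MITIYEQKWN
    print(translate("GCTATCCAAA TTTAG"))                  # AIQI
```

The two printed fragments match the first ten and last four residues of the protein sequence listed in this record.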