Gene Aazo_4446 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_4446 
Symbol 
ID9342248 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp4522705 
End bp4523904 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content37% 
IMG OID 
ProductDevB family ABC exporter membrane fusion protein 
Protein accessionYP_003722874 
Protein GI298492697 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTACACA AAGAAAAGCA TTCATTGACA AACCCTGTAA GTTGGTTGTC CATAACTTTG 
GCAATTACGA CAGCTATAAC TACTGCTACA GTATCTCTCT ACAGCCTATC AAAATTTAAT
TTTGAATCTA AGTCTAATGT TCGAGTTCCT ATCAGGAATT CTACCCCTAT AGCGACTGCT
ATTGCAGCTT TAGGACGTTT AGAACCCGAA GGAGGAATTA TTCATTTGTC TGCTCCTAAT
TCTCAAGGGG GAGGGGTAAG AGTAGCTAAA CTTTTAGTAA ACAAAGGTGA TAAAATCCGT
CAAGGACAAG TAGTAGCAAT TCTTGATAGT TATACTCCTA ATATTGCTGC TTTAGGAAAA
GCTAAACAAC AAGTGGAAGT TGCTCAAGCT AGTGTCAAAC GAGTAGAAGC TGGTGCAAAA
CAAGGTGATA TTTATGCTCA AAAAGCTACA ATTGCTCGTC TAGATGCTGA ATTGCGTGGA
GAAACTTATG CTCAAAAAGC TACAATTGCT CGACTAGAAG CCGAATTCAA TAATGCAGAA
ATAGAGCATC AAAGATATCA GAAATTATAT CAACATGGTG CTATTTCTGC TTCTGATGGA
GATACTAAAC GATTGCGAAG AGATACATTA CAACAGCAAC TCAATGAAGC CAAAGCTTCC
TTAAATCGAA CTGTAGAAAC GCTGCAAAAA CAGTTAAATG AAGCCCAAGC TAGGCTTAAT
AGTATTGTTG AAATTCGCCC TACTGATATA CAAGCCGCGC AAGCTGATGT TAATAGTGCA
AAAGCTTCTT TTAAACAAGC TCAAGCAGAA CTAGATTTCA GTTCTATCCG TTCTCCTATA
GATGGTCAAA TACTAAAAAT TAATGCTTGG CCAGGAGAAA CAATTGGTAA TAATGGCATA
GCTGAATTAG GTCGCACTCA AAAAATGTAT GTAGTAGCAG AAGTCTACGA AACTGATATT
AAAAAAGTGC GCTTAGGTCA ATCAGTTGTC ATTACTAGTG ATGCTTTTAC AGGAAAAATA
CAAGGAACAG TCACAGATAT TGGTTTGCAA GTTGGTAAAC AAAATATCTT CAATAATAAT
CCCGGTGCAG ATACAGATAA TAAAATAATT AATGTCAAAA TTCGCATTGA TAAATCAACG
GACAACCAAC GAGTTGCAGC TTTGACTAAT TTACAGGTGC AGGTACTCAT TAAAGTATAA
 
Protein sequence
MVHKEKHSLT NPVSWLSITL AITTAITTAT VSLYSLSKFN FESKSNVRVP IRNSTPIATA 
IAALGRLEPE GGIIHLSAPN SQGGGVRVAK LLVNKGDKIR QGQVVAILDS YTPNIAALGK
AKQQVEVAQA SVKRVEAGAK QGDIYAQKAT IARLDAELRG ETYAQKATIA RLEAEFNNAE
IEHQRYQKLY QHGAISASDG DTKRLRRDTL QQQLNEAKAS LNRTVETLQK QLNEAQARLN
SIVEIRPTDI QAAQADVNSA KASFKQAQAE LDFSSIRSPI DGQILKINAW PGETIGNNGI
AELGRTQKMY VVAEVYETDI KKVRLGQSVV ITSDAFTGKI QGTVTDIGLQ VGKQNIFNNN
PGADTDNKII NVKIRIDKST DNQRVAALTN LQVQVLIKV