Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Aazo_4751 |
Symbol | |
ID | 9342558 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | 'Nostoc azollae' 0708 |
Kingdom | Bacteria |
Replicon accession | NC_014248 |
Strand | - |
Start bp | 4855683 |
End bp | 4856948 |
Gene Length | 1266 bp |
Protein Length | 421 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | |
Product | major facilitator superfamily protein |
Protein accession | YP_003723061 |
Protein GI | 298492884 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.092373 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTTCGA GCAAACAGAA ATCAACTCAC CATAACAGCA AGGCTGCACA ACACGATCCT TTAGCAGCCA TGAGATTTCG AGATTATCGG CTATTTACCA TTGGGCGTGT ACTCCTCTTC ACCGGGGGAC AAATGCAGAC CGTGGCGCTG GGTTGGGAGC TTTATGAGCG GACAAATTCA GCGATAGTAT TAGGTGGAAT AGGACTGGCG CAAGTTCTAC CAATGATTGC ACTAACTTTG ATTACAGGAC ATATTGCTGA TAAGAGCGAT CGCAAACGCA TTACTTTATT CTCAATTTTG CTGCTAACTC TTTGCTCAAT AGCTTTAGCA GTTATTTCCT TTAATGAAGG TGCAATTTTT CTAGTTTATG GTTGCTTATT ATTAACAGGT GTAGCCAGAG CATTTCTCAA ACCTGCCGGT GATGCACTGA TGTGGCAGTT AATACCCACG AGTGCTTTTA CCAATGCAGC AACTTGGAAT AGCAGTAGCT TTCAATTAGC ATCAGTAATT GGGCCAGCTT TGGGAGGATT CAGCGTTGTT CTTTTCGGAA ATGCGACAGG GGTATATATA TTAGCCACAT TGGCAGCACT ATCATGTTTT TTCCTCACAG CCGCAATTAA ACCACAAAAA ACTAACTTTG CCAAAGAACC AACATCTTTA AAAACTCTAG CTGCTGGTGC CGAATTTGTT TGGAATAATC AACTAATTCT TGCAGCCATC ACCCTCGATT TATTTGCAGT CTTGTTAGGT GGTGCAGTTG CATTATTACC CATCTTTGCC AAAGATATTT TGCAAGTTGG TCCGGTAGAA TTAGGCTATT TACAAGCAGC CCCATCCATA GGTGCATTGA TTATGGCAGC ATTGTTGGTA TATTTGCCAC CTATCCACAA AGCCGGCCCT GCCTTACTTT GGTCAGTTGT CGGGTTTGGG ATTGTGACAA TTATTTTTGG GTTATCCCGT TGGGTCTGGC TATCGTTGTT GATGTTGGCA TTAAGTGGGG CGTTAGACAG CATTAGCGTG GTGATTCGTC ATACTTTGGT GCAAATTCGC ACTCCTGACC ATTTGCGGGG TCGAGTTGCA GCTATAAATA GTGTATTTAT CAGTGCTTCC AATGAATTGG GAGGTTTTGA ATCTGGTTTG ACTGCGGCTT TATTTGGGCC TGTTTTGTCT GTCGTTGGTG GAGGTGTGGG GACGATTTTA GTAGTGGTAG CAACAGCCAT GATTTGGCCG GAAATTCGCA AATTAGGAGC TTTGCATGAG GATTAA
|
Protein sequence | MSSSKQKSTH HNSKAAQHDP LAAMRFRDYR LFTIGRVLLF TGGQMQTVAL GWELYERTNS AIVLGGIGLA QVLPMIALTL ITGHIADKSD RKRITLFSIL LLTLCSIALA VISFNEGAIF LVYGCLLLTG VARAFLKPAG DALMWQLIPT SAFTNAATWN SSSFQLASVI GPALGGFSVV LFGNATGVYI LATLAALSCF FLTAAIKPQK TNFAKEPTSL KTLAAGAEFV WNNQLILAAI TLDLFAVLLG GAVALLPIFA KDILQVGPVE LGYLQAAPSI GALIMAALLV YLPPIHKAGP ALLWSVVGFG IVTIIFGLSR WVWLSLLMLA LSGALDSISV VIRHTLVQIR TPDHLRGRVA AINSVFISAS NELGGFESGL TAALFGPVLS VVGGGVGTIL VVVATAMIWP EIRKLGALHE D
|
| |