Gene Information |
Locus tag | Aazo_0968 |
Symbol | |
ID | 9338764 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | 'Nostoc azollae' 0708 |
Kingdom | Bacteria |
Replicon accession | NC_014248 |
Strand | + |
Start bp | 1027628 |
End bp | 1029034 |
Gene Length | 1407 bp |
Protein Length | 468 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | |
Product | major facilitator superfamily protein |
Protein accession | YP_003720477 |
Protein GI | 298490300 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
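The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; the function names `gene_length` and `protein_length` are illustrative and not part of the source record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record for Aazo_0968
length = gene_length(1027628, 1029034)
print(length)                  # 1407
print(protein_length(length))  # 468
```

Both results agree with the Gene Length (1407 bp) and Protein Length (468 aa) fields in the record.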
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.623645 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACTCTG TTCAAGTTAA AATAGTTCCA TCTTTAAATC TGGAAATTCC TCAGATCGAT TCATTAACAA CAGTCTTATC TTCCGAATCT AAATCTAGCT CTCGTTTTCC TAAAAATGCT ATTCGTACCA GTTTACAAGC ATGTACTTTA GATGCTATTT TTGCAACGAT TTTTTCTATT ACTACTGGGG GGATTTTACT TAGTAACTTT TTGGTGGAAT TGGATGCTAG TCCAGTCGTT TTTGGGTTGC TATCCTCAAT TCCCATGTTG GTAAATCTGA TTCAGCCGTT GGGTGCTTAT ATCTCAGAGC ACACTACTAG CCGCTTTCAG TATTCCATGC TTGTTTATGG AATTTCTCGG CTATTGTGGT TGATTTTAGT AATAGGTATT TTGGGTGTAG GTTGGGGACT TATAAATTCT CAACAGTTGC TGATATTAAC TCTGGTCATC GTTTTTCTTA GTCATCTTTC AGGTGGATTG GGAAGTGCAT CATGGATGAG TTGGTTAGCT ATCATTGTTC CCCGAAGTTT GCGCGGTAGG TATTTTGGAA TCCGTAATAG TGCTACCAGC TTGACTAATT TGCTATGTAT ACCTTTAGCT GGTTTAGCTG TTTCCCATTG GTATGGAGGC ACTCGCCAAG GTTATGGGGT AGTTTTGTTT GTAGGTATCG TATTTGGGTT TATTAGTCTA GGATGTCAGT ATTTCAAAGT TGATGTGAAT CCTCAATTAC AAAATACTTG TATAGTTTAT TCATCCGTAA GCAGCAAGAA TGAATTGGGT GAAATAACTA TCAGTGATAT GGTATCTATT CCCCAAAAGC AATGGGATGA CAATATTTGG AAAAACTCTA ATTTTTGGAT ATTTATACTG TATTTCAGTT TATGGATGCT GGCTTTTAAT CTCTGCGCCC CGTTTGTTAA TCTTTATTTG TTAGAAACTT TAAATGTAGA TGTGGGTTGG CTGACACTTT ACAGCAGTCT GCAAGCGGGA GCAAATCTGC TGATGGTTGT GCTGTGGGGT AAGTTAGCAG ATAAGGTGGG AAACCGCCCA ATTTTGATAT TTGCTGGAAT TATTGTTGCG GTTACGCCTT TGCTGTGGTT TGGTATTGCT AATACCAGTT TAGATATCTG GCTGTGGCTA CCGCTATTGC ATATTTTTAT TGGTGGTACT GGGGCTGCAG TTGATTTGTG TAACAACAAT ATGCAAATTG GCATTACCCC AGTTAGAAAT CAGTCTATTT ATTTTGCGAT CGCAGCGGCT ATAGCTGGAG TAACTGGTGC GTTGGGAACA ACCATTGGCG GTTTTATCGC CCAATCGCCC CAATTTGGTG GTTTATTAGG CTTATTTGCC TTCTCTAGTA TATTGAGACT CGCAGCATTG ATACCGCTTG TATTTGTTCA GGAATAG
|
Protein sequence | MDSVQVKIVP SLNLEIPQID SLTTVLSSES KSSSRFPKNA IRTSLQACTL DAIFATIFSI TTGGILLSNF LVELDASPVV FGLLSSIPML VNLIQPLGAY ISEHTTSRFQ YSMLVYGISR LLWLILVIGI LGVGWGLINS QQLLILTLVI VFLSHLSGGL GSASWMSWLA IIVPRSLRGR YFGIRNSATS LTNLLCIPLA GLAVSHWYGG TRQGYGVVLF VGIVFGFISL GCQYFKVDVN PQLQNTCIVY SSVSSKNELG EITISDMVSI PQKQWDDNIW KNSNFWIFIL YFSLWMLAFN LCAPFVNLYL LETLNVDVGW LTLYSSLQAG ANLLMVVLWG KLADKVGNRP ILIFAGIIVA VTPLLWFGIA NTSLDIWLWL PLLHIFIGGT GAAVDLCNNN MQIGITPVRN QSIYFAIAAA IAGVTGALGT TIGGFIAQSP QFGGLLGLFA FSSILRLAAL IPLVFVQE
|
| |
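The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the amino acid assignments of translation table 11, which match the standard code for every codon (the tables differ in permitted start codons), and it ignores the 10-base spacing used in the record. The function name `translate` and the compact codon table layout are illustrative choices, not something the source specifies.

```python
from itertools import product

# Codon table in the usual T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a forward-strand CDS, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # remove the spacing between 10-base blocks
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last codons of the gene sequence in the record
print(translate("ATGGACTCTG TTCAAGTTAA A"))  # MDSVQVK
print(translate("GTTCAGGAATAG"))             # VQE
```

The two fragments reproduce the first seven and last three residues of the protein sequence listed above. Passing the full gene sequence to the same function should yield the complete protein, since the record reports the gene as unspliced and on the + strand.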