Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Aazo_4040 |
Symbol | |
ID | 9341845 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | 'Nostoc azollae' 0708 |
Kingdom | Bacteria |
Replicon accession | NC_014248 |
Strand | + |
Start bp | 4099059 |
End bp | 4100714 |
Gene Length | 1656 bp |
Protein Length | 551 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | |
Product | putative sodium symporter protein |
Protein accession | YP_003722628 |
Protein GI | 298492451 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.824873 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGTCAGTTG AACTTTGGAC AATTATCTTA GTTGGACTTT CTTTTGCACT CTATATTTAC ATTGGTTGGC AATCACGGGT AAAGGATACA AAAGATTTCT TTATTGCTGG TCAGGGTATT CCTTCCATTG CTAACGGTGC AGCAACAGCA GCTGATTGGA TGTCTGCGGC TTCGTTTATT TCCATGGCTG GGCTAATTTC TACTTTGGGT TATGATGGTT CAATTTATTT GATGGGTTGG ACTGGTGGCT ATGTTTTGTT AGCTTTATTA TTAGCCCCAT ATTTGCGGAA ATTTGGTAAA TATACAGTTC CCGATTTTGT AGGCGAGCGC TATAAATCTA ATCTAGCTCG TTTAGTAGCA GTAGTTGCAG CTATTTTCGT TTCTCTTACC TACGTTGCTG GCCAGATGCG CGGTGTAGGT ATTGTCTTTA GCCGCTTTTT AGAAGTTGAT ATCAATACTG GTGTCATTAT CGGTATAGTA ATAGTCGGCT TTCTTGCAGT GTTGGGAGGA ATGAAGGGTA TTACTTGGAC ACAAGTTGCC CAATATTGCA TCTTAATTTT CGCTTATTTG ATTCCTGCCA TTGCGCTCGC CTTTATTCTC ACGGGCAATC CTGTTCCTCA GTTAGCATTT ACCTTTAGTG ATGTTGCTGA TAACTTAAAT AAAATTCAGA CCGATTTAGG GTTTGCACAA TATACCCAAC CTTTTGCTAA CAAAACCATG ATGGATGTCC TATTCATCAC TATTGCTTTG ATGGTAGGAA CTGCGGGTTT ACCCCACATT ATCGTCCGGT TTTACACAGT ACCTAATGTG CGTGCAGCGA GATTTTCCGC AGGTTGGGCA TTATTATTTA TCGCCATTCT TTACACAACT GCCCCGGCTT TATCCATGTT TGCCCGCTAC AATTTAATTC AATCTCTCCA CAACCATACA GTTGAAGAGG TTAGACAATT AGACTGGGCA AATAAATGGG AAAAAACCAA ACTCCTCACT TTTGAAGATA AAAACAAAGA TGGTAAATTA CAGTTAACTA GCAAAAAAGA AACTAACGAA ATTACCATCG ACAATGATAT TATTGTTCTC TCTACCCCAG AAGTTGCTAA ACTCGCACCT TGGGTCATAG CTTTAGTGGC AGCGGGAGGT TTAGCAGCAG CATTGTCAAC AGCTTCTGGT TTATTACTAG TAATTTCTAG TTCTATTGCC CATGACGTTT ATTATCGCAT CTTCGATTCT ACAGCTTCCG AAGAAAAACG AGTATTTGTA GGCCGAACAG TTGTCGGTTT TGCCTTAGTT CTTGCAGGTT ATTTTGGCGT AAACCCCCCT GGTTTTGTGT CTCAGGTCGT AGCTTTTGCC TTTGGTTTAG CTGCTGCTAG TTTCTTCCCA GTGATAGTTT TAGGAATTTT TGATAAACGC ACAAATGCCG AAGGTGCTAT TGCGGGAATG TTAACCGGTT TCATTTTCAC TATCATCTAT ATTATCGGTG TGAAATTTAC GGGAATGACA CCTTGGTTTT TTGGAGTTTC TGCTGAAGGT ATCGGCACCT TAGGGATGAT CATTAATTTT ATTGTCACCA TTACAGTTTC CCGTTGTACT CCACCACCAG GAGCAGATAT TCAAGCTTTA GTTGAAGATT TACGTACTCC TAGTTTTGAA GAATAG
|
Protein sequence | MSVELWTIIL VGLSFALYIY IGWQSRVKDT KDFFIAGQGI PSIANGAATA ADWMSAASFI SMAGLISTLG YDGSIYLMGW TGGYVLLALL LAPYLRKFGK YTVPDFVGER YKSNLARLVA VVAAIFVSLT YVAGQMRGVG IVFSRFLEVD INTGVIIGIV IVGFLAVLGG MKGITWTQVA QYCILIFAYL IPAIALAFIL TGNPVPQLAF TFSDVADNLN KIQTDLGFAQ YTQPFANKTM MDVLFITIAL MVGTAGLPHI IVRFYTVPNV RAARFSAGWA LLFIAILYTT APALSMFARY NLIQSLHNHT VEEVRQLDWA NKWEKTKLLT FEDKNKDGKL QLTSKKETNE ITIDNDIIVL STPEVAKLAP WVIALVAAGG LAAALSTASG LLLVISSSIA HDVYYRIFDS TASEEKRVFV GRTVVGFALV LAGYFGVNPP GFVSQVVAFA FGLAAASFFP VIVLGIFDKR TNAEGAIAGM LTGFIFTIIY IIGVKFTGMT PWFFGVSAEG IGTLGMIINF IVTITVSRCT PPPGADIQAL VEDLRTPSFE E
|
| |