Gene Information |
Locus tag | Aazo_4706 |
Symbol | |
ID | 9342513 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | 'Nostoc azollae' 0708 |
Kingdom | Bacteria |
Replicon accession | NC_014248 |
Strand | - |
Start bp | 4807935 |
End bp | 4809275 |
Gene Length | 1341 bp |
Protein Length | 446 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | |
Product | sun protein |
Protein accession | YP_003723032 |
Protein GI | 298492855 |
COG category | |
COG ID | |
TIGRFAM ID | |
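The coordinate and length fields above can be cross-checked with a few lines of Python. This is a minimal sketch, not part of the record: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that Gene Length includes the stop codon, neither of which the record states explicitly.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record for Aazo_4706.
length = gene_length_bp(4807935, 4809275)
print(length)                     # 1341
print(protein_length_aa(length))  # 446
```

Under those assumptions the computed values agree with the Gene Length (1341 bp) and Protein Length (446 aa) fields.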
| |
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTAATC CCCGCCAACT TGCCTTTATT GCCCTCAAAG AAGTACATAA AGGGGCTTAT GCTGATGTAG CTCTAGACCG TGTGCTACAA AAGTTTAAAT TACCCGACAA TGATCGTCGT TTAATGACAG AATTAGTCTA TGGTAGTGTT AGAAGACAAC GCACTCTAGA TACTCTAATT GATAAATTAG CTACAAAGAA GGCACACCAA CAACCACCAG AACTTCGTAC TATTTTACAT CTCGGTTTTT ATCAATTGCG TTATCAAGAA AAGATCCCTG TTTCTGCTGC TGTGAATACC ACAGTTGAAC TAGCAAAGGA AAATGGCTTT TCTAGTTTAA CTGGTTTTGT GAATGGTTTG TTACGTCAAT ATCTACGTCT TATAGAAAGT TCATCAGAAC CATTAAAGTT ACCAGAAAAT CCGGTAGAGA GATTGGGAAT TTTACACAGT TTTCCTGATT GGATAATTGA GGTGTGGTTA GAACAATTGG GTCTTAAAGA AACAGAAAGA CTCTGTGCAT GGATGAATAA AACCCCAACT ATTGATTTAC GGGTAAATAT CCTTCGCAGT TCCCTGGAAA AAGTGGAATC AGCTTTTAAA TCTGCTGGTG TTTTAGTTAG ACCTATTCCC TATTTACCTC AAGGTTTAAG ATTAATTAGT AGTACCGGGC CAATTAAAAA TTTACCTGGT TTCCGAGAAG GTTGGTGGAC TGTTCAAGAT AGTAGCGCCC AATTAGTTAG TCATTTGCTT GACCCAAAAC CGGGCAATGT GGTGATTGAT GTTTGTGCGG CTCCAGGGGG AAAAACCACC CATATTGGTG AGTTAATGGG AGATAAAGGT AAAATCTGGG CTTGTGATCA AACTGCTTCC CGGTTACGTA GACTCAAGGA AAATGTCCAA CGTCTACATT TAGAATCTAT CGAAATCTGT ACAGGGGATA GCCGCAATTT GACCCAATTT AACAACATTG CTGATTGTGT ATTATTAGAT GCACCTTGTT CCGGTTTAGG AACTATGCAC CGCCATGCTG ATGCACGTTG GCGACAAACA CCGTCTTCTG TTCAAGAACT CTCCCAACTA CAGAAAGAAC TGATATCACA TACAGCTAAT TTTATCAAGG TTGGAGGGGT TTTAGTTTAT GCCACTTGTA CACTCCATCC CATGGAGAAT GAAGAGGTAA TTTCTCAATT TTTAGCTGTA AATCCCCATT GGCAAATTGA ATCTCCTGGC TCGGATTTAG TTGATATTGC TTCTCCAGGG TGGTTAAAAG TCTGGCCTCA TCAACGGGAT ATGGATGGTT TTTTCATGGT GCGCTTAAGA AAAACCAAGG ATTCCGAGTG A
Protein sequence | MTNPRQLAFI ALKEVHKGAY ADVALDRVLQ KFKLPDNDRR LMTELVYGSV RRQRTLDTLI DKLATKKAHQ QPPELRTILH LGFYQLRYQE KIPVSAAVNT TVELAKENGF SSLTGFVNGL LRQYLRLIES SSEPLKLPEN PVERLGILHS FPDWIIEVWL EQLGLKETER LCAWMNKTPT IDLRVNILRS SLEKVESAFK SAGVLVRPIP YLPQGLRLIS STGPIKNLPG FREGWWTVQD SSAQLVSHLL DPKPGNVVID VCAAPGGKTT HIGELMGDKG KIWACDQTAS RLRRLKENVQ RLHLESIEIC TGDSRNLTQF NNIADCVLLD APCSGLGTMH RHADARWRQT PSSVQELSQL QKELISHTAN FIKVGGVLVY ATCTLHPMEN EEVISQFLAV NPHWQIESPG SDLVDIASPG WLKVWPHQRD MDGFFMVRLR KTKDSE
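The relationship between the two sequences can be illustrated by translating the start of the gene sequence. The sketch below is illustrative only: the helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and it assumes the gene sequence is listed in coding orientation, which its leading ATG suggests. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code, so the table is built from the standard NCBI ordering.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G). Translation table 11
# shares these amino-acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence listed in the record.
prefix = "ATGACTAATC CCCGCCAACT TGCCTTTATT"
print(translate(prefix))  # MTNPRQLAFI
```

The ten residues produced match the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the GC content field.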