Gene Information |
Locus tag | Aazo_4020 |
Symbol | |
ID | 9341824 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | 'Nostoc azollae' 0708 |
Kingdom | Bacteria |
Replicon accession | NC_014248 |
Strand | - |
Start bp | 4079292 |
End bp | 4080665 |
Gene Length | 1374 bp |
Protein Length | 457 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | |
Product | thiamine biosynthesis protein ThiC |
Protein accession | YP_003722614 |
Protein GI | 298492437 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
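The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the gene length includes the stop codon; the record does not state either convention. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS whose length includes one stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # subtract the stop codon


# Values copied from the Gene Information table for Aazo_4020.
length = gene_length_bp(4079292, 4080665)
print(length)                     # 1374
print(protein_length_aa(length))  # 457
```

Under these assumptions both results agree with the Gene Length (1374 bp) and Protein Length (457 aa) fields.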
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTACAG AATGGGTAGC CAAGCGGCTT GGACAAGGCA ATGTATCGCA AATGCACTAC GCCCGTCAGG GTGTCATCAC CGAAGAAATG CACTACGTAG CCAAGCGGGA AAATCTTCCT GCTGAACTGA TCCGCGAAGA AGTGGCACGG GGAAGGATGA TTATCCCTGC TAATATTAAT CACACTAACT TAGAACCGAT GGCGATCGGT ATCGCTTCTA AATGTAAAGT AAATGCCAAT ATCGGCGCTT CTCCAAACTC TTCCAACCTG CAAGAAGAAG TCGATAAACT AAACTTAGCA GTAAAATATG GTGCAGATAC CGTCATGGAC TTGTCCACAG GAGGCGGTAA TTTAGATGAA ATTCGCACTG CAATTATCAA CGCTTCACCT GTTCCCATTG GCACAGTTCC AGTTTACCAA GCGTTAGAAA GCGTTCACGG CACAATTGAA CATCTCACCG CTGACGACTT TCTCCACATC ATCGAAAAAC ACGCCCAGCA AGGGGTAGAC TATCAAACCA TCCACGCGGG AATTTTAATA GAACATTTAC CCTTGGTCAG AAGCCGCATT ACAGGTATTG TTTCTCGCGG TGGTGGAATT TTAGCGCGGT GGATGTTGCA TCACCACAAA CAGAACCCCC TGTATACCCA TTTCAACGAC ATCATTGAGA TTTTCAAAAG ATATGATGTT TCCTTCAGTT TAGGCGACTC CTTGCGCCCT GGATGTACCC ATGATGCCTC AGATGAAGCC CAGTTAGCAG AACTGAAAAC CCTGGGAAAT CTCACCCGCA AGGCTTGGGA AGATGATGTC CAAGTCATGG TAGAAGGTCC TGGTCACGTA CCAATGGATC AAATTGAGTT CAACGTCCGC AAGCAAATGG AAGAGTGTTT AGAAGCACCA TTCTATGTTC TTGGTCCTTT GGTAACAGAT ATTGCTCCTG GTTACGACCA TATTACTTCA GCAATTGGGG CAGCAATGGC GGGATGGTAT GGTACTGCAA TGCTGTGTTA CGTGACTCCC AAAGAACATT TAGGTTTACC TAACGCTGAA GACGTGCGAA ATGGGTTAAT TGCTTATAAA ATTGCCGCTC ATGCAGCGGA TATTGCTAGA CATCGGCCAG GTGCTAGAGA TAGAGATGAT GAACTTTCGA AAGCTCGATA TAATTTTGAT TGGAATCGTC AATTTGAGTT ATCTTTAGAC CCAGAAAGAG CGAAGGAATA TCACGATGAA ACTTTACCCG CAGATATCTA TAAAACTGCT GAGTTCTGTT CTATGTGTGG ACCGAAGTTC TGTCCAATGC AGACTAAGGT TGATGCTGAT GCGTTGACTG AGTTGGAGAA GTTTTTAGCG AAAGAACCTG TAGCTCAAGT TTAA
|
Protein sequence | MRTEWVAKRL GQGNVSQMHY ARQGVITEEM HYVAKRENLP AELIREEVAR GRMIIPANIN HTNLEPMAIG IASKCKVNAN IGASPNSSNL QEEVDKLNLA VKYGADTVMD LSTGGGNLDE IRTAIINASP VPIGTVPVYQ ALESVHGTIE HLTADDFLHI IEKHAQQGVD YQTIHAGILI EHLPLVRSRI TGIVSRGGGI LARWMLHHHK QNPLYTHFND IIEIFKRYDV SFSLGDSLRP GCTHDASDEA QLAELKTLGN LTRKAWEDDV QVMVEGPGHV PMDQIEFNVR KQMEECLEAP FYVLGPLVTD IAPGYDHITS AIGAAMAGWY GTAMLCYVTP KEHLGLPNAE DVRNGLIAYK IAAHAADIAR HRPGARDRDD ELSKARYNFD WNRQFELSLD PERAKEYHDE TLPADIYKTA EFCSMCGPKF CPMQTKVDAD ALTELEKFLA KEPVAQV
|
| |
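The gene and protein sequences above can be checked against each other by translating the nucleotide sequence. The following is a minimal sketch, not the database's own method: it builds the codon table by hand rather than relying on a library, and the helper names (`translate`, `gc_fraction`) are hypothetical. Translation table 11 assigns codons to the same amino acids as the standard code and differs only in the start codons it permits, so a single table serves here. The sequence is listed 5' to 3' starting at ATG, so no reverse complement is applied even though the gene lies on the minus strand.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces used to group the sequence in blocks of ten and uppercase it."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks and last twelve bases of the gene sequence above.
print(translate("ATGCGTACAG AATGGGTAGC CAAGCGGCTT"))  # MRTEWVAKRL
print(translate("GCTCAAGTTTAA"))                      # AQV
```

The two fragments reproduce the first ten and last three residues of the listed protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.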