Gene Information |
Locus tag | Aazo_3787 |
Symbol | |
ID | 9341592 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | 'Nostoc azollae' 0708 |
Kingdom | Bacteria |
Replicon accession | NC_014248 |
Strand | + |
Start bp | 3845838 |
End bp | 3847508 |
Gene Length | 1671 bp |
Protein Length | 556 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003722445 |
Protein GI | 298492268 |
COG category | |
COG ID | |
TIGRFAM ID | |
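The coordinates and lengths in this record are mutually consistent, which can be checked with a small sketch. It assumes 1-based inclusive coordinates and that the gene length includes the stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_length: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(3845838, 3847508)
print(length)                              # 1671
print(expected_protein_length_aa(length))  # 556
```

Both values match the Gene Length and Protein Length rows above.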
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGCCCGAAA GTAAGTCTAA ATTTTTAGTC CCTGTTATTA GCACTGCTAT AGTCGTCGCA GGTGGGATAG CTGCTTATGT ATATTTTAAA GTGCCCTCTG AAGATGTTTC CAGTCCTCTG GGAATTGCTA AAGTAGTACC GGCTAATGCC TTGATGGCGA CTTATATTAA CACAGATTCC CAATCTTGGA GTAAGTTACA GCAGTTTGGA ACTCCACAAG CACAACAACT AGTATCCAAA GGTCTACAGG ATATCAACAA ACAACTATTA AGTGATAGCA ATATTGTTTA TGAAACAGAC ATAAAACCTT GGATTGGTGG AGTCATGATT GCTGTGCTAC CACCAAATTC TACTATACGT AATCCACCAA ATCCACCAAT TCCAGTACAG CCAGAGCCAA ATATTTTGTT GGTAGTAGGT ATAAAAGATA AACTCAATGC CTTGAAATTT GCTACTAAAT TGAAGGAGCA AAAAAACTTA CAAATTCAAG AATCAGAATA CAAAGGTGAG AAAATTATTG CTAGTACAAG CAAGACTAAA TCGACTTACA TGGTTGTTTT GAATAACACT CGTATACTGT TGACACCAGA AAAACAAGCT GTAGAAAAAG CTATTGATAC CTATAAAGGT AAGCCATCCT TTGCCAACAA AGAAGGCGCA AGTAGTATTT TAGCTAAAGG TGTAGATGTT CAAAACAGCC TTGCTCAAAT TTATGTGCCT GATTACGCCA ATATGGCACA ACAGTTAACA GCTTTCAATC CACAGTCCAG GCCATTACCC CCAGAAACAT TCGCACAACT CAAGCAAGTA AAATCAATGG TAGCGGCTGT GGGTGTCGAT GATGCTGGAG TGAGAATGAA AGTAGTAGTG AACTTAGATC CGCAACTGAA CAAATTTCAA TATCAAAATA CTCCGGCTAA GATAGTGGCA CAATTTCCCA GTGATACTTT TGCTTTAGTC ACCGGACAGA ACATAAATCG TAGCTGGCAA ACCTTCCTGG AACAGTCAAA AGATTATCCT GAAATTAAGC AAGGTGTGGA ACAAGCACGA GGACAACTAA AACAAGCGGT CAATCTGGAT TTAGATAAAG AAATTTTTGG TTGGATGGAT CAAGAATTTG CCTTGGGTGC GGTGAAATCT AGTCAAGGTT GGTTAGCCAA TGTTGGTTTT GGGGGAGCGA TGGTATTTGA CACCAGTGAT CGCAAAACAG CGGAAGCCAC CTTCACTAAA CTAGATGACC TAGCCAAAAA GCAATCACTC AACATCACCA AAAGAAGCAT TGGTGGTAAA AATATCACCG AATGGCAAAT TACCCAACAA GGCACTTTCA TAGCACATGG TTGGCTAGAT CAGGATACCG TATTTCTCGC TATTGGTGGA CCAGTTGGTG AAGCGCTAGC AGACAAAAAA GGTCAACCCC TGGATAATAC GAACACATTT AAAGCTGTAA CGAGTTCCTT GCAAAAACCC AACGGTGGTT ATTTATACTT GGATTTAGAA AACACCTCTT CTTTAATTAC CCGTTTAGCC ACACAAGGTA AACCTCTTCC CCTGGAAACC AATGCTGTCC TATCATCCAT TCGTGGTTTG GGTGTGACAG TGAATAGCCC CGATAAATCC ACCAGTCAAA TGGAAATGTT GTTAGCTCTT AAACCAAGTA GTAGTAAATA A
Protein sequence | MPESKSKFLV PVISTAIVVA GGIAAYVYFK VPSEDVSSPL GIAKVVPANA LMATYINTDS QSWSKLQQFG TPQAQQLVSK GLQDINKQLL SDSNIVYETD IKPWIGGVMI AVLPPNSTIR NPPNPPIPVQ PEPNILLVVG IKDKLNALKF ATKLKEQKNL QIQESEYKGE KIIASTSKTK STYMVVLNNT RILLTPEKQA VEKAIDTYKG KPSFANKEGA SSILAKGVDV QNSLAQIYVP DYANMAQQLT AFNPQSRPLP PETFAQLKQV KSMVAAVGVD DAGVRMKVVV NLDPQLNKFQ YQNTPAKIVA QFPSDTFALV TGQNINRSWQ TFLEQSKDYP EIKQGVEQAR GQLKQAVNLD LDKEIFGWMD QEFALGAVKS SQGWLANVGF GGAMVFDTSD RKTAEATFTK LDDLAKKQSL NITKRSIGGK NITEWQITQQ GTFIAHGWLD QDTVFLAIGG PVGEALADKK GQPLDNTNTF KAVTSSLQKP NGGYLYLDLE NTSSLITRLA TQGKPLPLET NAVLSSIRGL GVTVNSPDKS TSQMEMLLAL KPSSSK
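The relationship between the two sequences above can be illustrated with a minimal sketch that translates a nucleotide string and computes its GC fraction. It assumes the codon-to-amino-acid assignments of translation table 11, which are the same as those of the standard code (table 11 differs in its permitted start codons, which does not matter here because the gene starts with ATG). The helper names `translate` and `gc_fraction` are hypothetical, and the whitespace handling reflects the 10-base grouping used in this record.

```python
from itertools import product

# Codon table built in TCAG order; amino-acid assignments shared by
# NCBI translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(sequence: str) -> str:
    """Remove the spaces/newlines used to group bases and upper-case the result."""
    return "".join(sequence.split()).upper()


def translate(sequence: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    seq = clean(sequence)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(sequence: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(sequence)
    return sum(base in "GC" for base in seq) / len(seq)


# First 30 bases of the gene sequence in this record.
prefix = "ATGCCCGAAA GTAAGTCTAA ATTTTTAGTC"
print(translate(prefix))  # MPESKSKFLV
```

The output matches the first ten residues of the protein sequence. Applying `gc_fraction` to the full gene sequence would allow the GC content row to be checked in the same way.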