Gene Information |
Locus tag | Aazo_3879 |
Symbol | |
ID | 9341683 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | 'Nostoc azollae' 0708 |
Kingdom | Bacteria |
Replicon accession | NC_014248 |
Strand | + |
Start bp | 3932768 |
End bp | 3933871 |
Gene Length | 1104 bp |
Protein Length | 367 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | |
Product | hydrogenase expression/formation protein HypE |
Protein accession | YP_003722510 |
Protein GI | 298492333 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
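The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (the record does not state this, but it is consistent with the reported gene length) and that the unspliced CDS ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length = gene_length(3932768, 3933871)
print(length, expected_protein_length(length))  # 1104 367
```

Both values agree with the Gene Length (1104 bp) and Protein Length (367 aa) fields of this record.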
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.18841 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAATTT ACAACAACCA ACAATCTCAA AATCCTCTCT TTCAAAAAGT TGAACAAGTC CGTCGTCGTC AGGGAAAAAT TAGGGATACA TATATTAACC TTGCCCATGG TAGCGGTGGA AAAGCCATGC GAGATTTAAT TAATGATGTA TTTGTAAAAA ATTTTGATAA TCCCATTCTT TCCCAATTAG AAGACCAAGC CAGTTTTGAC TTAGCCAGTC TTTCCCAACA TGGAGATAGA TTAGCATTTA CCACAGATTC CTATGTTGTA GACCCATTAT TTTTTCCAGG TTCAGATATA GGAGAATTAG CAATTAATGG CACAATTAAC GACTTAGCAG TTAGTGGTGC AAAACCTTTA TATCTTAGCT GTAGTGTCAT CTTAGAAGAA GGTTTACCAG TAGAAACATT GCGCCGTGTA GTTTCCAGTA TGCAAAAAGC GGCACAAAAA GCCGGAATAC AAATAGTTAC AGGAGACACA AAAGTAGTTC ATCGTGGTTG TGCTGATAAA CTCTTTATCA ACACTGCCGG AATCGGCATC ATTCCTGCTA ATATTGATAT TTCTCCCCGC AACATTCAAA CCGGAGACGT AGTAGTTATT AACGGAGAAA TAGGGAATCA TGGAACAGCC ATATTAATTG CTAGGGGAGA ACTAGAACTA GAAACAAATA TAGAAAGTGA CTGTCAATCA TTACATGAAT TAGTAGCAGA GATTATCAAA ACTTGCCCAC AAATTCATGC CATGAGAGAC GCAACTAGAG GAGGTTTAGC CACAGTATTA AATGAATTTG CCGTCACAGC AAACGTAGGA ATACGCATCC ATGAAAATGC CATTCCTGTT AAAGAACAAG TAAATGGAGT GTGTGAAATA CTCGGTTTAG ATCCTTTATA TTTAGCGAAC GAAGGAAAAC TAGTCATAGT TGCACCCAAA GAAAAAGCCG AGTTAATTTT ATCAACTATG AAAAACTACC CAACAGGAAA ACAAGCATCT ATCATTGGTG AAATTATTCC CACACCACCA GGAATAGTCC TCTTGAAAAC CGCCTTTGGT GCAGAAAGAA TAGTTGATAT GCTAGTAGGC GACCAACTCC CACGAATTTG TTAA
|
Protein sequence | MAIYNNQQSQ NPLFQKVEQV RRRQGKIRDT YINLAHGSGG KAMRDLINDV FVKNFDNPIL SQLEDQASFD LASLSQHGDR LAFTTDSYVV DPLFFPGSDI GELAINGTIN DLAVSGAKPL YLSCSVILEE GLPVETLRRV VSSMQKAAQK AGIQIVTGDT KVVHRGCADK LFINTAGIGI IPANIDISPR NIQTGDVVVI NGEIGNHGTA ILIARGELEL ETNIESDCQS LHELVAEIIK TCPQIHAMRD ATRGGLATVL NEFAVTANVG IRIHENAIPV KEQVNGVCEI LGLDPLYLAN EGKLVIVAPK EKAELILSTM KNYPTGKQAS IIGEIIPTPP GIVLLKTAFG AERIVDMLVG DQLPRIC
|
| |
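The sequence fields are printed in blocks of ten characters, so the spaces need to be stripped before any analysis. The sketch below is one possible way to do that and to recompute values such as GC content or the translation. The helper names are hypothetical. The codon table is written out by hand in the usual TCAG ordering; translation table 11 assigns the same amino acids as the standard code and differs only in which start codons it permits, so alternative start codons are not handled here.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(blocked: str) -> str:
    """Remove the whitespace between ten-character blocks and upper-case."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at a stop codon."""
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First two blocks of the gene sequence from this record.
head = clean("ATGGCAATTT ACAACAACCA")
print(translate(head))  # MAIYNN
```

Passing the full Gene sequence field through `clean` and `translate` should reproduce the Protein sequence field, and `gc_fraction` on the cleaned sequence can be compared with the reported GC content.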