Gene Information |
Locus tag | Aazo_4863 |
Symbol | |
ID | 9342670 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | 'Nostoc azollae' 0708 |
Kingdom | Bacteria |
Replicon accession | NC_014248 |
Strand | + |
Start bp | 4975841 |
End bp | 4976977 |
Gene Length | 1137 bp |
Protein Length | 378 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | |
Product | glycine cleavage system T protein |
Protein accession | YP_003723132 |
Protein GI | 298492955 |
COG category | |
COG ID | |
TIGRFAM ID | |
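The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Number of residues encoded by a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


# Values copied from the record for Aazo_4863.
length = gene_length_bp(4975841, 4976977)
print(length)                     # 1137
print(protein_length_aa(length))  # 378
```

Both results agree with the Gene Length (1137 bp) and Protein Length (378 aa) fields in the record.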
| |
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGCTAATC AAGAAAATAT CACCCAATCT CTAGCACGAA CCCCTTTATA TCAAATCGGT GTAGAACTTA AAGCCCGTTT TACCAGCTTT GGGGACTGGG AAATGCCTGT ACAATATAGC AGTATTACCA AAGAACACGA AGCTGTTAGA AATAATGCTG GAATATTCGA TATTTCCCAC ATGGGTAAAT TTACTCTCCA AGGCAAAAAC CTCATTGACC AACTGGAGAA TTTAGTTCCC TCAGATTTAA GTCGTCTGCA ACCTAGTCAA GCTCAATACA CAGTATTATT AAATCCCCAA GCAGGAATTA TTGACGACAT AATTATTTAC TACCAAGGTC TAGACACCAT TGGTACACAA AAGGTAGTAA TTATAGTCAA TGCCGCAACC ACTGATAAAG ACAAATCCTG GATATTGACA CACCTTGATA TTCAAACAGT TGAATTTCAA GACCATTCAC GGGATAAAAT CTTAATTGCC GTTCAGGGAC CAAAAGCTAC TAGCTATCTT CAGTCCTTGG TGACAGCAGA TTTAACCCCC ATTAAAGCAT TTGCACACCT AGAAACAACA ATCTTTGGAC GACCCGCATT CCTAGCTCGT ACAGGTTACA CTGGGGAAGA TGGTTTTGAA GTCATGGTAG ATTCAGAAAT CGGAATAGAA TTATGGCAAC GTCTCTATGA TGCTGGTGTT ATCCCCTGTG GACTTGGTTG TCGAGACACC CTCCGTCTCG AAGCCGCAAT GGCGCTTTAT GGACAAGATA TCGACGATAG CACCACACCC CTAGAAGCGG GTTTGGGTTG GCTAGTAAAT TTAGATACCA AAGGTGATTT TATTGGGCGT AGTGTTTTAG AACAGCAAAA AACCAAGGGA GTGCAGCGTA AACTTGTAGG TTTGCAAACC CAAGGCAGAA ATATTCCCCG TCACGGCTAC TCCGTATTAT CATCGGGTAA AACAGTAGGA CAAGTAACTA GTGGTACTTT CTCACCTACA CTTGGTTATC CCATTGCTTT AGCTTACGTT CCTAGCCAGT TAGCAACCAC AAAACAGCAG ATAGAAGTGG AAATTCGCGG CAAAGCTTAT CCCTCAGTCG TGGTCAAACG TCCTTTCTAT CGGTCACAAA ATCGTGTTAC TAGCTGA
|
Protein sequence | MANQENITQS LARTPLYQIG VELKARFTSF GDWEMPVQYS SITKEHEAVR NNAGIFDISH MGKFTLQGKN LIDQLENLVP SDLSRLQPSQ AQYTVLLNPQ AGIIDDIIIY YQGLDTIGTQ KVVIIVNAAT TDKDKSWILT HLDIQTVEFQ DHSRDKILIA VQGPKATSYL QSLVTADLTP IKAFAHLETT IFGRPAFLAR TGYTGEDGFE VMVDSEIGIE LWQRLYDAGV IPCGLGCRDT LRLEAAMALY GQDIDDSTTP LEAGLGWLVN LDTKGDFIGR SVLEQQKTKG VQRKLVGLQT QGRNIPRHGY SVLSSGKTVG QVTSGTFSPT LGYPIALAYV PSQLATTKQQ IEVEIRGKAY PSVVVKRPFY RSQNRVTS
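The sequences above are printed in blocks of 10 characters, so the whitespace has to be removed before any analysis. The sketch below shows one way to do that, compute a GC percentage, and translate a CDS with the amino acid assignments of translation table 11. The helper names are hypothetical. The codon table is built from the standard NCBI base ordering (T, C, A, G). The first codon is assumed to be an initiator and is reported as M, which is why the GTG at the start of this gene corresponds to the leading M of the protein.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); amino acid assignments
# are the same for translation table 11 and the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing used in the record and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate_cds(seq: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon.

    Assumption: the first codon is a start codon and is read as M,
    even when it is an alternative start such as GTG.
    """
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append("M" if i == 0 else aa)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record.
prefix = "GTGGCTAATC AAGAAAATAT CACCCAATCT"
print(translate_cds(prefix))  # MANQENITQS
```

The translated prefix matches the first 10 residues of the protein sequence in the record. Passing the full gene sequence to `gc_percent` and `translate_cds` would allow the GC content and Protein Length fields to be checked in the same way.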