Gene Information |
Locus tag | Aazo_1862 |
Symbol | |
ID | 9339655 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | 'Nostoc azollae' 0708 |
Kingdom | Bacteria |
Replicon accession | NC_014248 |
Strand | - |
Start bp | 1933295 |
End bp | 1934953 |
Gene Length | 1659 bp |
Protein Length | 552 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | |
Product | carbonate dehydratase |
Protein accession | YP_003721083 |
Protein GI | 298490906 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.512714 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTAGTCC GCAGCACGGC GGCACCCCCA ACCCCGTGGT CAAGGAGTTT AGCTGAACCC GATATCCATC AAACCGCATT TGTACATTCT TCTTGTAATT TAATTGGGGA TGTACACCTG GGTCAAAATG TTATTATTGC TCCAGGAACT TCGATTAGAG CAGATGAAGG AACACCTTTT TTTATTGGTG AAAATACTAA TATTCAAGAT GGTGTAGTAA TTCATGGGTT GGAGCAAGGC CGAGTAATTG GTGATGATGG CAAAAACTAC TCGGTATGGG TTGGTAAAGA TGCTTCTATT ACCCACATGG CGCTGATTCA TGGGCCAGCT TATGTAGGGG AAAGTTGCTT TATTGGCTTT AGGTCTACAG TATTTAATGC CAGGGTAGGG GCTGGTTGCA TCGTGATGAT GCACGCTCTA ATCCAAGATG TAGAAATTCC ACCAGGTAAA TACGTAGCAT CTGGGTCGAT AATTACTATG CAGCAGCAAG CTGACCGATT GCCAGATGTA CAAGCTCAGG ATCAGCAATT CGCTCACCAC GTTGTGGGGA TAAATCAGGC TTTGCGGGCT GGTTATCGCT GTGTAGAGGA TATTAAGTGT ATTGCCCCGA TTCGGGACGA GCTTAATCTG TCTGGTGATA GATCTTATAC AAGTATTATT GTTGACGAAT TGGAAAGGAG CAGTGAAGTG GCAAGCAAAT TGGGTGCAGA AATAGTAGAT CAGGTACGTT ATCTACTGAA TCAAGGTTAC AAAATTGGTA CAGAACACGT AGACCAACGT CGTTTCCGTA CAGGCTCTTG GCAAAGCTGC CAGCCTATTG AAACCAGATC ATTAGGAGAA GCGATCACAG CATTGGAATC TTGTCTAATA GACCACAGTG GCGAGTACGT GCGTTTGTTC GGCATTGACA ACGGCAGAAA ACGGGTATTA GAAACTATTA TCCAACGTCC TGATGGTGTA GTAGCTACAA GTACATCTAG TTTTAAAACT CCTGCTGCAT CTTACAGCAG CTACAACGGT AATGGTAACA GTAACGGTGC AGTTGCTAGT GGCAGCCTCA GTGCTGAAAC AGTGAACCAA ATTCGCCAGC TCTTAGCTAA TGGTTACAAA ATTGGTACAG AACACGTAGA CCAACGTCGT TTCCGTACAG GCTCTTGGCA AAGCTGTAAC CCTATTGAGG CAACCTCAGC TAATGATGTA GTTGCTGCTT TGGAAGAATG CATGACTTCT CATCAAGGCG AATATGTGCG GTTAATTGGC ATTGACAGCA AAGCCAAACG TCGTGTATTG GAAGCAATTA TCCAACGTCC TAACGGTCAA GTAGTATCCT CCGGTAGTGC TAAAACATCA GGTACTTTAT ACAGTGGTGC AACTGCAAGT GCCACTGCAA CTAGCACCCG CTTGAGTACC GAAGTAGTAG ACCAACTGAA ACAGTTGTTA ACAGGTGGTT TTAAGATTAG TGTTGAACAC GTAGACCAAC GTCGTTTCCG TACAGGCTCT TGGGTAAGCT GCGGTCAAAT TCAGGCTACA TCTGAAAGAG ATGTGCTCGC TGCACTAGAA GCTGTTATCT CTGAATATGC AGGTGAATAC GTGCGTTTAA TCGGAATCGA CCCCGTAGCC AAACGCCGCG TGTTGGAAGC AATCATCCAA CGTCCATAA
|
Protein sequence | MVVRSTAAPP TPWSRSLAEP DIHQTAFVHS SCNLIGDVHL GQNVIIAPGT SIRADEGTPF FIGENTNIQD GVVIHGLEQG RVIGDDGKNY SVWVGKDASI THMALIHGPA YVGESCFIGF RSTVFNARVG AGCIVMMHAL IQDVEIPPGK YVASGSIITM QQQADRLPDV QAQDQQFAHH VVGINQALRA GYRCVEDIKC IAPIRDELNL SGDRSYTSII VDELERSSEV ASKLGAEIVD QVRYLLNQGY KIGTEHVDQR RFRTGSWQSC QPIETRSLGE AITALESCLI DHSGEYVRLF GIDNGRKRVL ETIIQRPDGV VATSTSSFKT PAASYSSYNG NGNSNGAVAS GSLSAETVNQ IRQLLANGYK IGTEHVDQRR FRTGSWQSCN PIEATSANDV VAALEECMTS HQGEYVRLIG IDSKAKRRVL EAIIQRPNGQ VVSSGSAKTS GTLYSGATAS ATATSTRLST EVVDQLKQLL TGGFKISVEH VDQRRFRTGS WVSCGQIQAT SERDVLAALE AVISEYAGEY VRLIGIDPVA KRRVLEAIIQ RP
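The coordinate and length fields in this record can be cross-checked against each other. The sketch below is only an illustration, not part of the database record: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes one terminal stop codon (the gene sequence above ends in TAA). The function names `span_length` and `expected_protein_length` are hypothetical helpers introduced here.

```python
# Values copied from the record above.
START_BP = 1933295
END_BP = 1934953
GENE_LENGTH_BP = 1659
PROTEIN_LENGTH_AA = 552


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one stop codon, assuming an unspliced CDS."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both comparisons hold for the values listed in this record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the listed gene length (1659 bp) and protein length (552 aa) agree with the start and end coordinates.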