Gene Aazo_1862 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_1862 
Symbol 
ID9339655 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp1933295 
End bp1934953 
Gene Length1659 bp 
Protein Length552 aa 
Translation table11 
GC content45% 
IMG OID 
Productcarbonate dehydratase 
Protein accessionYP_003721083 
Protein GI298490906 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.512714 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTAGTCC GCAGCACGGC GGCACCCCCA ACCCCGTGGT CAAGGAGTTT AGCTGAACCC 
GATATCCATC AAACCGCATT TGTACATTCT TCTTGTAATT TAATTGGGGA TGTACACCTG
GGTCAAAATG TTATTATTGC TCCAGGAACT TCGATTAGAG CAGATGAAGG AACACCTTTT
TTTATTGGTG AAAATACTAA TATTCAAGAT GGTGTAGTAA TTCATGGGTT GGAGCAAGGC
CGAGTAATTG GTGATGATGG CAAAAACTAC TCGGTATGGG TTGGTAAAGA TGCTTCTATT
ACCCACATGG CGCTGATTCA TGGGCCAGCT TATGTAGGGG AAAGTTGCTT TATTGGCTTT
AGGTCTACAG TATTTAATGC CAGGGTAGGG GCTGGTTGCA TCGTGATGAT GCACGCTCTA
ATCCAAGATG TAGAAATTCC ACCAGGTAAA TACGTAGCAT CTGGGTCGAT AATTACTATG
CAGCAGCAAG CTGACCGATT GCCAGATGTA CAAGCTCAGG ATCAGCAATT CGCTCACCAC
GTTGTGGGGA TAAATCAGGC TTTGCGGGCT GGTTATCGCT GTGTAGAGGA TATTAAGTGT
ATTGCCCCGA TTCGGGACGA GCTTAATCTG TCTGGTGATA GATCTTATAC AAGTATTATT
GTTGACGAAT TGGAAAGGAG CAGTGAAGTG GCAAGCAAAT TGGGTGCAGA AATAGTAGAT
CAGGTACGTT ATCTACTGAA TCAAGGTTAC AAAATTGGTA CAGAACACGT AGACCAACGT
CGTTTCCGTA CAGGCTCTTG GCAAAGCTGC CAGCCTATTG AAACCAGATC ATTAGGAGAA
GCGATCACAG CATTGGAATC TTGTCTAATA GACCACAGTG GCGAGTACGT GCGTTTGTTC
GGCATTGACA ACGGCAGAAA ACGGGTATTA GAAACTATTA TCCAACGTCC TGATGGTGTA
GTAGCTACAA GTACATCTAG TTTTAAAACT CCTGCTGCAT CTTACAGCAG CTACAACGGT
AATGGTAACA GTAACGGTGC AGTTGCTAGT GGCAGCCTCA GTGCTGAAAC AGTGAACCAA
ATTCGCCAGC TCTTAGCTAA TGGTTACAAA ATTGGTACAG AACACGTAGA CCAACGTCGT
TTCCGTACAG GCTCTTGGCA AAGCTGTAAC CCTATTGAGG CAACCTCAGC TAATGATGTA
GTTGCTGCTT TGGAAGAATG CATGACTTCT CATCAAGGCG AATATGTGCG GTTAATTGGC
ATTGACAGCA AAGCCAAACG TCGTGTATTG GAAGCAATTA TCCAACGTCC TAACGGTCAA
GTAGTATCCT CCGGTAGTGC TAAAACATCA GGTACTTTAT ACAGTGGTGC AACTGCAAGT
GCCACTGCAA CTAGCACCCG CTTGAGTACC GAAGTAGTAG ACCAACTGAA ACAGTTGTTA
ACAGGTGGTT TTAAGATTAG TGTTGAACAC GTAGACCAAC GTCGTTTCCG TACAGGCTCT
TGGGTAAGCT GCGGTCAAAT TCAGGCTACA TCTGAAAGAG ATGTGCTCGC TGCACTAGAA
GCTGTTATCT CTGAATATGC AGGTGAATAC GTGCGTTTAA TCGGAATCGA CCCCGTAGCC
AAACGCCGCG TGTTGGAAGC AATCATCCAA CGTCCATAA
 
Protein sequence
MVVRSTAAPP TPWSRSLAEP DIHQTAFVHS SCNLIGDVHL GQNVIIAPGT SIRADEGTPF 
FIGENTNIQD GVVIHGLEQG RVIGDDGKNY SVWVGKDASI THMALIHGPA YVGESCFIGF
RSTVFNARVG AGCIVMMHAL IQDVEIPPGK YVASGSIITM QQQADRLPDV QAQDQQFAHH
VVGINQALRA GYRCVEDIKC IAPIRDELNL SGDRSYTSII VDELERSSEV ASKLGAEIVD
QVRYLLNQGY KIGTEHVDQR RFRTGSWQSC QPIETRSLGE AITALESCLI DHSGEYVRLF
GIDNGRKRVL ETIIQRPDGV VATSTSSFKT PAASYSSYNG NGNSNGAVAS GSLSAETVNQ
IRQLLANGYK IGTEHVDQRR FRTGSWQSCN PIEATSANDV VAALEECMTS HQGEYVRLIG
IDSKAKRRVL EAIIQRPNGQ VVSSGSAKTS GTLYSGATAS ATATSTRLST EVVDQLKQLL
TGGFKISVEH VDQRRFRTGS WVSCGQIQAT SERDVLAALE AVISEYAGEY VRLIGIDPVA
KRRVLEAIIQ RP