Gene Aazo_2042 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_2042 
Symbol 
ID9339835 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp2121779 
End bp2123287 
Gene Length1509 bp 
Protein Length502 aa 
Translation table11 
GC content42% 
IMG OID 
ProductUbiD family decarboxylase 
Protein accessionYP_003721222 
Protein GI298491045 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.527905 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGCGAG ATTTACGGGG ATTTATCAAA ATCCTAGAAC AAAGAGGACA ATTAAAGCGA 
ATTTCAGCTT TAGTTGACCC AAATATGGAA ATTGCCGAAA TTTCCAACCG GATGCTACAA
AAAGGTGGGA CAGGATTAAT CTTTGAAAAT GTCAAAGGTG CATCTTTCCC CGTTGCTGTC
AATTTAATGG GGACACTGGA AAGGATATGC TGGGCGATGA ACATGGAAAA ACCAGAGGAA
TTGGAAACCT TGGGAAAGAA ACTGAGTATG CTTCAGCAAC CCAAACCACC TAAAAAGATT
TCCCAAGCGA TAGACTTTGG GAAAGTGCTG TTTGATGTAG TGAAAGCGAA ACCAGGAAGG
GATTTTTTTC CTGCTTGTCA ACAGGTGGTA GTGGAAGGGG ATAATGTAGA TTTAAATAAG
TTGCCGTTGA TACGTCCTTA TCAGAGAGAT GCCGGAAAAA TTATTACGCT AGGATTGGTA
ATTACCAAGG ATTATGACAC AGGAACGCCC AATGTTGGTG TATATCGGCT ACAACTGCAA
TCTAAGAACA CCATGACAGT ACACTGGTTA TCAGTGCGGG GTGGTGCGAG ACATTTACGC
AAAGCAGCGG AATTTGGTAA AAAATTAGAA ATTGCGATCG CACTTGGTGT TGATCCTTTA
ATTATTATGG CAGCAGCCAC ACCCATTCCT CTAGACTTAT CAGAATGGTT ATTTGCAGGG
CTTTATGGTG GTTCAGGGGT ACAATTAGCC AAGTGTAAAA CAGTAGATTT AGAAGTTCCC
GCAGATTCAG AATTTGTCTT AGAAGGAACA ATTACACCAG GGGAAGTTTT ACCCGATGGA
CCCTTTGGCG ATCACATGGG ATATTATGGT GGCGTGGAAG ATTCGCCATT GGTACGCTTC
CAGTGTATGA CTCACCGCAA AGATCCAATT TATCTGACTA CATTTAGCGG TCGTCCACCC
AAAGAAGAAG CTATGATGGC GATTGCACTC AACCGCATCT ATACCCCTAT ATTACGGCAA
CAAGTATCAG AAATAGTCGA TTTCTTCCTA CCCATGGAAG CATTAAGTTA CAAAGCTGCG
ATTATTTCTA TAGATAAAGC TTACCCTGGA CAAGCAAGAA GAGCAGCCTT AGCTTTTTGG
AGTGCATTAC CCCAATTCAC ATACACTAAA TTTGTGATTG TTGTTGATAA ACATATCAAC
ATTCGTGATC CACGTCAGGT TGTCTGGGCA ATTAGTTCTA AAGTAGACCC TTCACGGGAT
GTATTCATAT TACCAAAGAC ACCCTTTGAC ACCTTAGACT TTGCTAGTGA AAAACTCGGG
TTAGGGGGCA GAATGGGCAT AGATGCAACT ACCAAAATTC CCCCAGAAAC TGAACATGAA
TGGGGTGAAC CATTAGAATC AGATCCTGAT ATTGCTGCAA TGGTAGAAAG ACGCTGGGCA
GAATATGGTT TAGCAGATTT AAAACTAGGA GAAGTAGACC CCAATTTGTT TGGTTATGAT
ATGAAGTAA
 
Protein sequence
MARDLRGFIK ILEQRGQLKR ISALVDPNME IAEISNRMLQ KGGTGLIFEN VKGASFPVAV 
NLMGTLERIC WAMNMEKPEE LETLGKKLSM LQQPKPPKKI SQAIDFGKVL FDVVKAKPGR
DFFPACQQVV VEGDNVDLNK LPLIRPYQRD AGKIITLGLV ITKDYDTGTP NVGVYRLQLQ
SKNTMTVHWL SVRGGARHLR KAAEFGKKLE IAIALGVDPL IIMAAATPIP LDLSEWLFAG
LYGGSGVQLA KCKTVDLEVP ADSEFVLEGT ITPGEVLPDG PFGDHMGYYG GVEDSPLVRF
QCMTHRKDPI YLTTFSGRPP KEEAMMAIAL NRIYTPILRQ QVSEIVDFFL PMEALSYKAA
IISIDKAYPG QARRAALAFW SALPQFTYTK FVIVVDKHIN IRDPRQVVWA ISSKVDPSRD
VFILPKTPFD TLDFASEKLG LGGRMGIDAT TKIPPETEHE WGEPLESDPD IAAMVERRWA
EYGLADLKLG EVDPNLFGYD MK