Gene Aazo_3688 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAazo_3688 
Symbol 
ID9341493 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism name'Nostoc azollae' 0708 
KingdomBacteria 
Replicon accessionNC_014248 
Strand
Start bp3751551 
End bp3753008 
Gene Length1458 bp 
Protein Length485 aa 
Translation table11 
GC content40% 
IMG OID 
Productcarotenoid oxygenase 
Protein accessionYP_003722366 
Protein GI298492189 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAAACAA TTGATAAAAA GTCAAATAAA AAAAGTTGGG CAAAAGCTAT ATCTCAACCT 
GCAACTGAAT TTCCTCCTAC ACAATTATCT GTTCTCGCGG GTAAAATCCC TGATGGTCTA
CGGGGTACAC TTTACCGCAA CGGTGCAGCC AGATTAGAAC GGGGTGGTGT ATCAGTAGGA
CACTGGTTTG ATGGGGATGG GGCTATTTTA GCTGTCCATT TTACCGATGC TGGTGCTACC
GGGGTTTATC GCTATGTGCA GACTGCTGGT TATCAAGAAG AAACTGCTAG GGGTAAGTGG
CTGTATGGTA ATTATGGTAT GACTGCACCG GGGGCTATTT GGAATCAATG GCGAAAACCA
GTCAAACACA CAGCGAATAC TTCGGTTTTG GCATTACCTG ATCAACTTTT AGCCTTATGG
GAAGGGGATA ATCCCTATGC ACTAGATTTA GAGTCTTTAG AGACCAAAGG TTTAAATAAT
TTAGGTGGTT TGGCTAAAGG ACAACCCTAT TCTGCTCATC CTAAAATTGA CTATGGAACA
GGAGAAATCT TTAATTTTGG CATGAGTCCG GGTCCTAATG CCATACTGCA TATTTATAAA
AGTGACTTCA CTGGTAAGAT TCTCAAAAAA GCAAAGTTGA CCTTAAAAGG TTTTCCCATA
ATACATGATT TTGTGTTAGC AGGACAATAT TTAGTATTTT TTGCTCCTGC GGTGCGGTTA
AATATTTGGT CTGTTCTGTT TGGAACTAGT ACCTATAGTG ATTCCTTAAC CTGGCAACCG
GATCAGGGAA CTAAGATTAT AGTGATTGAT AGAGAAACTT TGTCTGTAGT CAGTCGTGGT
GTAACTGATC CTTGGTTTCA GTGGCATTTT GCTAATGGTT ATGTTGATGA TAGCGGTACA
GTAATTATTG ATTTTGCGAA ATATGCAGAT TTTCAGACTA ATGAATATTT GCGCCAAGTA
GCGACCGGGG AAACTCAAAC AGTTCCTAAA AATACTTTGA CGCGGGTACA AGTTAATCCA
CAATCTGGCA AAGTAACGGG AATTGAAACT TTGTTAAATA GAACTTGTGA ATTTCCTCAT
GTTCCTACCA AAAATGTGGG TAAGTTCTCT CGTTATAGCT ATATGTCTAT TTTTCGGGAA
GGAACAGATA CAAAGGGAGA AATCTTAAAT AGCATTGCTA GTTTTGATCA TCAAACTGCA
ACTCTGACAG AAGCTAATTT ACGTGAAAAT CTTTATCCTT CTGAACCTAT TTATGCTCCT
GATCATCAAA ATTCCCACCG AGGTTGGGTC TTAACTGTGG TTTATGATGG TAATACTGAT
AGTAGCGAAG TTTGGCTATT TAATAGCAAT ACTCTAGGTG GGGGCAGTCC ACGCTTGGAT
GAAGAACCAG TTTGTAAAAT AGGATTACCC AGCGTTATTC CCCACAGCTT CCACGGTACT
TGGAAACCTG GGGACTAG
 
Protein sequence
MQTIDKKSNK KSWAKAISQP ATEFPPTQLS VLAGKIPDGL RGTLYRNGAA RLERGGVSVG 
HWFDGDGAIL AVHFTDAGAT GVYRYVQTAG YQEETARGKW LYGNYGMTAP GAIWNQWRKP
VKHTANTSVL ALPDQLLALW EGDNPYALDL ESLETKGLNN LGGLAKGQPY SAHPKIDYGT
GEIFNFGMSP GPNAILHIYK SDFTGKILKK AKLTLKGFPI IHDFVLAGQY LVFFAPAVRL
NIWSVLFGTS TYSDSLTWQP DQGTKIIVID RETLSVVSRG VTDPWFQWHF ANGYVDDSGT
VIIDFAKYAD FQTNEYLRQV ATGETQTVPK NTLTRVQVNP QSGKVTGIET LLNRTCEFPH
VPTKNVGKFS RYSYMSIFRE GTDTKGEILN SIASFDHQTA TLTEANLREN LYPSEPIYAP
DHQNSHRGWV LTVVYDGNTD SSEVWLFNSN TLGGGSPRLD EEPVCKIGLP SVIPHSFHGT
WKPGD