Gene Aazo_4788 details


Gene Information

Locus tag: Aazo_4788
Symbol:
ID: 9342595
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: 'Nostoc azollae' 0708
Kingdom: Bacteria
Replicon accession: NC_014248
Strand:
Start bp: 4890717
End bp: 4892156
Gene Length: 1440 bp
Protein Length: 479 aa
Translation table: 11
GC content: 42%
IMG OID:
Product: carotene 7,8-desaturase
Protein accession: YP_003723085
Protein GI: 298492908
COG category:
COG ID:
TIGRFAM ID:
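
The start and end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates (the record does not state its convention) and a CDS that ends in a single untranslated stop codon; the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 4890717
end_bp = 4892156
gene_length_bp = 1440
protein_length_aa = 479

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS includes one terminal stop codon that is not translated.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                  # True
print(span_bp % 3 == 0)                           # True
print(expected_protein_aa == protein_length_aa)   # True
```

Under those assumptions the 1440 bp span corresponds to 480 codons, of which 479 encode amino acids.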


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGTGTTG CAATAGTAGG TGCGGGACTG GCTGGGCTGT CCACTGCTGT AGATTTAGCT 
GATGCGGGTT GTGAGGTACA AATTTTCGAA TCTCGTCCGT TTGTTGGTGG TAAAGTCGGT
AGTTGGGTAG ATGGTGATGG CAACCACATT GAAATGGGGT TGCACGTATT TTTTGGGTGC
TATTACAATC TGTTTGAGTT GATGGAAAAG GTGGGAGCAG GTGAAAATTT ACGTCTTAAG
GAACATTCCC ATACTTTTAT CAACAAAGGT GCACGTACTG GGGCTTTAGA TTTTCGTTTC
ATCACTGGTG CGCCTTTTAA TGGCTTAAAA GCATTTTTCA CCACTTCCCA ACTTTCCCTA
CAGGATAAAT TACAAAATGT GATCGCATTG GGAACTAGCC CCATTGTTCA GGGTTTAATA
GACTTTGACG GGGCAATGAA AAATATCCGC AATTTAGATA AAATTAGCTT TTCTGACTGG
TTTTATCGTC ATGGTGGCAG TAAAGGCAGC ATCAAACGGA TGTGGAATCC TATTGCCTAT
GCTCTCGGTT TTATCGACTG CGATCATATT TCTGCCCGTT GTATGTTGAC AATTTTCCAG
TTTTTTGCAG TCAAAACTGA AGCTTCTATT CTGCGAATGC TGGAAGGTTC ACCTCACGAA
TATTTACACA AGCTAATTAT TGAATATCTT GAAGCCAGAG GCACAAAAAT ATATACCCGT
CGTCAAGTGC GGGAAATTCA CTTTGCTGAA TCAGAGGCAG AAACCCGCGT TACTGGTATA
GTTGTTGCCC AAGGTGATAC AGAAGAAATA ATTACCGCTG ACGCTTACGT TTGTGCTTGT
GATGTGCCAG GAATTCAGCG CGTTTTACCG CAAGCATGGC GGAAATGGTC AGAATTTGAC
AACATTTATA AACTTGATGC TGTCCCAGTG GCTACAGTGC AGCTACGGTT TGATGGTTGG
GTAACGGAAT TGGAAGATGA GGAAAAACGG AAACAGTTAA ATCAGGCCGA AGGGATAGAT
AATTTGTTGT ATACCGCCGA TGCTGATTTT TCTTGTTTTG CTGATTTAGC TTTAACCAGT
CCAGCAGATT ATTATCGTCC TAGGGAGGGT TCATTGTTGC AACTGGTATT GACACCGGGA
GATCCTTTTA TTAAAGAAGG TAATGAAGTG ATCGCACAGC ACGTCCTCAA ACAAGTCCAT
GAACTCTTCC CATCGTCAAG AGAATTGAAC ATGACTTGGT ACAGTGTAGT TAAACTAGCT
CAATCCTTAT ATAGAGAAGC ACCAGGAATG GACGCTTACC GTCCTGACCA AAAAACACCT
ATACCCAACT TCTTCCTAGC TGGTAGTTAT ACCCAGCAAG ACTACATTGA CAGCATGGAA
GGTGCAACTA TTTCTGGAAG ACTTGCAGCT AAAGTGATTC TGGAAAGTTT GAATAATTAG
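
The GC content reported in the record could in principle be recomputed from the gene sequence. This is a minimal sketch, assuming the sequence is pasted as whitespace-separated blocks as shown above; `gc_fraction` is a hypothetical helper, not part of any database tooling.

```python
def gc_fraction(seq_text: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace and case."""
    bases = "".join(seq_text.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for b in bases if b in "GC")
    return gc / len(bases)


# Usage example on the first line of the gene sequence only.
first_line = "ATGCGTGTTG CAATAGTAGG TGCGGGACTG GCTGGGCTGT CCACTGCTGT AGATTTAGCT"
print(f"{gc_fraction(first_line):.0%}")
```

Passing the full gene sequence instead of `first_line` would give a value to compare against the GC content field in the record.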
 
Protein sequence
MRVAIVGAGL AGLSTAVDLA DAGCEVQIFE SRPFVGGKVG SWVDGDGNHI EMGLHVFFGC 
YYNLFELMEK VGAGENLRLK EHSHTFINKG ARTGALDFRF ITGAPFNGLK AFFTTSQLSL
QDKLQNVIAL GTSPIVQGLI DFDGAMKNIR NLDKISFSDW FYRHGGSKGS IKRMWNPIAY
ALGFIDCDHI SARCMLTIFQ FFAVKTEASI LRMLEGSPHE YLHKLIIEYL EARGTKIYTR
RQVREIHFAE SEAETRVTGI VVAQGDTEEI ITADAYVCAC DVPGIQRVLP QAWRKWSEFD
NIYKLDAVPV ATVQLRFDGW VTELEDEEKR KQLNQAEGID NLLYTADADF SCFADLALTS
PADYYRPREG SLLQLVLTPG DPFIKEGNEV IAQHVLKQVH ELFPSSRELN MTWYSVVKLA
QSLYREAPGM DAYRPDQKTP IPNFFLAGSY TQQDYIDSME GATISGRLAA KVILESLNN