Gene Ava_4795 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_4795 
Symbol 
ID3679410 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp6025646 
End bp6027085 
Gene Length1440 bp 
Protein Length479 aa 
Translation table11 
GC content48% 
IMG OID637720152 
Productzeta-carotene desaturase / three-step phytoene desaturase 
Protein accessionYP_325287 
Protein GI75910991 
COG category[S] Function unknown 
COG ID[COG3349] Uncharacterized conserved protein 
TIGRFAM ID[TIGR02731] phytoene desaturase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000234203 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGAGTAG CGATCGCTGG CGCTGGTCTA GCAGGACTTT CCTGCGCGAA ATATCTCACG 
GACGCAGGTC ACACTCCCAT AGTCTTAGAG CGTCGGGACG TATTGGGTGG CCTTGTGGCG
GCGTGGAAAG ACTCTGACGG CGACTGGTAC GAAACCGGGT TACACGCCTT CTTTGGGGCA
TATCCCAATA TGCTGCAATT ACTGAAGGAG TTGGGCATTG AAGACCGACT CCAGTGGAAA
GAACATACAC TGATTTTTAA TCAACCAGAT AAACCCGGAA CACTCTCACG TTTTGATGTT
CCTGATATCC CCTCTCCCTT TAACATCATT GCCTCGATCC TTCGCAACAA CGATATGTTG
ACTTGGGAAC AGAAGATTAG GTTCGCTATT GGACTGCTTC CAGCAATAGT TCGAGGCCAG
AAGTATGTTG AGGAGATGGA CAAGTACAGC TTCTCCGATT GGTTGAAAAG GCAAGGTGTG
GGTGAGCGGG TAGCAAGTGA CGTGTTCATC GCCGCATCCA AGGCTTTAAC CTTTATTAAT
CCCGATGAGG TTTCCTCGAC AATTCTATTA ACAGCCCTAA ATCGCTTTCT GCAAGAGCGA
TATGGCTCCA AAATAGCCTT TTTGGATGGT TCTCCCACAG AACGACTGTG CCAACCAATC
GTTGATTACA TCACCGAACG AGGTGGAGAA GTCAGGCTCA ACGCCCCTCT AAAAGAGATT
TTGCTCAACC CGGATGGTAC AGTAAAAGGG TTCTTGCTGC GCGGGTTAAA TGGAGAACCA
GATGAAATGA TTACGGCAGA CTTTTACGTG TCAGCTATGG CAGTTGACCC ATTAAAAGTC
ATGTTGCCAC AACCTTGGCA GCAAATGGAG TTTTTCCAGA AGCTAGAAGG TTTAGAAGGC
GTACCAGTAA TTAACCTCCA TCTGTGGTTT GATCGGAAAT TAACAGACAT TGATCACCTG
TTGTTTTCGC GATCGCCCCT CCTCAGCGTT TATGCTGATA TGAGTAACAC TTGTCGTGAA
TATGCTAATC CTGACCGCTC AATGCTGGAA TTAGTTCTAG CTCCCGCCAA AGACTGGATT
AGTAAATCCG ACGAGGAAAT CGTCTCTGCT ACTATGGTCG AATTGGAAAA ACTCTTCCCC
GACCACTTTA AGGGCGATAA TCCAGCAAAA TTGCTGAAAT CTCACGTCGT AAAAACGCCG
CGTTCAGTTT ACAAAGCGAC TCCTGGTCGT CAACAGTACC GTCCAGCCCA AAAAACCCCC
ATTGCCAATT TCTTTCTAAG TGGGAGTTAC ACCATGCAAC GCTATTTAGG CAGTATGGAA
GGGGCCGTAC TTTCTGGTAA GCTAACAGCG CAGGCGATTT GTGAATCGCT GCCAGAGGAC
AACACCTCAA ACCTGCAAAC GCTAACCCGA CCGCCTGCAA CGAATGCTGC AACTGCCTGA
 
Protein sequence
MRVAIAGAGL AGLSCAKYLT DAGHTPIVLE RRDVLGGLVA AWKDSDGDWY ETGLHAFFGA 
YPNMLQLLKE LGIEDRLQWK EHTLIFNQPD KPGTLSRFDV PDIPSPFNII ASILRNNDML
TWEQKIRFAI GLLPAIVRGQ KYVEEMDKYS FSDWLKRQGV GERVASDVFI AASKALTFIN
PDEVSSTILL TALNRFLQER YGSKIAFLDG SPTERLCQPI VDYITERGGE VRLNAPLKEI
LLNPDGTVKG FLLRGLNGEP DEMITADFYV SAMAVDPLKV MLPQPWQQME FFQKLEGLEG
VPVINLHLWF DRKLTDIDHL LFSRSPLLSV YADMSNTCRE YANPDRSMLE LVLAPAKDWI
SKSDEEIVSA TMVELEKLFP DHFKGDNPAK LLKSHVVKTP RSVYKATPGR QQYRPAQKTP
IANFFLSGSY TMQRYLGSME GAVLSGKLTA QAICESLPED NTSNLQTLTR PPATNAATA