Gene Ava_4537 details

Gene Information

Locus tag: Ava_4537
Symbol:
ID: 3680141
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anabaena variabilis ATCC 29413
Kingdom: Bacteria
Replicon accession: NC_007413
Strand:
Start bp: 5687551
End bp: 5688753
Gene Length: 1203 bp
Protein Length: 400 aa
Translation table: 11
GC content: 44%
IMG OID: 637719893
Product: HEAT repeat-containing PBS lyase
Protein accession: YP_325030
Protein GI: 75910734
COG category: [C] Energy production and conversion
COG ID: [COG1413] FOG: HEAT repeat
TIGRFAM ID:
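
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical and used only for illustration.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene_length_bp, protein_length_aa) for a contiguous CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_length = end_bp - start_bp + 1
    codons, remainder = divmod(gene_length, 3)
    if remainder != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # The stop codon does not encode an amino acid.
    return gene_length, codons - 1


# Values taken from the record for Ava_4537.
print(cds_lengths(5687551, 5688753))  # (1203, 400)
```

Both values agree with the Gene Length (1203 bp) and Protein Length (400 aa) fields.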


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal


Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal


Sequence

Gene sequence
GTGAATAATA TCGGTCAACT TTTAGTACAA GCGCAGGCGG CGCATAGTGC AGATGATTTG 
TCATTGGTGA TTAATGATCT GCAACAGTTG ATTTTAGTGG ATAGCAATGA AATAGGTCAC
TCAGAACAAC TGCTGAAATT AGCCCTCTCT ATTTTTGAGA TGGGAGATTT TCAGCAACGT
TGGGATATTG CCAAGGTGTT AGTTCATTTG GGAACTGCGA CCATCAACCC ATTAATTGAC
ATCCTAGAAG ATGAAGATAC AGAGGATGAA TTGCGCTGGT TTGCTGCCCG AATTTTAGGT
GAATTACAGC ACCCAGATGC GATTATGCCT CTGGTAGAAC TATTGAAAAC TAGTGAGGAG
GATGAACTGA AAGCGATCGC CTCATCCGCA ATAGCGCAAA TGGGTAGTAT GGCTATTCCT
GTGCTTGTAG AACTGCTAAC ACAAGAAAAT ACAAGACTTT TGGCAGTGCG ATCGCTTGCT
TATATTCGGC GTACAGAAAC TATTGCGCCT TTATTGAGTG TGGTGCAAGA TACTCAAGCC
TCCGTCCGCG CTGCCGCACT CGAAGCCCTC AGCAGTTTTC ATGATGAACG TGTACCACCC
ATACTGTTGA ACGCTTTAAA TGATTTATCT GCCGCAGTCA GACGTACAGC GATTCAGGGT
TTAAGTTTTC GCTCTGATTT ATCTTCAGAA TTAAATTTAG TCGCCACATT ACAACCCAAA
TTGTACGACT TTAGTGTGGA GGTGTGTTGT GCAGCTGCCA ATGCTCTTGC CCGGATGGGT
GGTGATGACG CAGCCAAGCA CCTATTCACA GGTTTGATAT CACCTCACAC ACCTATTACC
TTACAACTGG AAATTATTCG CGCTTTGAAT TGGCTAGAGT CACTAAAGGC GCTGGAGTAT
TTACGACAAG CTTTAAATCA AGTCACTTCC ATAACTCTTT GCCAAGAAGT TGTCACAGTT
CTAGGAAGAA CACAAAAGCC TGAGTTAAAA ATACCAGCCA CAGCCATTCT GTTAGAGATA
CTGAACTCGC CACATCCAGC TATAAAAACT AATAGTGTGA AAAGTGCGAT CGCTTTATCT
TTGGGTCAGC TAGGTAGTCC AGAAGCAACG GAAAGCTTAA CCATGCTGTC ACAAGATACA
GATGAACTTG TCCGAGTCCA TGCGATCGCT GCACTCAATA AGCTAGCCCC TGCTGCTGTA
TAA
 
Protein sequence
MNNIGQLLVQ AQAAHSADDL SLVINDLQQL ILVDSNEIGH SEQLLKLALS IFEMGDFQQR 
WDIAKVLVHL GTATINPLID ILEDEDTEDE LRWFAARILG ELQHPDAIMP LVELLKTSEE
DELKAIASSA IAQMGSMAIP VLVELLTQEN TRLLAVRSLA YIRRTETIAP LLSVVQDTQA
SVRAAALEAL SSFHDERVPP ILLNALNDLS AAVRRTAIQG LSFRSDLSSE LNLVATLQPK
LYDFSVEVCC AAANALARMG GDDAAKHLFT GLISPHTPIT LQLEIIRALN WLESLKALEY
LRQALNQVTS ITLCQEVVTV LGRTQKPELK IPATAILLEI LNSPHPAIKT NSVKSAIALS
LGQLGSPEAT ESLTMLSQDT DELVRVHAIA ALNKLAPAAV
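
The gene sequence begins with GTG rather than ATG, yet the protein begins with M. Under translation table 11 (bacterial, archaeal and plant plastid code), GTG is one of the permitted start codons, and an initiator codon is read as methionine. The following is a minimal sketch of that translation, not the database's own procedure. It assumes the first codon of the input is the true start codon, and the helper name `translate_cds` is hypothetical.

```python
from itertools import product

# Codon-to-amino-acid assignments in T, C, A, G order (third base varies
# fastest). Tables 1 and 11 share these assignments; they differ in which
# codons may act as start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna):
    """Translate a CDS, reading the first codon as Met (assumed start codon)."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        # Assumption: position 0 is the initiator codon, so it encodes Met.
        protein.append("M" if i == 0 else aa)
    return "".join(protein)


# First line of the gene sequence above.
first_line = "GTGAATAATA TCGGTCAACT TTTAGTACAA GCGCAGGCGG CGCATAGTGC AGATGATTTG"
print(translate_cds(first_line))  # MNNIGQLLVQAQAAHSADDL
```

The output matches the first 20 residues of the protein sequence listed above.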