Gene Ava_3987 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_3987 
Symbol 
ID3679661 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp4956664 
End bp4958652 
Gene Length1989 bp 
Protein Length662 aa 
Translation table11 
GC content43% 
IMG OID637719339 
ProductAMP-dependent synthetase and ligase 
Protein accessionYP_324487 
Protein GI75910191 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATATTT CCCAAAATGT GGAGCGTGGT TGCTGCCTTT TTCCTAACAA ACCAGCGCTC 
ATTTTTGAAG GTTTATATTT TACTTATAAA CAACTCAATG AAATGGCAAA TCGCGTTGCC
AATGCTTTAC TGGGGTTGGG GATTGAACGT GGCGATCGCA TAGCATTATT ATTACCAAAT
ATTCCTGAAT TTGTGATTTC CTATCTAGGG ATTCTCAAGA TTGGGGCGAT CGCAGTTTCT
ATTAACCCAA ATCTGCAAAG CGATGAACTC AAGTTTATTC TCAATGACTG TGGAGCAGCT
GTACTCGTAA CCACAGAAAC GCTGCGAGAA AAATTGCCCA AAGTCGATTT ACCCCACCTC
AAGCACATCA TAATTGCTGA GGGGCAAGCA GGTGAGGCGA TCGCCTTGAG TGAATTCATG
GCCAATGCTT CCCCTAACGC CCGTGCTGTG GAGATAGAAC GTGATGAGCC AGCAGCGATT
CTTTATACTT CTGGTACAAC AGGTTTTCCT AAAGGCGCTA CTTTATCTCA TGGCAATGTG
ATTTCCAATA TGCACTCCAT GAAGCACTGT TGTGAAATGC GCCCTAATGA TCAAATTTTA
CTATTTTTAC CGATGTTCCA CTGTTTTGGG CAGAACGCAG TTCTCAATAG TGGACTGAAT
ACCTGTGCAA CGATCATTTT ACAGCGATCG TTCGACCCAG AAACGGTACT GACAACCATC
AGCGAATATA ATATTACAAT CTTTTTTGGT GTTCCCACTA CTTTTATTCT TTTATGTGAT
AAAGCATCTA TCCGCGATTT GGATTCAGTG CGTTACTACT TCTCTGCGGC CGCAGGTTTG
CCTGTAGAAA TTGCCAAACG TTGGCAAGAC AAGTTTGGTA AAGTCATCAA CCAAGGGTAT
GGTCTGACAG AAACATCACC ATTAGCTAGT TATAACCACG AATTGAGGTA CAAACTCGGT
TCTATTGGTT CACCAATTGA AAATGTCGAG ATGAAGATTG TCAGCCTGGA TGATGGTTGC
GAAGTTGCCC CTGGTGAACT CGGTGAAATC GTGATTCGCG GTGTCAATGT CATGCTAGGT
TACTGGAATC GCCCGGCTGA AACTGCCAAA GCCATGAAGA ACGGATGGTT TCACACCGGT
GACATTGGTC AAATAGACGA ATTAGGCTAC TTTTACATCG TTGACCGCCT CAAAGATATG
ATCAACAACG GTGGATTAAA AGTATACCCA GCCGAAGTTG AAAATGTTAT TTACCAGCAT
CCAGGTATTG CAGAGGTAGC TGTCTACGGT GTACCAGATT CAGTACTAGG TGAACAGGTG
AAAGCTAGTA TTGTCCTCAA ACCAGACCAA GCGGTTACAG AAGCAGAAAT CATTGCCTTC
TGTTACCAGA AACTAGCTCA ATATAAAGTT CCTAGTGCCG TCGAATTTGT CTCCTCCATC
CCCAAGAACC CCACAGGCAA AATACTCAAG CGGCTACTTC GGCAAGAAAA TTCTGCTGCG
CCTTCCCATA CTGTGGTTAG CAAAACGCAA ACATCTGCGT CGGTAAAAGT CTCTTACCAA
ACTGCAGAAT TGATCGAAAA CTGGATTATC GATTGGGTAG TGAGAAAATT GGCAGTAGCA
GCCCAGTCAA TCGACCAAAG TAAGTCATTT GCAGATTATG GATTAGATTC CGTCAGGGCT
GTCAAGTTAG CCCAAGAATT GAGTGAATGG TTGGGATATC CCTTAGAAGC AACTATAGTC
TGGAACTTCT CGACCATCGA ATCTTTAGCA CGCCACCTAG CCAGTCAAAA AATTACCCAA
CCGACAGAAT TAGCCAAGAC AAAGCCAGAG TCTAATCTGC ACACAGAAAA TCTGTTGGTG
TCTGTAGAAT TGCCGACAAA CAACGCAGAA CCGACCGAAT CAGCAGACTT AAAAGCACTC
TCAGACGCGG AAATAGCTGA ATTACTCACC AAAGAAATTG CCACAGTTAA ACAAAGGAGA
TTAGTATGA
 
Protein sequence
MNISQNVERG CCLFPNKPAL IFEGLYFTYK QLNEMANRVA NALLGLGIER GDRIALLLPN 
IPEFVISYLG ILKIGAIAVS INPNLQSDEL KFILNDCGAA VLVTTETLRE KLPKVDLPHL
KHIIIAEGQA GEAIALSEFM ANASPNARAV EIERDEPAAI LYTSGTTGFP KGATLSHGNV
ISNMHSMKHC CEMRPNDQIL LFLPMFHCFG QNAVLNSGLN TCATIILQRS FDPETVLTTI
SEYNITIFFG VPTTFILLCD KASIRDLDSV RYYFSAAAGL PVEIAKRWQD KFGKVINQGY
GLTETSPLAS YNHELRYKLG SIGSPIENVE MKIVSLDDGC EVAPGELGEI VIRGVNVMLG
YWNRPAETAK AMKNGWFHTG DIGQIDELGY FYIVDRLKDM INNGGLKVYP AEVENVIYQH
PGIAEVAVYG VPDSVLGEQV KASIVLKPDQ AVTEAEIIAF CYQKLAQYKV PSAVEFVSSI
PKNPTGKILK RLLRQENSAA PSHTVVSKTQ TSASVKVSYQ TAELIENWII DWVVRKLAVA
AQSIDQSKSF ADYGLDSVRA VKLAQELSEW LGYPLEATIV WNFSTIESLA RHLASQKITQ
PTELAKTKPE SNLHTENLLV SVELPTNNAE PTESADLKAL SDAEIAELLT KEIATVKQRR
LV