Gene Ava_0594 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_0594 
Symbol 
ID3678624 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp747405 
End bp750536 
Gene Length3132 bp 
Protein Length1043 aa 
Translation table11 
GC content43% 
IMG OID637715922 
Producthypothetical protein 
Protein accessionYP_321113 
Protein GI75906817 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCTACA ATTTGGATAG TTTTTTAATT TATTCCTGGT TCTCCTTGGG AATAGCAGGC 
ACAAATAAAT TGCCGGAAGT AATAGCTCCT ACAGCAGTCC CTATTAGTTT TTCTGGGACG
AAAATTTTAG TGGCTTTATT GGCTGGTACT CTGATGGCGA TCGCCTTCCA TTTACTATTA
ACTAATCTTT CTGTTGCCGT TGGCATTTCT ACTTACGGCA GAAATCCCTA TAGTGATGAT
GACGACGACT CAGAAAGTTT TGGTAAGCAA GTTAGAGAAG TAGAAGCAAA AGTTGGTAGT
TGGGCAATAG TCACAGCCAG TATTGCTTTA TTCGCAGCCA CTTTCTTAGC AGTAAAATTG
AGCTTGATCG CCAATACAAC CTTAGGGGTA ATTAGTGGTG TGGTCATTTG GTCTACCTAC
TTCTTTTTAA TCATTTGGTT GGGTTCTTCC GCCTTAGGTT CCTTTCTAGG TTCAATTATC
AGCACAGTTA GCTCAGGGTT ACAAGCTCTA ACAGGGACGG CTACTAGCGC TATTGGTGCA
GCAACAACTA GAAGCCAAAT GGTATCCACA GCAGAAGATA TTACAGCCGC AGTCCGGCGC
GAATTAACCG CAGGTTTTGA CCAGGATGCG ATTAAAAATA CCCTGCAAAG TTCTTTATCG
GCACTACAAT TACCACCACT AAACTTAAAC GAAATCCGCA GTCAGTTTGA CAAAATATTA
GCTGATGTAG ATTGGCAATC CCTCGGTGAT AGTGATTTAC TACAAAATGT GAATCGCCAA
ACATTTGTCG ATTTAATTAG CGATCGCACC AATTTATCCC CAGCTAATAT CAACCAAATT
GCTGACCAAT TACAAGCAGC TTGGCAGCAA GTTTCCCATC GCAGAAATCC CACAGAACAA
GTAATTAATT TACTGCAATC TGCCACACCA GAAGAGTTAA AGTCTGAAGA TTTGGGTGAA
CGTTTGCAAC AACTAGTGAC ATCTAGGAAA AACGGTAATG GTAGAAATGG GATGATGCAG
CAAGCGGTGA AATATGGTAT CAGCGCCGCC GTACCAGCAG TACTAGATCG GGTAGATGTT
TCTGATATCG ATGTAGACAG AATTACCAAT CAACTGCAAC AGTTACGAGA TAAAGTTCAA
GATATCGATG TCGAGAGAAT CACCAAACAA TTGCAACAGA TTCGAGAGAA AACTACTGAG
CAAATTAGTC ACAGATTCTC ACCCCCAAGT GATAACACGA TCAAAACAGA CGTAGAAGAC
TATATCCGCA ATTCTTTTCC TTGGCACTTC AACCGCCTCA CTATTAGAAA TGAATTTCCG
GATGTCATCT ACGACCCACA GGCAGATCCT ACAAATATCC GCCAGCAAAT TGAACAATTA
AACTCAAATG ATTTTACTAA CTGGCTGACA CAACGAGGCG ACCTGACGGA AGCCAAGGTG
AAAGAAATTG CTCAAGACAT GGAGAGTATC CGTCAGCAAG TTTTAGAAAC AGTCCAGCAA
TCGGAGAAAG TCGAAAAAGG TAGAGAAATC CGCAGCCGTA TCGAAAATTA TCTCCGTGCT
ACTGGTAAAC CAGAACTGAA TTCTGAGGCA ATTAACCGAG ATTTTGGTAA TTTGTTAACA
GAAGCAGGAC AGGAGTTTGC AGATATCAAT ACTCGCTTGC AAGAATTTGA CCGAGAAGCC
TTTGTACAAG TATTGTTGCA ACGCCAAGAT TTCAGTGAAG CAGAGGCTAA TAATATTGTT
AGTCAACTCG AAGGCATCCG CGATAATTTT CTCAATCAAG CCAGAGAAAC CCAAGAACAA
GCAACCACCA AAGCCAATGA ACTATGGCAA AAGGTAGAGG AATATCTGCG TCACACCAAG
AAAGAAGAGT TAAATCCTGA CGCAATTAAG CGGGATTTGC GAGTGTTATT AGAAGACCCC
CAAGTGGGAA TTAATCTTGT ACGATCGCGC TTATCTCAAT TTGACCGTGA TACCTTAGTA
CAGTTGCTCA ATCAGCGTCA AGACTTAAGT GAGGCGCAAA TTAATCAAAT TATCGATCAG
GTAGAAGCTG TTAGAGATAG CGTTCTACAA GCACCCCAAG CCGTGGTAGA CCAAACTAAA
GCACAATACG AAAAAACTAC CACAGCGATC GCCAACTATC TCCGCAATAC TAACCTAGAA
GAACTCGACC CTGAAGGTAT TAGGCAAGAT TTGGCAACTT TACTCACTGA CCCGAAAGAA
GGTGCGGTGG CGTTGAGACA TCGGTTATCT CAAATTGACA GAGAAACCTT AGTCAAAATC
CTCAGCCAAC AGCAAAACTT GAGCGAAGAC CAAGTTAATG GGATTATTGA TCAACTACAA
GACGCTATTA GGGATATTAT CAGAGCGCCC CGCCGTTTTG CTAAACGTGC TACTCAAAGG
GTTGTAGACT TTGAAGCCGG TCTAGAAGAT TATTTACGTC AAACCAACAA AGAGGAGCTG
AACCCAGAAG GAATCAAAAG CGATTTACAA TTACTCTTGC GAGATCCACG CTTGGGAATT
GGTAGTTTAG GCGATCGCGT TTCTAAGTTC GACCATGCAA CAATTGTGGC TTTACTCTCC
CAACGGGAAG ACATCTCAGA GGAAGAAGCC AACCGCATCG CCGATCAAAT TGAGTCAGTC
CGCAACACCA TTACCGAGCA AGTGCAGCAG ATTCAGCATC AGTTACAGTC AGCAATTGAC
CAAGTCTTTG ACAAGATTCG CAACTATCTC AACTCCCTGG ATCGTCCAGA ACTCAACTAT
GAAGGTATCC GTCAGGACTT TGCTAAATTA TTTGATGACC CACAAGCTGG ATTTGAAGCC
TTACGCGATC GCCTCAGTCA ATTTGACCGT GATACCCTAG TTGCAGTTCT CAGTTCCCGT
GAGGATATCT CCAGGGAAGA TGTTAACCGC ATCATCGACC AAATCGAAGC CGCACGGGAT
AGCGTCCTAC ATCGCGCCGG ACGCATCCAA CAAGAAGCAC AAAAACGCAT CAAAGCCGTG
AGACAGCAAG CTAGACAACA AGTAGAAGAA ACCAGAAAAA CCGTTGCTAG CGCCGCCTGG
TGGCTATTTG GTACAGCTTT CTCTTCTCTA GTCGCTAGTG CGATCGCTGG TGCATTAGCC
GTTATGAATT AG
 
Protein sequence
MFYNLDSFLI YSWFSLGIAG TNKLPEVIAP TAVPISFSGT KILVALLAGT LMAIAFHLLL 
TNLSVAVGIS TYGRNPYSDD DDDSESFGKQ VREVEAKVGS WAIVTASIAL FAATFLAVKL
SLIANTTLGV ISGVVIWSTY FFLIIWLGSS ALGSFLGSII STVSSGLQAL TGTATSAIGA
ATTRSQMVST AEDITAAVRR ELTAGFDQDA IKNTLQSSLS ALQLPPLNLN EIRSQFDKIL
ADVDWQSLGD SDLLQNVNRQ TFVDLISDRT NLSPANINQI ADQLQAAWQQ VSHRRNPTEQ
VINLLQSATP EELKSEDLGE RLQQLVTSRK NGNGRNGMMQ QAVKYGISAA VPAVLDRVDV
SDIDVDRITN QLQQLRDKVQ DIDVERITKQ LQQIREKTTE QISHRFSPPS DNTIKTDVED
YIRNSFPWHF NRLTIRNEFP DVIYDPQADP TNIRQQIEQL NSNDFTNWLT QRGDLTEAKV
KEIAQDMESI RQQVLETVQQ SEKVEKGREI RSRIENYLRA TGKPELNSEA INRDFGNLLT
EAGQEFADIN TRLQEFDREA FVQVLLQRQD FSEAEANNIV SQLEGIRDNF LNQARETQEQ
ATTKANELWQ KVEEYLRHTK KEELNPDAIK RDLRVLLEDP QVGINLVRSR LSQFDRDTLV
QLLNQRQDLS EAQINQIIDQ VEAVRDSVLQ APQAVVDQTK AQYEKTTTAI ANYLRNTNLE
ELDPEGIRQD LATLLTDPKE GAVALRHRLS QIDRETLVKI LSQQQNLSED QVNGIIDQLQ
DAIRDIIRAP RRFAKRATQR VVDFEAGLED YLRQTNKEEL NPEGIKSDLQ LLLRDPRLGI
GSLGDRVSKF DHATIVALLS QREDISEEEA NRIADQIESV RNTITEQVQQ IQHQLQSAID
QVFDKIRNYL NSLDRPELNY EGIRQDFAKL FDDPQAGFEA LRDRLSQFDR DTLVAVLSSR
EDISREDVNR IIDQIEAARD SVLHRAGRIQ QEAQKRIKAV RQQARQQVEE TRKTVASAAW
WLFGTAFSSL VASAIAGALA VMN