Gene Ava_1848 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_1848 
Symbol 
ID3681839 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp2300833 
End bp2304090 
Gene Length3258 bp 
Protein Length1085 aa 
Translation table11 
GC content38% 
IMG OID637717188 
Producthypothetical protein 
Protein accessionYP_322365 
Protein GI75908069 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.47429 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATCAAG TAAGTGCAAA TGAACTGCAA TATCGTGGGA TTAACCGCAC AATTTGTATT 
GGCTTAGGTG GTACTGGACG AGATGTTTTG ATGCGAATTA GACGGTTAAT TGTTGACCGT
TATGGAGATT TAAGCAATCT GCCAATTGTA AGTTTTGTTC ATCTAGATAC TGATAAAGCT
GCAACACAAG TGACTGGCAT TCGTACAGGA AGTACTTATC ATGGTGTTGA TCTCAGCTTT
CGAGAAGCCG AAAAAGTTAG CGCCACTATG TCCGCCAAGG AAGTAACGAT GTTTGTGGAA
GGACTAGAAA GGCGCTCAGA ATATACTCGT TACGGCCCCT ACGACCATAT TGCTAGATGG
TTTCCTCCCC AACTGTTGCG AAATATTAAA GCTGTGGAGG AAGGTGCAAA AGGAATTAGA
CCTGTAGGGA GACTAGCTTT TTTTCATAAT TATCAAAAGA TAAAAATAGC GATTGAAACC
GCAGAAAGAC TTAGTAGGGG ACATGATGCT TTATTGCTGA GAAAGGGGTT AAGAGTTGAA
CCAGGATTGA ATATTTTTGT GATTGGTTCT CTGTGTGGTG GTACAGGGAG CGGTATGTTT
TTGGATGTTG CTTATAGTCT TAGACATCTT TATGGTGAAC AAGGCGCTCA GATTGTCAGC
TATTTAGTGA TTAGTCCAGA ATTATATGGT AATACCCCTA ATATGAGTGC TAATACTTAT
GCTGCTTTGA AAGAGTTAAA TTACTACAGT ACTCCAGGGA CAAAATTTGC AGCCTGTTAT
GATATTGAAA ATCTAGAATT TCTACAAGAA AAGCGTCCGC CTTTTGACTA CACTTATTTA
GTTTCTCATC AGACAGGAGG CGAATATCAA ATTCTTGATC AAGGTAAGTT ATGTAATGTG
ATCGCTCACA AGATAGCTCT AGATTTTTCC GGTGAGTTAG CACCTGTAAT TAAAGGACAT
AGAGATAATT TTCTCCAACA TATAATTCAG TGGGATAAAC ATCCACGTCC TAATGGTCAG
AGGTATTTAA CATTTGGGTT AGCGGCGATT TATTTTCCCC GTGACACTAT CGTGGAAATT
GCCTTAATAA GGGTTAGTTT AGCATTAGTA AAGTTTTGGT TAAATGGCAA AGGTCAAAGT
CCAGATCCTC AGAAACTACT GGATCAATTT CTGATTCAAT CTCGTTGGCA TAATGACTTA
GCCAAAAAAG ACGGCTTAAC TACGAAAATA GCAGAATCAG TAGAGGATAC AAATAAAAAC
TTTAGTAGCA ATATTAGTAC CTGGAGAAGT AAATTAGAGC GATCAATTTC TGAATGTCAG
AATAAAGATG ATCGTAACGG TATTCGTCAA CAGTTACCAA GGGAGTTTCG AGAGCAATTT
CGGAAAGTGC AGCCGGGGGA AACAGAAAAT GTCCGAGGTA TTTGGCTGAC AAAATTGCTC
CAGTCTTCTC CAAATATCAC CAAGGAACTA AAGACTAATA TTGACGATTA TTTAATTCAG
TTACTCACGC CAAGTGAGCC TATTTTCTCT ATTAAAAGCA GTCGTGATTG GCTAGATGCT
TTACAACATG AACTACATAA CTATCAATTC AATCTGCAAG AAGCAATTAC CGATTTTGGT
GGGATGAAAC GCGCGGAGGA TATTGATAAA AAATGGCGAG ATGCCGAGCA AATGATTGAA
GATATTGAGC ATAAAATTGG TATTCCCATA ATTAATACTA AGAATAGCCA AGTGCAAGCT
GAAGTTAAAA GGGTAGTGCA AGAAGTCTGC AAACTCATTA AACATAACTT TGATTTTACC
GTCTTTCAAG AGGCTCTAAA AATAGTCAAT GAATTACAAA AACACGTTCA GGAAAGAGGG
AATCAAGTTA CTGCTTTTAG TAGAGTCATT GAAAATTTGC AAACTTTCTA TGAGAAGCAA
GATAGTGATT TAAGACAGTT AAACTTTGAT GAAATGAGTG GAGAAGCCAT ATTTGATAGT
GAAGATATTG ATCGCTGTTA TCAAACTATG TTGCCAGAAG ATGATCTTCG CAGACAATTG
GTATTAGCTA GCTCGGAAAT TACGGAACCT GCTGGAAGGG GACAATCTTT GGCAAGTTTT
ATAGATAGAG AAAGAACTAC GCCAGAACAG CTACAAACAG AAATTGACCT AAAGGTTGAC
AGTTTATTTG CTTCTCGCGT TACTAATATT GTCAACTCTG TGATTAAGCG TTTCATGCAA
AAATATCCTT TAGCAGCGCG TTCGACTCGG TTAGCGCAAG TTATGCAAGA AGCTGAACCT
CTGCTGAGGC TGAATTTAAG TGACCCTTAT TTCCGTGAAG ACCCGGCGAA AAGTAGTAAA
TTAATTGGGT TTAAGGATAA GGATGAATTG GAGGTACGAC AGTTTAAAAC TGTATTAGCA
CAAGATTTAG GTATTGAATC AAGTGTGATA AAAGCGACAC AATCTGAAGA TGAGATTTTA
ATTGTCAATG AGTATGCTGG TTTTCCTCTC AGGCTAATTA GTAGTCTGGA GAGGATGAGA
AACCCCTATC TACGTGAACA AAATTCTGCC ACATCTTTTC TGCATAACGA TTACCAAGTA
GCATTTCCAG ATATTATCCC CCCAGATGCG ATCGCAATGG AAAAACTGGA AGATGTCTTC
TATCCTTGTT TGGCCTTTAG GTTACTCAAG GAAAACCAAG AAAATCAACA ATTAGAATTT
CAATATTATG ATTCCTTGCG TGATAGTTAC AATACTGCTA CTTTGAGTCC AGAGTGGAGT
CAAGCCTTGG AAGAATTAGC TAACCGCAAC GACATGACTG AGGCTTTGCT ACAGCTTTTA
GAGCGAGAAA TTTCTGTAAT TTCTGGACAA CCAGAACTTT GGGAAAATCA GTATTTACCA
AAACTAAGGC AATTTGTGCA GGCAGTAGAT GATTTATCAG AAGATAGTCC CAATTATCCC
TACAAACTCG CAGTAGTAGG AACATCCGCC AGCACAGATC CTACAGTTAA AGAAGGAATT
ATTCATCGCT TTCGGAGAAA AATGAATGAG CGATTTAGCA TATCTCAAAG TCGCGCTTTT
GCACCAAATA ATAATACATC AATGCAAACA GCTATTGCTG GTGAAATAGT CGTGGATATG
CCTGTTGATA CTACTGATAA TAGAGTCAGG CGGCGCTTAG AATTAGAGCG GTTGAAACAA
GATTTAGATG AAGATTTTAT TACTCAAGAT GAATATGAGC GTGAAAAACA AAGGATTTTT
GCTCAATATC CCCTTTAG
 
Protein sequence
MNQVSANELQ YRGINRTICI GLGGTGRDVL MRIRRLIVDR YGDLSNLPIV SFVHLDTDKA 
ATQVTGIRTG STYHGVDLSF REAEKVSATM SAKEVTMFVE GLERRSEYTR YGPYDHIARW
FPPQLLRNIK AVEEGAKGIR PVGRLAFFHN YQKIKIAIET AERLSRGHDA LLLRKGLRVE
PGLNIFVIGS LCGGTGSGMF LDVAYSLRHL YGEQGAQIVS YLVISPELYG NTPNMSANTY
AALKELNYYS TPGTKFAACY DIENLEFLQE KRPPFDYTYL VSHQTGGEYQ ILDQGKLCNV
IAHKIALDFS GELAPVIKGH RDNFLQHIIQ WDKHPRPNGQ RYLTFGLAAI YFPRDTIVEI
ALIRVSLALV KFWLNGKGQS PDPQKLLDQF LIQSRWHNDL AKKDGLTTKI AESVEDTNKN
FSSNISTWRS KLERSISECQ NKDDRNGIRQ QLPREFREQF RKVQPGETEN VRGIWLTKLL
QSSPNITKEL KTNIDDYLIQ LLTPSEPIFS IKSSRDWLDA LQHELHNYQF NLQEAITDFG
GMKRAEDIDK KWRDAEQMIE DIEHKIGIPI INTKNSQVQA EVKRVVQEVC KLIKHNFDFT
VFQEALKIVN ELQKHVQERG NQVTAFSRVI ENLQTFYEKQ DSDLRQLNFD EMSGEAIFDS
EDIDRCYQTM LPEDDLRRQL VLASSEITEP AGRGQSLASF IDRERTTPEQ LQTEIDLKVD
SLFASRVTNI VNSVIKRFMQ KYPLAARSTR LAQVMQEAEP LLRLNLSDPY FREDPAKSSK
LIGFKDKDEL EVRQFKTVLA QDLGIESSVI KATQSEDEIL IVNEYAGFPL RLISSLERMR
NPYLREQNSA TSFLHNDYQV AFPDIIPPDA IAMEKLEDVF YPCLAFRLLK ENQENQQLEF
QYYDSLRDSY NTATLSPEWS QALEELANRN DMTEALLQLL EREISVISGQ PELWENQYLP
KLRQFVQAVD DLSEDSPNYP YKLAVVGTSA STDPTVKEGI IHRFRRKMNE RFSISQSRAF
APNNNTSMQT AIAGEIVVDM PVDTTDNRVR RRLELERLKQ DLDEDFITQD EYEREKQRIF
AQYPL