Gene Ava_0441 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_0441 
Symbol 
ID3682602 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp563563 
End bp566454 
Gene Length2892 bp 
Protein Length963 aa 
Translation table11 
GC content45% 
IMG OID637715770 
ProductPEP-utilising enzyme, mobile region 
Protein accessionYP_320962 
Protein GI75906666 
COG category[G] Carbohydrate transport and metabolism
[S] Function unknown 
COG ID[COG0344] Predicted membrane protein
[COG0574] Phosphoenolpyruvate synthase/pyruvate phosphate dikinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.299176 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAGAAC TTTGGGGTGC CTTAGTTATA TTAATTGTCT GTCCCTTCTT GGGCGCGTTA 
CCTGTAATTG CTTGGATTAC TTACGCGCTC AAGAAGAGAC GTTTAGCTCA AATAGGTACA
AGAAATATCA GTGTCTCCGC AGCTTTTTAC CACGGTGGCA CAATTGCTGG GATTTTAGCG
GTTGTATCAG AAGCCCTTAA AGGAGTCGCT GCAATTTATC TTGCTCGTGC TTTCTTCCCT
GAGGGGTCAT TTTGGGAATT GCTTTCCCTG ATAGCTTTGG TACTCGGTAG GTACTTTATG
GGCAGAGGGG CGGGGACAAC CAACGTAGTT TGGGGATTAT TAGTACATGA TCCCCTGCTA
ACAATTTTTG TGAGCCTGTT GGCAATTATC AGCTTCACCC TGTTGCAGTC GAGAAATGTG
GTAAAGTACG GGGTCTTATT TGTGTTTCCT TTATTTGTGG TGCTTCTCCA CGCCGAAGAC
TTTCCTAAAA TTATTAGTGC TGTAGCACTA GCGGGATTGT TATGGTGGAT TTATAAGAAA
ATTCCTGACG ACTTGGATTT GTCTTCCCAA GAGGTAGATG CAGAGTCACA AGGCGCATTT
GAATATTTAC AGGGCAATGA TGTCATCCTC AGTTTAGATG ATGAGTTAGA TCCGGCGATC
GTGGGACACA AAGCCGCTAC TTTATCTCAA ATTAAGCGCT GGGGTTATCA AGTACCAAAG
GGTTGGGTAC TCACCCCTGG AGACGATCCA GAAAAGTTAT TAGAATTCCT CCAACCTTCC
GAATTATCAC CCATAGTTGT CCGTTCTTCC GCCATTGGGG AAGACTCAGA ACAGGCTTCG
GCGGCTGGGC AATATTTAAC AGTGATCCAA GTTGCCAGTT ATCAGCAACT ACAACAAGCC
ATTACAGAAG TCAGAGAATC ATATAATTAT TCACCGGCTG TGCAGTATCG GCGCGATCGC
GGTTTACCCG ACACAGCCAT GTCAGTCCTG ATTCAACAAC AAGTCCAAAG CGCCTATTCT
GGGGTAGCTT TTAGCCGTGA TCCTATTACC CAGCAAGGTG ATGCGGTGAT TATCGAAGCC
CTACCCGGTA GCCCTACTCA AGTTGTTTCC GGCAAAGTCA CACCAGAACA ATATCGGGCT
TTTGTGCTGG AGGCCGATAA TTTGTCTTCG GTGAAACTAG AAGGTACCGG AAGAGTACCC
CAGGCATTAA TTAAACAAGT GGCTTACTTA GCTCGTCGGC TGGAAAAGCG TTATCTGGGA
GTACCTCAAG ATATCGAGTG GAGTTACGAC GGTCAAACCC TGTGGTTATT GCAAGCAAGA
CCAATCACCA CCTTATTACC CATTTGGACA AGGAAAATCG CGGCGGAAGT GATTCCAGGT
GTGGTGCATC CCTTAACTTG GTCGATTAAT CGTCCCTTAA CTTGTAGCGT TTGGGGTGAT
ATTTTTACGA TAGTGTTAGG CGATCGCTCT ACAGGATTGG ATTTTACAGA AACGGCAACC
CTGCACTACT CTAGAGCCTA CTTTAACGCC TCTCTTCTAG GAGAAATTTT CCTCAGGATG
GGATTACCGC CAGAAAGTCT AGAGTTTTTA ACGAGGGGTG CAAAAATCAG TAAACCGCCG
TTGCAGTCCA CCTTACAAAA TCTGCCGGGA TTATTCAAGT TACTGAAACA AGAACTCAAT
TTAGAGAAAG ACTTTAAACA AGATTACCAA AAGGTATTTA TTCCGGGGTT ATCTCAATTA
GCCAATGTTT CCCTAGAGGA ACAAGAGATA GGAGAACTGC TAGCCGGGAT TGATTTCAAC
CTAGAATTGA TGCGCCGTGG CACTTATTAC AGCATTTTAG CTCCCCTGAG TGCCGCTATC
AGACAGGGAG TTTTTCGGGT GAAAGATGAG CAAATTGATA ACAGCGTCAC CCCAGAAGTA
GCCGCTTTAC GCTCACTCAG AGCTTTAGCT GTAGATGCCA AACAGATATT ACCAGAGTGT
GAACCTGAGC AAGTCTTCGA TACATTGGCG CAAGTCCCAG GGGGAGAAAA AATCCTCTAT
GAATTTAACG AATTATTGGA AGATTACGGT TATTTGAGTG ATGTCGGCAC AAATATCGCT
GTCCCCACTT GGAAAGAAGA CCCCCAACCC ATCAAACAGT TATTTGTCCA GTTAATTCAA
CTCAGTGAGC CAGAAAAAGC CGAATTAGAA GCCAAAAAAG TTGTCGCCCC GAAACGCAAA
CGGGGGACTG TACAACGACG AGTAGATATT AAAGGGCGAG TCACCGAGCT TTATTCGCGC
CTATTAGCCG AATTACGGTG GAGATTCGTG GCTTTAGAAA AAATTCTGCT GAAATCAGGA
GTACTCAAGC AAGTAGGGGA TATCTTCTTT TTAGAACTCG ATGAATTACG AGATTTATTA
GCAGATACCA ATAATGAGTT AAGAGTTAGC TTAAACGAAC TAATCCAATT TAGGCGATCG
CAATTCCACC AAGACAGTCA AATTGAACAA GTCCCCCTGG TAGTCTACGG TAATATACCC
CCCCATCCTT CAGAAACCAC AGACGTATAC TCTGACCAAA TATTACAAGG TATTGCCGCC
AGCCACGGAC AAGCCGAAGG CAGAATCAAA GTGGTGCGAA ACTTACAGAA CTTACCAGAC
ATCGATAAAG ATACAATACT AGTAGTACCC TATACAGATT CCGGCTGGGC CCCTCTCTTA
GTCAGAGCCG GAGGATTAGT TGCAGAAGCC GGCGGTAGAC TTTCCCACGG GGCGATCGTC
GCACGAGAAT ACGGTATACC TGCGGTGATG GATGTTAAAG GCGCAACCTG GATTCTGCAA
GATGGTCAAC GAGTCAGAAT CGACGGGTCT AGGGGGATTG TGGAACTATC CAACGATTTA
CGACCAGAAT GA
 
Protein sequence
MRELWGALVI LIVCPFLGAL PVIAWITYAL KKRRLAQIGT RNISVSAAFY HGGTIAGILA 
VVSEALKGVA AIYLARAFFP EGSFWELLSL IALVLGRYFM GRGAGTTNVV WGLLVHDPLL
TIFVSLLAII SFTLLQSRNV VKYGVLFVFP LFVVLLHAED FPKIISAVAL AGLLWWIYKK
IPDDLDLSSQ EVDAESQGAF EYLQGNDVIL SLDDELDPAI VGHKAATLSQ IKRWGYQVPK
GWVLTPGDDP EKLLEFLQPS ELSPIVVRSS AIGEDSEQAS AAGQYLTVIQ VASYQQLQQA
ITEVRESYNY SPAVQYRRDR GLPDTAMSVL IQQQVQSAYS GVAFSRDPIT QQGDAVIIEA
LPGSPTQVVS GKVTPEQYRA FVLEADNLSS VKLEGTGRVP QALIKQVAYL ARRLEKRYLG
VPQDIEWSYD GQTLWLLQAR PITTLLPIWT RKIAAEVIPG VVHPLTWSIN RPLTCSVWGD
IFTIVLGDRS TGLDFTETAT LHYSRAYFNA SLLGEIFLRM GLPPESLEFL TRGAKISKPP
LQSTLQNLPG LFKLLKQELN LEKDFKQDYQ KVFIPGLSQL ANVSLEEQEI GELLAGIDFN
LELMRRGTYY SILAPLSAAI RQGVFRVKDE QIDNSVTPEV AALRSLRALA VDAKQILPEC
EPEQVFDTLA QVPGGEKILY EFNELLEDYG YLSDVGTNIA VPTWKEDPQP IKQLFVQLIQ
LSEPEKAELE AKKVVAPKRK RGTVQRRVDI KGRVTELYSR LLAELRWRFV ALEKILLKSG
VLKQVGDIFF LELDELRDLL ADTNNELRVS LNELIQFRRS QFHQDSQIEQ VPLVVYGNIP
PHPSETTDVY SDQILQGIAA SHGQAEGRIK VVRNLQNLPD IDKDTILVVP YTDSGWAPLL
VRAGGLVAEA GGRLSHGAIV AREYGIPAVM DVKGATWILQ DGQRVRIDGS RGIVELSNDL
RPE