Gene Ava_4727 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_4727 
Symbol 
ID3679707 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp5916564 
End bp5919647 
Gene Length3084 bp 
Protein Length1027 aa 
Translation table11 
GC content44% 
IMG OID637720083 
Productglycerophosphoryl diester phosphodiesterase 
Protein accessionYP_325219 
Protein GI75910923 
COG category[C] Energy production and conversion
[S] Function unknown 
COG ID[COG0584] Glycerophosphoryl diester phosphodiesterase
[COG4222] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGTAAATG TTCAGTTAGT GGGCTTTGCT TCTCTCGCGG CTGATACCTA TGCTGATGGG 
CCAGCTTCTG GTAATGGTAT TTCCGGAAAC GGTAGGACAG GCCCTTTCCC TGGACAGCCA
GTGCAGGGCT TTAGTGCAGT ACAATTCGCC AATAGTGGCT CATTTTACTT CTTGTCCGAT
AATGGCTACG GTAGTAAAGA TAATAGTGCT GACTACCTAC TACGTATCTA TCGTTTAGAC
CCGAATTTTC GAGGCGTAGA GAATGGTGAT GGCAGCGTTA AGATTTTAGA CTACATCCAA
CTATCTGACC CCAATAACAA AATTCCTTTC CAGATAGTGA ATGAGGCTAG TGCTGACAGA
TTACTGACAG GCGCAGATTT AGACATTGAG TCTTTTGTCA TTGATAAAGA TGGTTCTATC
TGGGTAGGTG ACGAATTTGG CCCTTACTTA CTACATTTTG ATGCTACGGG TAAATTGTTA
GATGCACCCA TCGCCACACC AGACCGCTTC AAGACTTTAG ATGGTACAGC ACCCGAAGTT
ATCGGACACC GGGGTGCAAG CGGCTTTCGT CCAGAACATA CCCTAGAGTC CTACAAGCTG
GCTATTGAAC AAGGTGCAGA CTTTATCGAA CCAGACTTGG CTGTGACAAA AGATGGTGTG
TTAATTGCTC GCCATGAACC TGCTTTAGCG GTGTTGAACG CTGATGGTAG CGTTAATTTC
AGCAATACAA CCACGAACGT TTACCAAATT GCCAAGTTTA GCGATCGCCT AAAAACGGTA
AATTTAGATG GAACTGAAAT TACTGGTTGG TTTGCTGAAG ATTTTACTTT AGCGGAAATC
AAGGAATTAC GAGCCATAGA GCGTCTACCT TTCCGTGACC AGTCATTCAA CGGTCAATTT
ACCATTCCTA CCCTGACAGA AATTATTGAC CTGGTGAAAC AGGTAGAAGC CGAGACTGGG
AAAAAGATTG GTATTTATCC AGAAACCAAA CATCCTACTT ATTTTGCTCA AGAAGCTACC
TATGTAGGGA CAACAGAGAA GATTAACCGC AATATCAGTC AGATTCTCAT CGATACCCTC
CAGGCTAATA ACTTTACTGA TCCTAGCCGG ATTTTCATCC AGTCCTTTGA AGTTGGCAAT
CTTAAAGAGC TGCATGATGT GATTATGCCT CGTGCTGGGG TCGATATTCC CTTGGTACAA
CTTTTTGATG CCATTGACGT TGATATTGAT GGTAGGCTCA TAGAAACCAG ACCCTATGAC
TTTATCGTTA GTGGTGACAC TCGCACCTAC GGCGATTTAC GCACTCCAGC AGGCTTGGCG
GAAATTGCTG AATATGCTGA TGGTATCGGC CCCTGGAAGC GGATGATTGT TAGTGTCAGA
GGTACTGATG CTAATAATGA TGGACAAGCA GATGATGTCA ATGGAGATGG CGCAGTTAAT
GATGCTGATA AGACTCTGTT ACCGCCAACT ACCTTAGTTC AAGATGCTCA CAATGTTGGT
TTGCAGGTTC ACCCATACAC CTTCCGTGAT GAAGAACGTT ACTTAGCAGC GAATTATCAA
GGAAATCCAG AACTAGAGTA TCAGCAGTTA TTTCAATTAG GGGTAGATGC TTTATTTACC
GACTTCCCCA TTACAGCAGA TAGAGTCCGT GACCTATTGA GTCTCCCTGG AAACAATATA
GTCCGTTCTC CCCAAAACCC TGATGTGCTT TCAGGAGATG CTTTAGCAAA TTTGGGTGGT
TCTAGAGGTT TTGAAGGTGG CGCAATTAAC GCCAGCAAAA CCAAGCTCTA TATGCTTCTG
GAAGGAACAG TCCAAGGTGA TCCTGTAGGT GCATTACGGC TGAATGAATT TGACCTAGCA
ACCCGTAGCT ATACAAATAA TTTACGCTAT TACAGGCTGG AAAATCCTGC TCATGCGATC
GGAGAAATCA CCGTCATTAA TGACAATGAG TACCTAGTCA TTGAACGAGA TGGAGGTCAA
GGCGCTTCAG CTAGATTCAA GAAGATTTAC AAAATAAACC TGTCTCAAAC AGATGCTAAT
GGCTTTGTAG CTAAACAAGA AATTGCCGAT TTATTAAATA TCCAAGACCC CAACGACCTG
AATAGAGACG GTAAAACAAC CTTTGACTTC CCTTTTACCA CCATCGAATC TGTTGTAGTT
GTTGACGCAA ACACCATTTT GGTCGCCAAT GACAACAACT ATCCATTTTC CGTCGGTCGC
CCTCCAGCTA TAGATAATAA CGAAATTATC TTATTGAGGT TAGAGCAACC CCTCAATCTT
GCACCTGGTT TGGGACAGCC ACAAGCTACA GAAATTAAGT TCGGCTCTCC TAGTAGCGAT
GAGATTACGG CGGAACCCGG TCGAATATTA TTCACAGGTG ATGGCGCAGA TACAGTAGAT
TCCCCTGGGA ATAATACTAT CTCCACAGGC AACGGAAATG ATACGGTATT TGTGGGCAGT
GATGCTTCTG TCTCTACTGG CAATGGTAAT GATCAAGTCT TCATTGGTGT GAATGGCCCC
ACCAGCAACA CTACAGCTAA CGGTGGTAAT GGTAATGACG AAATCACCGT GATTGAAGCA
GGGGGAAGTA ATAACCTTTT TGGTGCAGCA GGTAATGACA CTCTGCAAGT CATCGAAGGT
TCCCGTCAAT TCGCCTTTGG TGGTTCTGGT AACGACACCC TCACAAGTAA CGGTAGTTAT
AACCGTCTCA ATGGTGGTTC GGGAGATGAC AAATTATTCT CCAATGTGAA TGACTCTTTG
TTTGGCGGCG ATGGCGATGA TGTGCTATTT GCAGGTCAAG CCGGTAGTAA CCGTCTGAGT
GGTGGCGCTG GTACTGACCA GTTCTGGATT GCTAATGGTA GTTTACCAAC TAGCAAGAAT
ACGGTGACAG ACTTCGCTGT CGGTGTTGAC AAAATTGGAT TGGGGGGAAT TGGTGTCACG
CAATTTAGTG CTTTGAGTTT GGTAAAGCAA GGCGCTGATA CTTTGGTGAA GTTAGGCGCG
ACTGACTTAG TTGCATTACA AGGAATTACG TCAACTAGTC TGACTGTGAC TGACTTTGTT
TTTGCTGTCA GTGTGGTTGG TTAG
 
Protein sequence
MVNVQLVGFA SLAADTYADG PASGNGISGN GRTGPFPGQP VQGFSAVQFA NSGSFYFLSD 
NGYGSKDNSA DYLLRIYRLD PNFRGVENGD GSVKILDYIQ LSDPNNKIPF QIVNEASADR
LLTGADLDIE SFVIDKDGSI WVGDEFGPYL LHFDATGKLL DAPIATPDRF KTLDGTAPEV
IGHRGASGFR PEHTLESYKL AIEQGADFIE PDLAVTKDGV LIARHEPALA VLNADGSVNF
SNTTTNVYQI AKFSDRLKTV NLDGTEITGW FAEDFTLAEI KELRAIERLP FRDQSFNGQF
TIPTLTEIID LVKQVEAETG KKIGIYPETK HPTYFAQEAT YVGTTEKINR NISQILIDTL
QANNFTDPSR IFIQSFEVGN LKELHDVIMP RAGVDIPLVQ LFDAIDVDID GRLIETRPYD
FIVSGDTRTY GDLRTPAGLA EIAEYADGIG PWKRMIVSVR GTDANNDGQA DDVNGDGAVN
DADKTLLPPT TLVQDAHNVG LQVHPYTFRD EERYLAANYQ GNPELEYQQL FQLGVDALFT
DFPITADRVR DLLSLPGNNI VRSPQNPDVL SGDALANLGG SRGFEGGAIN ASKTKLYMLL
EGTVQGDPVG ALRLNEFDLA TRSYTNNLRY YRLENPAHAI GEITVINDNE YLVIERDGGQ
GASARFKKIY KINLSQTDAN GFVAKQEIAD LLNIQDPNDL NRDGKTTFDF PFTTIESVVV
VDANTILVAN DNNYPFSVGR PPAIDNNEII LLRLEQPLNL APGLGQPQAT EIKFGSPSSD
EITAEPGRIL FTGDGADTVD SPGNNTISTG NGNDTVFVGS DASVSTGNGN DQVFIGVNGP
TSNTTANGGN GNDEITVIEA GGSNNLFGAA GNDTLQVIEG SRQFAFGGSG NDTLTSNGSY
NRLNGGSGDD KLFSNVNDSL FGGDGDDVLF AGQAGSNRLS GGAGTDQFWI ANGSLPTSKN
TVTDFAVGVD KIGLGGIGVT QFSALSLVKQ GADTLVKLGA TDLVALQGIT STSLTVTDFV
FAVSVVG