Gene Ava_2039 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_2039 
Symbol 
ID3680980 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp2521347 
End bp2524355 
Gene Length3009 bp 
Protein Length1002 aa 
Translation table11 
GC content43% 
IMG OID637717384 
Producthypothetical protein 
Protein accessionYP_322556 
Protein GI75908260 
COG category[C] Energy production and conversion 
COG ID[COG3202] ATP/ADP translocase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.373695 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAATTGA AAAATTACTC GTTGGCGGCA AATCCAAGTT GGCTGGCACA ACTTTTAAAA 
TGGGTGAATC TACGACCAGA GGAAGGCGAA CGGACTTGGA TGATGTTTGC CTTTTACACG
ACTGTATCTG TGGGGTTGCG ATGGGCTGAG GATAGTACAG TAGCACTATT TTTGGATGAA
TATGGCGCTG GGCCGTTGCC TTGGATGTAT ATTGCCAGTG CGGTTATGGG TATGGCGCTG
GTTTTTGTCT ATTCTTGGTT GCAAAAGATT TTTCCTTTGC GCTGGGTGGT GGTGGCGATC
GCACCTTGTA TGGTCGTGCC ATTAATCTTG TTAGTATTAT TACGCTGGGG CATCCATCTT
TCTTACTTTT CAGTCATTGT TGTATTTTTA CTACGGTTGT GGGTAGATTC CCTTTATGTA
GTTAATGATC TAAATACTTC CATTGTTGCT AACCAATTAT TTAATATTCG AGAAATTAAG
CGCACTTACC CACTGATTAG TAGCGGATTA TTAGTGGCTG ATGTGATCAG TGGCTTTAGT
TTGCCTTGGT TGCTGGAGTT CGCCAAACTC AACCGCGTGA TTATGATCGC CTGTGGAGTG
ATTATGATTG GCTCGGCAAT TCTCTGCTAT TTAAGTTATC AGTATCGCAG TTCTTTTCCC
GAAGCACCAC AGCGCCTAAT TCCTCAAGAA CAAGCCTCTC GGCATCGGCG TATCCAAGCA
CCACTCAAGC GTTATACTTT ACAGCTATTT GCCTTTGTTG GGCTGTTACA AATTATTGGT
TTATTAGTTG ATTTTCAATA TCTCCAAGAA CTAAAAATCA ACTTAGGCGA TCGCGAACTA
GCAGGCTTTT TAGGTATATT TGGTGGGATT GTTGGCTTGT GTGAATTAGG AACTCAGTGG
TTCGTTTCCA GCCGCTTGAT CGAACGCTTT GGAGTGTTCT TTACTGCTGC ACTTTTGCCT
GTGGCTGTAG GCTTTGTCGT TCCTGGGATG ATTGTTGTAC TTTACTTACT ACCAGGAATA
CAATCACTGG CATTTTTCTG GGGATTAGTC GGGTTGAAGT TCTTTGATGA ACTGCTACGC
TACACCTTCG TTATTAGTAG TGGGCCGTCA CTATATCAAC CGATACCAGA ACGCATTCGT
AGCCGGATGC AGGCATTATC AGGAGGAACG GCGGAAGCGA TCGCCACTGG TACTGCTGGT
ATCATCATTG TGATTACCTT GTTTGTGTGT GGGTTATTTG TACCTGCAAC AATGCAAAAG
TGGGTGTTTA TCTGCGAAAC AATGGTAGTA GCCGTTGCCT GCTTGAAGGT AGTATGGTTA
TTGCGATCGC GTTATGTTGA TTTGTTAGTC TTGAGCGCCG ATAGAGGCGA ACTAAGTGCC
TCTAATGTGG GTTTACGTGC CTTTAAACAA GGGGTAGTCA AAGCTTTAGA AGAAAAGGGC
AACACCGCAG ATAAACACTC ATGTATTGAA TTATTAGCCC AAATTGACCC CCAAGGGGCA
GCAGATGTTC TCTCACCAAT TTTATTTAAG CTAACGCCAG ATTTACAACG CCATAGTTTA
GAGGTGATGC TAGGGGCTGA TGCTAATCCC ACCCATGTAT CGGAAGTAAA ATGGTTGTTA
GAGCGTCATC CAGACAATGT TAATCCCGAA GTTTTTGCCC TAGCATTACG TTATGTTTGG
CTAGCTGACC CCAATCCCAA TTTAAGCCAA CTGGAAGAAT ACCTCAACCA GCGCCACCAC
TCATTAACTC GCGCCACAGC TGCGGCTTTG CTATTGCGTC AGGGAACACC AATCCAAAAA
GTAGCCGCCA CCAAGACTTT AAGCCGGATG CTAACCCATA AGCAGGAACG AGAACGGGTT
AATGGTGTCA AAGCCCTCCG AGAAGTGGTT TATTTACAGA CGTTACGGAT TCATATTCCT
AATTTGTTAC AGGATGAATC CTTAAGGGTG CGTTGTGCTG TCTTAGAAAT GATTGCTGCA
ACACGCATGG AAGAATACTA TTCCTCACTG CTGACAGCAC TTTATTACAA ATCAACCCGT
GCTACCGCCA TGCGTGCCTT AATCAAAATG GAGAATGAAG CATTGGATAT GCTGTTACAA
CTGGCTACCA ATGCCCATAA GCCAGAAGTA TTAAGGATGT ATGCTTGGCG TACCATAGGA
CAAATTTCTA CTACAGAAGC TGTAGAGACT TTATGGTTGC AGTTGGAATC TTCTTGGGGT
GCTACCAGGT ACCATATTCT GCGTAGCTTG TTGAAAATTC AGAAACAATC AGAAATTACC
AATTTAGTAG ATAGGTTTCA ACACAGTCGG GTAGAAAGTC TCATTGATCA GGAATTACGA
TTTTTGGGTG AGATTTATGC AGCTTATATA GACTTACGGC CACAATTAAA TTTAGATAAT
AATCAAACAA ATTCCAGAGC TTTGATTGTT GCGGATTTAC TCCAACGTGC CTTGGCAGAA
ATGGAATGGG ATATTCGGGA ACGTTTATTA TTATTATTAA AATTACTGTA TTCACCAGAT
AAAATGCAGG CAGCCGCTTT TAATTTGCGG TCTGAATCGA TGGTGAATTT AGCACGAGGA
TTGGAAATAT TAGAACATAC AGTAAATTTG CCCAGAAAAT CATTGTTGTT AAATCTCTTA
GATAAGCGAT CGCACCAAGA AAAGCTGCAT GATCTTTTAG AGGCAGAATT CACAGAATAT
CAACCGTTGT CAGCTAGTGA AAGATTGCGG AAGATGCTGA CTTTAGGCAC TTTTCTCTCT
GATTGGTGCT TGGCTTGCTG TTTTCATTTT GCCCAGGTTA ACCGGATTCG TTTGACAATT
AACGAAATTC TGATGGCTTT GCGTCATCCT ACGGGCTTTG TACGGGAAGC GGCGATCGCT
TATTTGAGTG TGGTTTCACA TCGCGTCCTC TTAGAACTTC TGCCCAAATT AAAAAAAGAT
TCTCATCCCC TAGTAGCAGC ACAAATTCAG GAATTGATCA AAAAATCCGC CATCAAAATT
AATCAATAA
 
Protein sequence
MELKNYSLAA NPSWLAQLLK WVNLRPEEGE RTWMMFAFYT TVSVGLRWAE DSTVALFLDE 
YGAGPLPWMY IASAVMGMAL VFVYSWLQKI FPLRWVVVAI APCMVVPLIL LVLLRWGIHL
SYFSVIVVFL LRLWVDSLYV VNDLNTSIVA NQLFNIREIK RTYPLISSGL LVADVISGFS
LPWLLEFAKL NRVIMIACGV IMIGSAILCY LSYQYRSSFP EAPQRLIPQE QASRHRRIQA
PLKRYTLQLF AFVGLLQIIG LLVDFQYLQE LKINLGDREL AGFLGIFGGI VGLCELGTQW
FVSSRLIERF GVFFTAALLP VAVGFVVPGM IVVLYLLPGI QSLAFFWGLV GLKFFDELLR
YTFVISSGPS LYQPIPERIR SRMQALSGGT AEAIATGTAG IIIVITLFVC GLFVPATMQK
WVFICETMVV AVACLKVVWL LRSRYVDLLV LSADRGELSA SNVGLRAFKQ GVVKALEEKG
NTADKHSCIE LLAQIDPQGA ADVLSPILFK LTPDLQRHSL EVMLGADANP THVSEVKWLL
ERHPDNVNPE VFALALRYVW LADPNPNLSQ LEEYLNQRHH SLTRATAAAL LLRQGTPIQK
VAATKTLSRM LTHKQERERV NGVKALREVV YLQTLRIHIP NLLQDESLRV RCAVLEMIAA
TRMEEYYSSL LTALYYKSTR ATAMRALIKM ENEALDMLLQ LATNAHKPEV LRMYAWRTIG
QISTTEAVET LWLQLESSWG ATRYHILRSL LKIQKQSEIT NLVDRFQHSR VESLIDQELR
FLGEIYAAYI DLRPQLNLDN NQTNSRALIV ADLLQRALAE MEWDIRERLL LLLKLLYSPD
KMQAAAFNLR SESMVNLARG LEILEHTVNL PRKSLLLNLL DKRSHQEKLH DLLEAEFTEY
QPLSASERLR KMLTLGTFLS DWCLACCFHF AQVNRIRLTI NEILMALRHP TGFVREAAIA
YLSVVSHRVL LELLPKLKKD SHPLVAAQIQ ELIKKSAIKI NQ