Gene Ava_4083 details


Gene Information

Locus tag: Ava_4083
Symbol:
ID: 3681606
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anabaena variabilis ATCC 29413
Kingdom: Bacteria
Replicon accession: NC_007413
Strand:
Start bp: 5072930
End bp: 5075845
Gene Length: 2916 bp
Protein Length: 971 aa
Translation table: 11
GC content: 48%
IMG OID: 637719434
Product: ATPase, E1-E2 type
Protein accession: YP_324582
Protein GI: 75910286
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0474] Cation transport ATPase
TIGRFAM ID: [TIGR01106] sodium or proton efflux -- potassium uptake antiporter, P-type ATPase, alpha subunit
    [TIGR01494] ATPase, P-type (transporting), HAD superfamily, subfamily IC
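
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the gene record above.
START_BP = 5072930
END_BP = 5075845
GENE_LENGTH_BP = 2916
PROTEIN_LENGTH_AA = 971


def span_length(start: int, end: int) -> int:
    """Feature length for inclusive, 1-based coordinates (assumed convention)."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print("span matches gene length:",
          span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print("protein length matches:",
          expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the record is internally consistent: the coordinate span equals the stated gene length, and 2916 / 3 = 972 codons, of which 971 encode residues.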


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.48353
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGTCATTGC ATCAACCAGT CTGGACATTA CCCATTGCAG CAGTTTATGA GTTATTGGGA 
ACTACTGAGA ACGGCTTAAC TGAATATGAG GCAACACAAA GCCTTGAGCG TTATGGTGCG
AATGAACTGC CAGAAACGCC CCAGCGCCCA ATGTGGCTTC GCTTCACGGA TCAACTCACT
CACTTTATGG CTCTATTGCT ATGGGTAGCA GGAATTTTGG CATTTATTTC CCGCACTCCA
GAACTGGGAT GGGCAATCTG GGCTGTGATC TGGATTAATG CCGTCTTTAG TTTTTGGCAA
GAGTTTCAGG CAGAACAAGC ATTATCAGCA CTTAAGAATG TGTTACCGAT GCAGGTAAAA
GTGTATCGAG ATGGTGAACT CAAGCAAATA CCAGCACGGG AACTGGTGCG TGGGGATATT
ATGCAATTGG AAGAAGGGGA TCACGTTTCG GCTGATGCGC GGTTGGTAAA ATCTGAAAGT
CTATACCTCG ATGTTTCTGT GCTGACTGGT GAATCTCTGC CTGTGGCTCG CAATGCTTAT
CCAGTAAGGG TGCGGGAAGT TGCCTCTATT CGGGGCGGTA AGACTCTACC AGCCGGTGAA
CAACCTTTAC AAGAACCAAC TAATTTGGCG GAAATTCCCA ATTTGGTGTT AGCAGGTTCA
ACAGTGTCAT CAGGACGAGG GGTAGCTATA GTCTATGCTA CAGGCGCACA AACAGAATTT
GGTCATGTAG CACATCTCAC AACCGTTGTG CAACGCGAAC CTAGCACTTT AGAAGTGCAA
GTGGCACAAG TGGTACGGGT GATCACAGCG ATCGCTTTGA CGATGGGAGT CTTGGTATTT
TTACTAACAT CTCTGTTAGT GGGGATGGAA GTGAAAGAGA GTTTTATTTT TGCGATCGGT
ATTATTGTAG CCTTAGTGCC GGAGGGGTTA TTGCCTACCG TCACATTATC ATTAGCAATT
GGAGTCCGGC GGATGGTACG GCGTAATGCT CTAGTGCGTC GGCTGTCGGC TGTGGAAACC
TTAAGTGCTA CTACGGTCAT CTGTACAGAC AAAACCGGCA CATTGACCAA AAACGAAATG
ACGGTGCATT ATCTCTGGAT TCCCTGGCAA CCCACTGGCA ATGAACTGCC GGAAACACCA
TTGGCTATCT CACCCACCCT GATTGAAGTC ACTGGGGCAG GATATGACCC TACAGTCGGG
AAAGTGCATA TGTCGGGGAA TTTTGCTGCC GCCTGGAAAG TTCATTTGTT ACTCACAGGC
GCAGCACTTT GCTCTAATGC CCGTCTGATT CACCTGACAG CCCCTAGCCG TTGGCAAGAG
ATTGGTGATC CTACAGAAGC AGCTTTGTTA GTCGCTGCTG CTAAGGCTGG GCTAAATTTG
GAAACCTTGC AAACACAACT GCCACGCTTG CGCGAAGTGC CATTTGATTC GCGGCGGCGA
ATGATGACGG TAATATTAGA TTGGCGTGCC TCAACGGTAT GGACTGGTGA CTTGCCGAAC
TTAGCTTTTA CTAAAGGTGC GCCGTTAGAA GTCTTACGCC ACTGCACATA TATCTTAAGA
AATGGCACAC TTGCAGACAT CAACCAAGAC GACTGGAATC AAGTGGTGGC AGCTAACGAT
AGTTTAGCCG CCCAAGGTTT TCGGGTGCTA GGAGTGGCGG CGCGGCGTGG TGGGAGTGAA
ATGTTAGATT GGCGATCGCA AGACCTAGAG CAGAATTTAA CCTTTATTGG TTTAGTGGCG
ATGTTTGATC CGCCCCGTCC AGAAGTAAGT GATGCGATCG CTGAATGCCA TGCTGCCGGA
ATCAAGGTAA GTATGGTGAC AGGTGATTAT GGTTTAACTG CGGAAGCGAT CGCCCGTCAA
ATTGGACTGG TCAACAACTC GGTACGGATA GTCACCGGTG AAGGCATGGG AAATTTATCT
GATGCACAAC TGCGGCAAAT TGTCAAATAT CGTTCTGGGT TGGTATTTGC GCGGATGTCT
CCGGAACACA AGTTACGATT GGTGCAAGCC TACAAAGATA TAGGTGATGT AGTTGCAGTC
ACAGGGGATG GAGTTAATGA TGCCCCAGCC TTAAGAGCCG CTCATATAGG AGTAGCGATG
GGGATGAATG GCACAGATGT AGCACGGGAA GCCGCCGATA TTGTGCTTAC AGATGATAAC
TTTGCTACCA TTGTGAGCGC GATCGAGCAG GGACGCACCG TGTATCAAAA CATCCGCAAA
TTTATGACTT ATATCTTGGC ATCGAATGTA GCGGAATTAG TGCCTTTTTT GCTAATGGTA
GCCCTGAAAG TTCCCCCCGC CTTGGTAATT ATGCAGATTC TCGCCATTGA TTTAGGCACT
GACTTAGTAC CGGCTTTAGC CTTGGGTGCA GAAAAAGCAG AAGTTGGTAC AATGCACCAA
CCACCTCGGA AAAAATCGCG ATCGCTTCTT GACCGTTCCC TGCTTTTACG CGCCTATTGT
TTTCTAGGTC TACTGGAAGC GATTCTGGGA ATGACAGCAT TCTTCCTCGT CTGGTGGAGT
TACGGATATA ACTTACAACA ATTACAAGCC GTCACACCCA GTATTTTATC GCACTCAGCC
AACGCCGCCA CCGTTGCCAT CTATACTCAA GCTACAACTA TGACCTTAGC TACCATTGTC
GCCTGTCAGG ATGGTAATGT TTTCGCTTGC CGTTCGGAGC GTACTTCTAT TTGGCGGCTA
GGATTATTTT CTAATCCTCT GATTTGGTTA GGAATTGCGA CTGAATGGAT GTTAGTCATA
CTGATTACCA ACAGCACATT CTTAAGTAGA TTTTTTTCCA CTGCACCCCT AGCACCTTGG
CAATGGTTGC TGTTGCTAGT ATGTCCGCCA ATTATTTTAG GCGCAGAAGA ACTAAGAAAA
GCTGCTTGGC GCAGAAATCT TAGACATAGA CGATAA
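
The gene sequence above is laid out in space-separated blocks of 10 bases, so it has to be joined before it can be analysed. The following is a minimal sketch of how the blocks might be cleaned up and how a GC fraction and the start and stop codons could be checked. The helper names are hypothetical. The stop codons used (TAA, TAG, TGA) are those of translation table 11, and checking only for an ATG start is a simplifying assumption, since table 11 also permits alternative start codons.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def has_atg_start_and_stop(seq: str) -> bool:
    """True if seq is a whole number of codons, starts with ATG, ends with a stop."""
    return len(seq) % 3 == 0 and seq.startswith("ATG") and seq[-3:] in STOP_CODONS


if __name__ == "__main__":
    # First and last lines of the gene sequence, copied from the record.
    first_line = "ATGTCATTGC ATCAACCAGT CTGGACATTA CCCATTGCAG CAGTTTATGA GTTATTGGGA"
    last_line = "GCTGCTTGGC GCAGAAATCT TAGACATAGA CGATAA"
    print(clean_sequence(first_line)[:3])   # start codon of the gene
    print(clean_sequence(last_line)[-3:])   # final codon of the gene
    print(round(gc_fraction(clean_sequence(first_line)), 2))
```

Passing the full cleaned gene sequence to `gc_fraction` would give a value to compare against the GC content reported in the record.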
 
Protein sequence
MSLHQPVWTL PIAAVYELLG TTENGLTEYE ATQSLERYGA NELPETPQRP MWLRFTDQLT 
HFMALLLWVA GILAFISRTP ELGWAIWAVI WINAVFSFWQ EFQAEQALSA LKNVLPMQVK
VYRDGELKQI PARELVRGDI MQLEEGDHVS ADARLVKSES LYLDVSVLTG ESLPVARNAY
PVRVREVASI RGGKTLPAGE QPLQEPTNLA EIPNLVLAGS TVSSGRGVAI VYATGAQTEF
GHVAHLTTVV QREPSTLEVQ VAQVVRVITA IALTMGVLVF LLTSLLVGME VKESFIFAIG
IIVALVPEGL LPTVTLSLAI GVRRMVRRNA LVRRLSAVET LSATTVICTD KTGTLTKNEM
TVHYLWIPWQ PTGNELPETP LAISPTLIEV TGAGYDPTVG KVHMSGNFAA AWKVHLLLTG
AALCSNARLI HLTAPSRWQE IGDPTEAALL VAAAKAGLNL ETLQTQLPRL REVPFDSRRR
MMTVILDWRA STVWTGDLPN LAFTKGAPLE VLRHCTYILR NGTLADINQD DWNQVVAAND
SLAAQGFRVL GVAARRGGSE MLDWRSQDLE QNLTFIGLVA MFDPPRPEVS DAIAECHAAG
IKVSMVTGDY GLTAEAIARQ IGLVNNSVRI VTGEGMGNLS DAQLRQIVKY RSGLVFARMS
PEHKLRLVQA YKDIGDVVAV TGDGVNDAPA LRAAHIGVAM GMNGTDVARE AADIVLTDDN
FATIVSAIEQ GRTVYQNIRK FMTYILASNV AELVPFLLMV ALKVPPALVI MQILAIDLGT
DLVPALALGA EKAEVGTMHQ PPRKKSRSLL DRSLLLRAYC FLGLLEAILG MTAFFLVWWS
YGYNLQQLQA VTPSILSHSA NAATVAIYTQ ATTMTLATIV ACQDGNVFAC RSERTSIWRL
GLFSNPLIWL GIATEWMLVI LITNSTFLSR FFSTAPLAPW QWLLLLVCPP IILGAEELRK
AAWRRNLRHR R