Gene Ava_0857 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_0857 
Symbol 
ID3681758 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp1048113 
End bp1051220 
Gene Length3108 bp 
Protein Length1035 aa 
Translation table11 
GC content46% 
IMG OID637716191 
Productputative phosphate transport system substrate-binding protein 
Protein accessionYP_321376 
Protein GI75907080 
COG category[S] Function unknown 
COG ID[COG3330] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.275371 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTGGCAAA AGAATAAACA AGATAGTGGC ATAGTGAATC TGGCATTATT GTTTGCCTTA 
ACAACCACCC CCATCGCAGC AAATTTATTA TTGTCAGCCC CGATGCTGGC ACAGTCGGAG
ACAGACACCC CCAATTTTGC GCTACCGCAA ACTGTGGAAA ACGGAACGAC AGTGCGGATT
GATGGCTCCA GTACCTTGAT AGCGGTGAAT CAAAGCTTGA AACAGGGTTT TGAGCAACAG
TTCACTGGTA CAAGAGTAGA TGTAGCTACT AATGGTACAG AAGCTGCACT CAAATCTGTA
TTAGCTGGAA ATATTGATGT GGCGGCTATT GGTCGTGGTT TGACCCCGGA AGAAAAAGCC
CAAGGTTTGG AACAGGTAAG GTTACATCGG GAAAAGATAG CCATTATTGT TGGGGAAGAA
AATCCCTTTA AGGGCAGCTT GACCGATCGC CAATTTGCCA GAATTTTTCG CGGACGCATT
ACTAATTGGT CACAACTGGG AGCGCCATCA GCCAAGATTC GGGTGATTGA TCGCCCTAGC
ACCAGCGATA CCCGCGAAGC ATTAAATAAC TATCCAGTTT TCAAGGCTGC TAAATTTGCT
ACAGGCTCGA CGGCAACTCA GGTAACAGAA GACAACACCG CCGAAATTAT CAAACAGCTA
GGTAAGGACG GGATCAGCTA CGCCAGAGCC AATCAAATCT CTAAATTGCC TGGTGTGCGG
GTACTTAAAT TACACGACAC CCCACCAGAT GATCCTAAAT ATCCTTTTTC TCAGCCTTTA
GTTTACGTTT ATAAAAAGAA TCCTAGTCCA GCGATCGCCT CTTTTCTCGG TTTTGCTTTA
GCGTCACCAG GACAAAAGGC AATAGAAACG GCAAGAGTAG CTGAAGCAGA AGCGATCGCC
AAAGATGCAG CCCAACAAGC ATCCTTCACA ACCGCCAATT CTCTGGCTCC AGCAGCCAGT
GCTTCCCCCA CAGCAGAAAC CACACCAGCA ATCAATTCCT CTCCCATTCT GGAAAGCCCA
CCCACAGCTA ACACTTTTCC TGATACTGCC AGTACCAGCA CTAATGTGAA TCAGCCTGCG
ATCGTATCTA CCCCAGAAAC ACCTAGATTG GATCGCTCCT TGTTATGGTG GTTATTGTTG
CCAGTAGGGG CTATAGCTGG ACTATTGTTG TGGTTTGTCA AACGTTCATC TGCGACTAAA
GTATTAGAAA GCACTACAGA ATCCATTGCT GGTGATACGG AAACACCTGC CACTGAACCA
TTAGGAGAAA GCCGCAATGG GCTACATCCC AACCTAAGTG ATGGCAATAC TGTAACCGCA
GTCCCAGGAA CAAATGCAGT AGCATCCCCC ACCACAACTA CCACGGCTGT AGCTGATCAA
GAGCCGAAAA ACACCAATTA CACAAGTAAT TATCAAGTAA AAATCCCGCC TGAATTGGAG
CAATCACCTT GGGATATGGA AGCACCCGCA TCTGTGGTTA ATACTTCCTA TCCCCAAATG
ATCGAAGTCG GGAAAGCCCC CACTCATGCA GAAACCCAAA CAACAGATAT CGCCCCAGAA
ATCCCCGAAA TCTCATCAAA TGGGGATAAT CATCACCCAA TCACAGAGGA ACAACCCCAG
CCACCAGAGA ATTTAGCAGA TCAGACTCAT AGGGAGCCAG AAGATCACCA AGAACCCGAA
TTAACAACCT ATCCAGTCAG CGAAGATACA TCGGAGATAG TGGATTCACT GCCAGAATTG
CCAGACTTTG AGGCGATTTT TGCCGATGAA CCAGAAGAAA ATAGCGAAGT CGCTAACAAC
TGGATAGCCC CAATTACAGA ACAGACAAAT ATCTCATCTG TTGATGGACA AGTTGAGAAA
ACTACAACAG CGATCGCCTC GGCTGAATTA CCTGAACAAC AGACAACACA AGAAGTAACA
ACCACACTGC CAAAGTTTGC CGATATTCCC GAAGATGCCC TCAATTTGGT AGCCGATGCA
GCCGAAATTC ATGAAGGTGG CACACTAGAA GATCCAGATT ATCTCACTCC ATCCAGTTTA
GGGGCTTTAG CCGGTGGTGC AGTCTTAGCA GGTGTAGGGG TACAAACCTG GGTTGGTAAA
CATGATGTAG AGGAAAGTGA CACATCCACA GTGCCAGCAT CACCCAACAC TCCTGAAGTA
GCCACAACTG ACGAGTTAGA AGATGGGGAA GAAGACAGTA ATATTGTCCT CAGACCCCGT
AACCCTGAAT GGGCTTACGT TTCTTGGTAT ATTTCACCTA GTGATCAGCA GAAGTTTCAA
GCTCAAGGTT CTTCAGAATT GGCATTGCGA CTTTATGACA TTACTGGTGT TGATCTGAGT
TACCAAAATC CCCACCAGGC GCAGCAGTAT GAATGTGAAC CAGGAACAAG CGATCGCTAT
GTACCCATTC CGGCAAGCGA TCGCGATTAT ATAATTGAAA TTGGCTATCT AAGCAGTGGT
GAACACTGGA CTACTATTGC CCGTTCTCCT AGCGTCCGCA TCTTTGATCG TCTACATATA
GACTCTCTGT CCCCAAATTT ACCAGCAATA GACGAAGCCA GCAGTGTTGT CTTCCAGCAC
CGGACTTCCA AATGGGCTTA TGTAAGTTGG GACATTTCCG CAACTCATCA GCAATTACTC
AAGGATGCAG GGATTTCCCA GCTAGCACTA CGGCTTTATG ATGCCACCAA TATTGACCTG
AGTTACCAGC GCCCCCAATT GGTGCAGCAA TATGAATTTG ATGAAATCAC CCGCGATCGC
TATGTGTCAA TTCCCCAGAG CGATCGTGAC TACATGACAG AAGTGGGTTA CTCTACCCCA
GAGGGTGAAT GGGTAACTAT CGCCAGTTCC CATACTGTTC GTGTCTTTAG TAGTCCTCAA
GGAGATTTTT GGTTCTTAGC AGATACTGAG TTAATTATCC ACGGAGCCAC AGATCCAGGT
GCAACAGTCA ATATTGCCGG TAAACCAGTC ACCCTCAAAT CTGACGGCAC TTTTCACCTG
CGTGTCCCCT TCTCTGAAGA CTTGCTAGAT TATCTGATCA CCGTCACTAG TGGAGAACAA
AGAAAAACCA TCCGTAAGAA GTTTGTTCAA GAAACTTCAG ACAGCTAA
 
Protein sequence
MWQKNKQDSG IVNLALLFAL TTTPIAANLL LSAPMLAQSE TDTPNFALPQ TVENGTTVRI 
DGSSTLIAVN QSLKQGFEQQ FTGTRVDVAT NGTEAALKSV LAGNIDVAAI GRGLTPEEKA
QGLEQVRLHR EKIAIIVGEE NPFKGSLTDR QFARIFRGRI TNWSQLGAPS AKIRVIDRPS
TSDTREALNN YPVFKAAKFA TGSTATQVTE DNTAEIIKQL GKDGISYARA NQISKLPGVR
VLKLHDTPPD DPKYPFSQPL VYVYKKNPSP AIASFLGFAL ASPGQKAIET ARVAEAEAIA
KDAAQQASFT TANSLAPAAS ASPTAETTPA INSSPILESP PTANTFPDTA STSTNVNQPA
IVSTPETPRL DRSLLWWLLL PVGAIAGLLL WFVKRSSATK VLESTTESIA GDTETPATEP
LGESRNGLHP NLSDGNTVTA VPGTNAVASP TTTTTAVADQ EPKNTNYTSN YQVKIPPELE
QSPWDMEAPA SVVNTSYPQM IEVGKAPTHA ETQTTDIAPE IPEISSNGDN HHPITEEQPQ
PPENLADQTH REPEDHQEPE LTTYPVSEDT SEIVDSLPEL PDFEAIFADE PEENSEVANN
WIAPITEQTN ISSVDGQVEK TTTAIASAEL PEQQTTQEVT TTLPKFADIP EDALNLVADA
AEIHEGGTLE DPDYLTPSSL GALAGGAVLA GVGVQTWVGK HDVEESDTST VPASPNTPEV
ATTDELEDGE EDSNIVLRPR NPEWAYVSWY ISPSDQQKFQ AQGSSELALR LYDITGVDLS
YQNPHQAQQY ECEPGTSDRY VPIPASDRDY IIEIGYLSSG EHWTTIARSP SVRIFDRLHI
DSLSPNLPAI DEASSVVFQH RTSKWAYVSW DISATHQQLL KDAGISQLAL RLYDATNIDL
SYQRPQLVQQ YEFDEITRDR YVSIPQSDRD YMTEVGYSTP EGEWVTIASS HTVRVFSSPQ
GDFWFLADTE LIIHGATDPG ATVNIAGKPV TLKSDGTFHL RVPFSEDLLD YLITVTSGEQ
RKTIRKKFVQ ETSDS