Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ava_0857 |
Symbol | |
ID | 3681758 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | + |
Start bp | 1048113 |
End bp | 1051220 |
Gene Length | 3108 bp |
Protein Length | 1035 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 637716191 |
Product | putative phosphate transport system substrate-binding protein |
Protein accession | YP_321376 |
Protein GI | 75907080 |
COG category | [S] Function unknown |
COG ID | [COG3330] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.275371 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTGGCAAA AGAATAAACA AGATAGTGGC ATAGTGAATC TGGCATTATT GTTTGCCTTA ACAACCACCC CCATCGCAGC AAATTTATTA TTGTCAGCCC CGATGCTGGC ACAGTCGGAG ACAGACACCC CCAATTTTGC GCTACCGCAA ACTGTGGAAA ACGGAACGAC AGTGCGGATT GATGGCTCCA GTACCTTGAT AGCGGTGAAT CAAAGCTTGA AACAGGGTTT TGAGCAACAG TTCACTGGTA CAAGAGTAGA TGTAGCTACT AATGGTACAG AAGCTGCACT CAAATCTGTA TTAGCTGGAA ATATTGATGT GGCGGCTATT GGTCGTGGTT TGACCCCGGA AGAAAAAGCC CAAGGTTTGG AACAGGTAAG GTTACATCGG GAAAAGATAG CCATTATTGT TGGGGAAGAA AATCCCTTTA AGGGCAGCTT GACCGATCGC CAATTTGCCA GAATTTTTCG CGGACGCATT ACTAATTGGT CACAACTGGG AGCGCCATCA GCCAAGATTC GGGTGATTGA TCGCCCTAGC ACCAGCGATA CCCGCGAAGC ATTAAATAAC TATCCAGTTT TCAAGGCTGC TAAATTTGCT ACAGGCTCGA CGGCAACTCA GGTAACAGAA GACAACACCG CCGAAATTAT CAAACAGCTA GGTAAGGACG GGATCAGCTA CGCCAGAGCC AATCAAATCT CTAAATTGCC TGGTGTGCGG GTACTTAAAT TACACGACAC CCCACCAGAT GATCCTAAAT ATCCTTTTTC TCAGCCTTTA GTTTACGTTT ATAAAAAGAA TCCTAGTCCA GCGATCGCCT CTTTTCTCGG TTTTGCTTTA GCGTCACCAG GACAAAAGGC AATAGAAACG GCAAGAGTAG CTGAAGCAGA AGCGATCGCC AAAGATGCAG CCCAACAAGC ATCCTTCACA ACCGCCAATT CTCTGGCTCC AGCAGCCAGT GCTTCCCCCA CAGCAGAAAC CACACCAGCA ATCAATTCCT CTCCCATTCT GGAAAGCCCA CCCACAGCTA ACACTTTTCC TGATACTGCC AGTACCAGCA CTAATGTGAA TCAGCCTGCG ATCGTATCTA CCCCAGAAAC ACCTAGATTG GATCGCTCCT TGTTATGGTG GTTATTGTTG CCAGTAGGGG CTATAGCTGG ACTATTGTTG TGGTTTGTCA AACGTTCATC TGCGACTAAA GTATTAGAAA GCACTACAGA ATCCATTGCT GGTGATACGG AAACACCTGC CACTGAACCA TTAGGAGAAA GCCGCAATGG GCTACATCCC AACCTAAGTG ATGGCAATAC TGTAACCGCA GTCCCAGGAA CAAATGCAGT AGCATCCCCC ACCACAACTA CCACGGCTGT AGCTGATCAA GAGCCGAAAA ACACCAATTA CACAAGTAAT TATCAAGTAA AAATCCCGCC TGAATTGGAG CAATCACCTT GGGATATGGA AGCACCCGCA TCTGTGGTTA ATACTTCCTA TCCCCAAATG ATCGAAGTCG GGAAAGCCCC CACTCATGCA GAAACCCAAA CAACAGATAT CGCCCCAGAA ATCCCCGAAA TCTCATCAAA TGGGGATAAT CATCACCCAA TCACAGAGGA ACAACCCCAG CCACCAGAGA ATTTAGCAGA TCAGACTCAT AGGGAGCCAG AAGATCACCA AGAACCCGAA TTAACAACCT ATCCAGTCAG CGAAGATACA TCGGAGATAG TGGATTCACT GCCAGAATTG CCAGACTTTG AGGCGATTTT TGCCGATGAA CCAGAAGAAA ATAGCGAAGT CGCTAACAAC TGGATAGCCC CAATTACAGA ACAGACAAAT ATCTCATCTG TTGATGGACA AGTTGAGAAA ACTACAACAG CGATCGCCTC GGCTGAATTA CCTGAACAAC AGACAACACA AGAAGTAACA ACCACACTGC CAAAGTTTGC CGATATTCCC GAAGATGCCC TCAATTTGGT AGCCGATGCA GCCGAAATTC ATGAAGGTGG CACACTAGAA GATCCAGATT ATCTCACTCC ATCCAGTTTA GGGGCTTTAG CCGGTGGTGC AGTCTTAGCA GGTGTAGGGG TACAAACCTG GGTTGGTAAA CATGATGTAG AGGAAAGTGA CACATCCACA GTGCCAGCAT CACCCAACAC TCCTGAAGTA GCCACAACTG ACGAGTTAGA AGATGGGGAA GAAGACAGTA ATATTGTCCT CAGACCCCGT AACCCTGAAT GGGCTTACGT TTCTTGGTAT ATTTCACCTA GTGATCAGCA GAAGTTTCAA GCTCAAGGTT CTTCAGAATT GGCATTGCGA CTTTATGACA TTACTGGTGT TGATCTGAGT TACCAAAATC CCCACCAGGC GCAGCAGTAT GAATGTGAAC CAGGAACAAG CGATCGCTAT GTACCCATTC CGGCAAGCGA TCGCGATTAT ATAATTGAAA TTGGCTATCT AAGCAGTGGT GAACACTGGA CTACTATTGC CCGTTCTCCT AGCGTCCGCA TCTTTGATCG TCTACATATA GACTCTCTGT CCCCAAATTT ACCAGCAATA GACGAAGCCA GCAGTGTTGT CTTCCAGCAC CGGACTTCCA AATGGGCTTA TGTAAGTTGG GACATTTCCG CAACTCATCA GCAATTACTC AAGGATGCAG GGATTTCCCA GCTAGCACTA CGGCTTTATG ATGCCACCAA TATTGACCTG AGTTACCAGC GCCCCCAATT GGTGCAGCAA TATGAATTTG ATGAAATCAC CCGCGATCGC TATGTGTCAA TTCCCCAGAG CGATCGTGAC TACATGACAG AAGTGGGTTA CTCTACCCCA GAGGGTGAAT GGGTAACTAT CGCCAGTTCC CATACTGTTC GTGTCTTTAG TAGTCCTCAA GGAGATTTTT GGTTCTTAGC AGATACTGAG TTAATTATCC ACGGAGCCAC AGATCCAGGT GCAACAGTCA ATATTGCCGG TAAACCAGTC ACCCTCAAAT CTGACGGCAC TTTTCACCTG CGTGTCCCCT TCTCTGAAGA CTTGCTAGAT TATCTGATCA CCGTCACTAG TGGAGAACAA AGAAAAACCA TCCGTAAGAA GTTTGTTCAA GAAACTTCAG ACAGCTAA
|
Protein sequence | MWQKNKQDSG IVNLALLFAL TTTPIAANLL LSAPMLAQSE TDTPNFALPQ TVENGTTVRI DGSSTLIAVN QSLKQGFEQQ FTGTRVDVAT NGTEAALKSV LAGNIDVAAI GRGLTPEEKA QGLEQVRLHR EKIAIIVGEE NPFKGSLTDR QFARIFRGRI TNWSQLGAPS AKIRVIDRPS TSDTREALNN YPVFKAAKFA TGSTATQVTE DNTAEIIKQL GKDGISYARA NQISKLPGVR VLKLHDTPPD DPKYPFSQPL VYVYKKNPSP AIASFLGFAL ASPGQKAIET ARVAEAEAIA KDAAQQASFT TANSLAPAAS ASPTAETTPA INSSPILESP PTANTFPDTA STSTNVNQPA IVSTPETPRL DRSLLWWLLL PVGAIAGLLL WFVKRSSATK VLESTTESIA GDTETPATEP LGESRNGLHP NLSDGNTVTA VPGTNAVASP TTTTTAVADQ EPKNTNYTSN YQVKIPPELE QSPWDMEAPA SVVNTSYPQM IEVGKAPTHA ETQTTDIAPE IPEISSNGDN HHPITEEQPQ PPENLADQTH REPEDHQEPE LTTYPVSEDT SEIVDSLPEL PDFEAIFADE PEENSEVANN WIAPITEQTN ISSVDGQVEK TTTAIASAEL PEQQTTQEVT TTLPKFADIP EDALNLVADA AEIHEGGTLE DPDYLTPSSL GALAGGAVLA GVGVQTWVGK HDVEESDTST VPASPNTPEV ATTDELEDGE EDSNIVLRPR NPEWAYVSWY ISPSDQQKFQ AQGSSELALR LYDITGVDLS YQNPHQAQQY ECEPGTSDRY VPIPASDRDY IIEIGYLSSG EHWTTIARSP SVRIFDRLHI DSLSPNLPAI DEASSVVFQH RTSKWAYVSW DISATHQQLL KDAGISQLAL RLYDATNIDL SYQRPQLVQQ YEFDEITRDR YVSIPQSDRD YMTEVGYSTP EGEWVTIASS HTVRVFSSPQ GDFWFLADTE LIIHGATDPG ATVNIAGKPV TLKSDGTFHL RVPFSEDLLD YLITVTSGEQ RKTIRKKFVQ ETSDS
|
| |