Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ava_4851 |
Symbol | |
ID | 3679349 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | + |
Start bp | 6110564 |
End bp | 6113596 |
Gene Length | 3033 bp |
Protein Length | 1010 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 637720208 |
Product | hypothetical protein |
Protein accession | YP_325343 |
Protein GI | 75911047 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.00405142 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGAAAAT TTACCTTTTA TTTAATATTG TTCAATGGAT TATTTTTAGG AATTGCAACA GCGAACGCCA AGTTATCACC TGTAGATATA GTTAATCAAC ATTTGCAAAA TTCTCCAATT ATAGATATAA CAACTAAGCC TCAAAAACCC ATAACCAATC AACACCCCCA TATATCTACA CCCTTACATC CTGCTTACAC TCTTTCCCCC TCTGCTGCTA TCAAGCAATC ACATCAGCCT CTACTACAAA GGCCAGAAAA AATTTGGGTA ATTAACCAAA ATCAACAGGT CAAGGATCAA CCCTTTATTT GGGTGGTGAA TGATCATAAA AAGGCAGCAC AGCAACCATT TCTACAAGTT GGTAAATCCA CAGATAAACC GGATAAGAAA CCTGCTACAG AATCTAAGCC GGAAAAAAAG GATGACTTAG AATCTTTTGA TGAAGTAGTA AAAGATACTG AAAAACTAGA CGGTCTATTT ACTCTCTATC GTCATAAAGA AAAGAATAAA ATATATCTAG AAATTAAGCC AGAACAGCTA AATAAAAATT ACTTAGCTAC CGCAACCCTA GAATCTGGTA TTGGCGAACA AGGAATTTAC AGTGGTTTAC CATTACAAGA CTTTTTATTT TATTTCCAGA GAGTAGACAA AAAACTATCT TTTGTGGTGC GTAATGTGAA TTTTCGCACA AGGGAAGGTG ATCCACAAGC GCGATCGCTC GCCCGTTCGT TTAGCGATTC CGTTCTCTAC TCGGTGGAAA TCAAAAGCAT CCATCCCCAA AGAAAAACCT TGTTAATTGA CTTGGGTGAC TTGCTGCTGG CAGATTTAGC CGGATTATCT TTGTTTACAG GATTGACTCC AAATACAGAC CAGTCTTCCT TTGGCAGTGC TAAAACCTTT CCCCACAACT TAGAAATTGA GTCGGTATTG AACTTCTCTA GCAGTACTGG TACAAACCCT AACAATGAAA TGTTATATTT CACGACCGTA CCAGATAGTC GTGGCTTCAC CCTCAGGGTT CACTATAGTC TTTCCCAACT ACCAGAAAAT AATTATCGTC CCCGGATAGC TGATGAACGG GTTGGTTACT TTATCACTGC TTACCAAGAT TTATCTAAAG AAGAACGCAA CGATCCTTTT GTCCGCTATA TTAATCGCTG GCACTTAGAA AAGAAAGACC CGGAATCATC CCTATCTCGT CCCAAAAAAC CCATTGTCTT CTGGATTGAT AACGCCGTAC CCTTACAGTA CCGCGAAGCT GTCAAAGAAG GGATACTCAT GTGGAACAAG GCCTTTCTTA AGGCGGGATT TCAAGATGCA GTGGAAGCCA GACAAATGCC AGACAATGCC GCATGGGACC CAGCCGATAT TCGTTACAAT ACAATTCGTT GGATTAACAC TGTTGATGGT TATTTTGCTA TGGGGCCATC TCGCGTTAAT CCTTTAACTG GGGAAATTTT GGATGCAGAC ATATTAGTTG ATGCTAGTCT TGTCCGCTTA CTCAAAAATC GATACAGCAC ACTTGTAGAA CCTAGTCAAC TCAATACCCG TACCTCCTTA TCGGCATTAA TGCGGAATCG GGGACTTTGT AACAAAGGTT TAGCCGCAAA AGCCAACAAC ACTACTCAAG AAAAATCTCC AAGACCAAAT GGTTTTTTGC AGCGTTTATC CAAGCTAGCC GGTGATTATG ACTTATGCTA CGGCATGGAA GCCGCCAATC AATTTGCTTT TGGGGCTTTG TCCATGTCAC TGCTACAAAA CAACGCACCG AATCAAGAAC AGCTACAAGA ATATATCAAT CAATATTTAC GTTTAATTGT TGCCCATGAA GTAGGACATA CCCTGGGTTT ACGTCATAAC TTCCGTGGTA GTAATCTGCT ATCACCAGAA GAGATGAACA ATCAAGAAAT TAGCCGCCAT AAAGGTTTGA CAAGTTCGGT GATGGACTAT ATTCCACCGA ATATTGCCCC CCAAGGGACA CCGCAGGGAG ACTATTTTCC CAGTATGGTG GGGTCTTATG ATGATTGGGC TATTCAGTAC GGTTATACCC AAACCAACGC GAAAACTCCC ATAGCAGAAA AGCCGATTTT ACAAGCAATC GCCAGCCAAT CTTATAAGCC GGAATTGAGT TATTCTCCCG ATGAGGATAT GTATGACCTC GACCCCACCG CCGATGCTTG GGATCATAGT GGTAACGTGC TGGTTTATTC TCAATGGCAA TTAGATAATT CTCGGTTGAT GTGGGCAAAT CTCAATAAAC GTTTCCCTAT GCCGGGAGAA AGTTATAGTG ATTTAAGCGA TCGCTTTAGC TCAGTTCTCA GTAACTATTT TCAGAATATC TTCTACACAA CAAAATACAT TGGTGGGCAG TCCTTCTACC GTCTACAGGC TGGGGAAATA TCAGCTACTA AGCTAGCCAG TCGCCCAAAT CACTTACCCT TTGAACCTGT ACCTGTTGAA CAACAACGGC AAGCACTCAA AACACTACAA AAGTATATTT TTGCTGAAGA TGCCCTGAAT TTTCCCCCAG ACCTACTGAA TAAATTAGCA CCTTCTCGCT GGTATCACTG GGGGAGTTTT CCCCAAATTG GCCGCTTAGA TTATCCCGTT CATGACTTAA TATTATTCCT GCAAGCTGCT GTATTACGGG AATTACTGGC AGGCGATCGC CTCACTCGTC TCAAGGATAT TGAACTCAAG AGCTTACCCG AAAAATCACT AGCATTACCT GAGCTTTTTG ATACTTTGCA AGCTGGAGTT TGGACAGAAG TTCTCAAACC AAAAGCCGGG GCGCTGAAAA TTACTAGCCT CCGGCGCGGC TTGCAGCGAG AATACCTCGA TATCTTGATT GGTATGGTGT TGCGGCGAGA ATACGTCCCG GAAGATGCCC GTTCCTTGGC TTGGTATAAA CTTAAACAAT TAAACGACAA ACTCAAGTCA GTCAATACCA ACGATGAATA TACCAAAGCC CACTTGTTAG AAACTAGCGA TCGCATTGAG AAAGCTTTGA ATGCCCCATT GCAGGCGAAT TAG
|
Protein sequence | MRKFTFYLIL FNGLFLGIAT ANAKLSPVDI VNQHLQNSPI IDITTKPQKP ITNQHPHIST PLHPAYTLSP SAAIKQSHQP LLQRPEKIWV INQNQQVKDQ PFIWVVNDHK KAAQQPFLQV GKSTDKPDKK PATESKPEKK DDLESFDEVV KDTEKLDGLF TLYRHKEKNK IYLEIKPEQL NKNYLATATL ESGIGEQGIY SGLPLQDFLF YFQRVDKKLS FVVRNVNFRT REGDPQARSL ARSFSDSVLY SVEIKSIHPQ RKTLLIDLGD LLLADLAGLS LFTGLTPNTD QSSFGSAKTF PHNLEIESVL NFSSSTGTNP NNEMLYFTTV PDSRGFTLRV HYSLSQLPEN NYRPRIADER VGYFITAYQD LSKEERNDPF VRYINRWHLE KKDPESSLSR PKKPIVFWID NAVPLQYREA VKEGILMWNK AFLKAGFQDA VEARQMPDNA AWDPADIRYN TIRWINTVDG YFAMGPSRVN PLTGEILDAD ILVDASLVRL LKNRYSTLVE PSQLNTRTSL SALMRNRGLC NKGLAAKANN TTQEKSPRPN GFLQRLSKLA GDYDLCYGME AANQFAFGAL SMSLLQNNAP NQEQLQEYIN QYLRLIVAHE VGHTLGLRHN FRGSNLLSPE EMNNQEISRH KGLTSSVMDY IPPNIAPQGT PQGDYFPSMV GSYDDWAIQY GYTQTNAKTP IAEKPILQAI ASQSYKPELS YSPDEDMYDL DPTADAWDHS GNVLVYSQWQ LDNSRLMWAN LNKRFPMPGE SYSDLSDRFS SVLSNYFQNI FYTTKYIGGQ SFYRLQAGEI SATKLASRPN HLPFEPVPVE QQRQALKTLQ KYIFAEDALN FPPDLLNKLA PSRWYHWGSF PQIGRLDYPV HDLILFLQAA VLRELLAGDR LTRLKDIELK SLPEKSLALP ELFDTLQAGV WTEVLKPKAG ALKITSLRRG LQREYLDILI GMVLRREYVP EDARSLAWYK LKQLNDKLKS VNTNDEYTKA HLLETSDRIE KALNAPLQAN
|
| |