Gene Information |
Locus tag | Ava_C0237 |
Symbol | |
ID | 3678036 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007412 |
Strand | - |
Start bp | 289206 |
End bp | 292244 |
Gene Length | 3039 bp |
Protein Length | 1012 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 637715317 |
Product | hypothetical protein |
Protein accession | YP_320511 |
Protein GI | 75812894 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
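The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the helper names `span_length` and `expected_protein_length` are hypothetical and are not part of the source database.

```python
# Values copied from the Gene Information table.
START_BP = 289206
END_BP = 292244
GENE_LENGTH_BP = 3039
PROTEIN_LENGTH_AA = 1012


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks pass for this record: 292244 - 289206 + 1 = 3039 bp,
# and 3039 / 3 = 1013 codons, one of which is the stop codon.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

The check only relates the listed numbers to one another; it says nothing about whether the annotation itself is correct.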
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0000237421 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.000767054 |
Fosmid hitchhiking | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGAATATTA TTTCTGAAGG TGACAATCAT TTAGCTCCCT TAGATTACAG CAGTTTACTC AACAAAATCT TGGAAGTGCT GCCCAGTCAA AACCCATTCC AGTTCTCCCC AGATGACAGT CGATTGCGGA TTAATATTGA CAAAATCGCC GATCTGGTTG CTAATTCACA AGTGGAAAGC CCTTTGGCAT CTGCTAGCGG GATTCGTTGT GCAACTATCA ACCTTAGCGA TCGCACAAGT AAAAATTTCC CAGAGCAAAT CAGGCTAATC CGCGATCGTT TGCAAAACCT CTTGATTGCC GCATTACCAC CAAACAAAAC AGTAGAATCA GCGATTTCGC AATTACTGAC AGATTTACAG TCTTTTCAAG GTACAAAGCC ATCTCTGGGA TTTACTTATC CTTTCGGAGA GTATACAGGT TTGCAAAAAC AGCGGCTCAG TCTTCAGCGC CACCGTTCAG GCAGTGATTC TCTCTTGAAA CTTCATAAAC TAATCATCAC AGTTAAAGAT GTCAAGGGAT TTGATTCTCA ACTTTCTTCC AGTCTTGATA ATTATATCCA ACAAGAATTT AGCGAAGAAA CTGAGAGTGA TTTAGAGGAA TTAAGCGACA CACTAGAAGA ACTGATTAAA AATCAAAAAT CTGACTTTTA CAAACTCAAA CGAGTTATGT ATACCGAAGC TATTGGTATG CTCAAACGAG AGGCGAAAAT CAGGTATTTA GAGTTTATTT TGGAGAATGT TAACGATAGT GTAGATGGCA AAATATATCT TCAAGATTTA ATTCGACGGC TACGATTACT AGAAGAATAT ATCAATGATA TCAACCGAGG GTATGGTCAT TACCAAGTAA ACTATGCCGG AAAGTCAGTC AACTACAGAG ACTTATTTTC TCGTGCCGAA GCTTTAGATA TTCTCCCCAT TATTCCCTTA ATAGAAGGGT ATTTAGGAGA AACAAAAGAT GATACTAAAG GAGAGCAGCA GTTTATTTTT GGTTTCAAAC TAAAATTTGG CGGTACAGTT CAAGCTTACG GAGACAAGCC ACTTTCTGTA CTAGATTATT ACCTCAACCT AATCGACCCA GAAAGTCAGG AACATAAGGC GGGATTAGGG GACTCTTTTA AAAGTGATTA TTTCAAAGAA AAAGTATTAA AAATAATATT TTTATACTAT TTTATATTTG CTTCTAGGAG TAATCCTTTA GCAGATAACT ACCAAGAAAA TTCAGAGCTA AATTATGAAC CTGTTTCGGT CTTTGAGCAA CAGGTATTAC CTGTACTTAA AGGTTCTGAC GAAGAAGCTA AAAAGAAAAT ATTTGCCAAA ATTAAGGAAG GAATTGAAAA ATTTAACGTC AAAATAAAAA TTGAAAAACT CAGGCAGTTA CTTAAGGATT TGCTCAAAGA TAAGCCTATT CTAAAAAGAC GAGAGTATCC GCTTTATCTC GGCGTTAAAA GAGGCATTTT AGAGCGCGAT TTAGACCGCA TTTTAACTGA TGCTACATTT TTCAAGCCTG TCTTTGGTGA GAAGTCACGA GAAGCCCTAA AGTATATTGC CATCAGGGAG CCTCATGTTG ATAGCAGCTA CCTTTGCAAT CTTTCTGTCA GTATTGCAGT AGAAGATATT CGCTATTTCC CCACTGATGA CCGCCAAATA TTTAGCATGG CGTATGATAT CACAAATATT AAAGCACTAC CAATTCTTTT GGTTCCTTTT ACTGAAGAAA ACTGTCAGAA CATCTACAAT AATTCTTTCA AAGATCAAAA ACCAATTTTA TTCATCTACA ATCATAAAAA GTTACGAGAG 
CAAATTTTCA ACAATCCTGA CTCCCCCAAA GCGTTTATTT ACCGATTTAC TTTTTCTCTG CTTATTTATA TCTGCCTGAA AGTACTGCTT GAGTCTGTCA AAGATAAACT GTTTATTCCC ATTGTGCGGT TACAACTGAA TATTAAGCAG AACTCCACAC CAGAAGAAGA ATTTATGCGC TCGCTCTCTA AAATTTTATC CCACTTATTG AGCGAAGAAC ATCAATCCAG TACTCAAGGA TTCTGTGTCA AAAACCTGAA TCCTTACAAA ATTGAAAATG GTCTAAGTTC TCTTTACTCT GTACTGCCCA AAAAATTTAA ATTCAACGCT TCTTCTTACA ACACCCAGCT AGACAAACTA GCAATTATTA TAGTCTCTAG CCGTGAGAGT GATGCCAGTT GGCGAGGTGG AGAACGAATA TCAAACTTGT TAGGAGAAGT TGTAGGAGTA GAGCGTCTAC AGGATGGCAC AGTTCGCTTA GAAAGGCTCA ACACCTTCGC AGCAAACTAT GAACATCAGC AGATGTTTCA AAGACCTACT ATTGTCATTG ATGAAGTGGA CAAATTGTAC CGCCAGGGCT ATCGGCACTT TATGTATATT GCCAAGTCTC CATATTCTAG CACTCTCAAT ATGACTCAGA GTGATGAAGA TGAATCACTA TTTTTCATGT CTACACCTGT ACTCAAGGCT TTAAAGGGAG ATAGAGACGA CATCAAAATA TATCCAGTAT TTTTTGATAA ATATTATGTG GTAAATCTAG GGCAGACTGT TAGTTCTCTG TACATACAAG ACACGCTTGA ACTAACAAGC TTAGTTGAAG ATCCCAGCAA GAAAGCTGTT GTCTTTTTCA ATCTCTTCAA TGGAATTAAA GTCGGTCAAG ATGACGAGCG TTACTACAAC GGGGTAATTT CCTACGCCAC CTTGCTAAAT GTCTATGATG ACATTCTAGA TGATGAAGAT ATCCGTAGGG GACTAATTTA TGACCGCGAA AACAATCCTA TCAAAAATGA TATCTTGCAA TATCTAACCT TATTCCACTT TTCGCGTTAC GAAGCTGTTC CCAGAAAGAA TCGGAACATT AGCTTTAAAC TTGACCCTTA CGAAAACATA ATTGGGGATG ATGCGGTTGG TAAATTGTCA GTTTTCAAGC ACACCACTAG GAGTGTAAGC TTTAACTCTT TAGCCTTTTT AACTGAAGTT AGACGTGCTT TGAATGTAGA AGGAGAGGAG CGGAAATGA
|
Protein sequence | MNIISEGDNH LAPLDYSSLL NKILEVLPSQ NPFQFSPDDS RLRINIDKIA DLVANSQVES PLASASGIRC ATINLSDRTS KNFPEQIRLI RDRLQNLLIA ALPPNKTVES AISQLLTDLQ SFQGTKPSLG FTYPFGEYTG LQKQRLSLQR HRSGSDSLLK LHKLIITVKD VKGFDSQLSS SLDNYIQQEF SEETESDLEE LSDTLEELIK NQKSDFYKLK RVMYTEAIGM LKREAKIRYL EFILENVNDS VDGKIYLQDL IRRLRLLEEY INDINRGYGH YQVNYAGKSV NYRDLFSRAE ALDILPIIPL IEGYLGETKD DTKGEQQFIF GFKLKFGGTV QAYGDKPLSV LDYYLNLIDP ESQEHKAGLG DSFKSDYFKE KVLKIIFLYY FIFASRSNPL ADNYQENSEL NYEPVSVFEQ QVLPVLKGSD EEAKKKIFAK IKEGIEKFNV KIKIEKLRQL LKDLLKDKPI LKRREYPLYL GVKRGILERD LDRILTDATF FKPVFGEKSR EALKYIAIRE PHVDSSYLCN LSVSIAVEDI RYFPTDDRQI FSMAYDITNI KALPILLVPF TEENCQNIYN NSFKDQKPIL FIYNHKKLRE QIFNNPDSPK AFIYRFTFSL LIYICLKVLL ESVKDKLFIP IVRLQLNIKQ NSTPEEEFMR SLSKILSHLL SEEHQSSTQG FCVKNLNPYK IENGLSSLYS VLPKKFKFNA SSYNTQLDKL AIIIVSSRES DASWRGGERI SNLLGEVVGV ERLQDGTVRL ERLNTFAANY EHQQMFQRPT IVIDEVDKLY RQGYRHFMYI AKSPYSSTLN MTQSDEDESL FFMSTPVLKA LKGDRDDIKI YPVFFDKYYV VNLGQTVSSL YIQDTLELTS LVEDPSKKAV VFFNLFNGIK VGQDDERYYN GVISYATLLN VYDDILDDED IRRGLIYDRE NNPIKNDILQ YLTLFHFSRY EAVPRKNRNI SFKLDPYENI IGDDAVGKLS VFKHTTRSVS FNSLAFLTEV RRALNVEGEE RK
|
| |
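The gene sequence begins with GTG while the protein sequence begins with M, which is consistent with translation table 11, where GTG can act as an initiation codon. The sketch below illustrates this on the first 30 bases of the gene sequence. It assumes the sequence is listed in coding orientation (as it appears above, even though the gene lies on the minus strand) and uses the standard codon assignments, which table 11 shares. The `translate` function and its `first_is_start` parameter are hypothetical names for illustration.

```python
from itertools import product

# Codon assignments in the conventional TCAG order. Translation table 11
# uses the same assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiation codons listed for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(dna, first_is_start=True):
    """Translate a coding-orientation DNA string, stopping at the first stop codon.

    Whitespace is ignored, so the space-separated 10-base blocks used in the
    record can be pasted in directly. If first_is_start is true, a recognised
    initiation codon at position 0 is read as methionine.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and first_is_start and codon in START_CODONS:
            residues.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence from the record.
print(translate("GTGAATATTA TTTCTGAAGG TGACAATCAT"))  # MNIISEGDNH
```

The same function applied to the last nine bases of the gene sequence, `translate("CGGAAATGA", first_is_start=False)`, gives `RK`, matching the end of the listed protein sequence.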