Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ava_1772 |
Symbol | |
ID | 3682115 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | - |
Start bp | 2208798 |
End bp | 2211917 |
Gene Length | 3120 bp |
Protein Length | 1039 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 637717112 |
Product | hypothetical protein |
Protein accession | YP_322289 |
Protein GI | 75907993 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTCTGC TAGATTTACA CCCACAACAC CTGGAAGAAT TAGTCAAGGA TAGTGGTATA GAATTACACT TGACTCAGCT TAATTTTAAG TCTCTCCAAG GCGTAAGCGC CTATGAGCAT CTATTAATTT CCGAACACCT ACCCCGCACC AATACGGGAA TGGTTAAAAG TGGCTGGTTA CACCTTTACA GTCATGTTAC GGCTGGTGGT TGGTGGTGTT CTGGGTTAGA TCCTCTCAAC AATTGGCAAG GTATGGAATG GGGATGTTTT AAGCCAAATC AACCGCGCAC GAATCAAAAT GGCAAATCTA TCAAATATGA ACATCCCCCC AGCACAGCAA CGCGGATATT CTGTCTGCGG GTAACATTAG CGATATGGAG ACAAGTCTCA GGGCGTTACA ATTTCCCGAT TCCTGAAGAT ATCACCATTA ATTCCCAAGG TGAAGCAGAA GGCTTTTGGC AATGGGTAAT GGAGCGCAAC ATACCAGTCA TCATTTGCGA GGGAGCCAAG AAAGCCGCAG CATTATTGTC TCAGGGATAT GCGGCGATCG CAATTCCGGG GATTACCAGT GGTTATAGAG TTGTTAAAGA TAAATTTGGT AAAGTCACTA GCCGCCAGCT AATCCCTGAC TTAGCTGTAT TTACGGCAAT AAAGCGGACT TTTTATATCT GCTTTGATTA TGAAACTCAA CAGAAAAAAA TAGCAGCTGT TAGTAATGCC ATTTCCCAAC TAGGTTGTTT ATTCCAAGCA AGAAAATGTC CTGTAAAAGT TATCGAACTC CCAGGGTTAG AAAAGGGTGT AGATGAGTTA ATTGTTGCTA AAGGCGCAAG TGTTTTTGAA AAAGTTTATC GTCAAAGTGT AGATTTAGAA ATTTACCTTG CTCAAATCAA ACCGCACAGC GAACTAACAA TTCCAGCAGC CATAACAGTG AATCTTCCAT ATTTAGCAGA AATACCCTTT CCTAGCTCTG GATTAGTTGG TGTCAAATCA GCGAAAGGTA CAGGTAAAAC GACATCATTA CAAGCGGTTG TCCAACAAGC CAAAAATATT AACAGATCTG TATTATTAAT TACCCATAGG ATTCAGTTAG GACGTTTTTT ATGTGAAAAA ATTGGTATTC AATGGGGAAT TAATCATACA GAAGGTTTAA CAAAAAATAG TGATTGGCTA AAAAATACAG AAACACCATC TTTAGGCTTA TGCGTTGATT CTATATGGAA ACTACGCCCA GAAGAATGGC AAGGTGCAAT CATAATTCTC GATGAAGTTG AGCAGTCTTT GTGGCATTTG CTCAACAGTA ATACTTGTAA ACATAAACGT GTCAAGATTT TAAAATTATT TCAACAATTA ATTTCTCTAG TTTTATCAAC AGGTGGCTTA GTAATTGCCC AAGATGCTGA TTTGTCAGAC GTATCTTTAG AATATTTACA AGGTTTATCT GGCTGTAAAA TCACCCCTTG GGTATTAATA AATCAATGGA AGCCACAACG AGGATGGGAA GTAACTTTTT ATGATTCCCC TAACCCCATA CCATTAATTC AGCAATTAGA ATTAGACTTG CTAGCAGGAC GTAAATGTTA CGTCACCACT GATAGCCGTT CCGGACGTTA TAGCTGCGAA ACAATTGAAC GTTATCTTAA AGAACGTTTA GAAAAACTGC GATACGAATT TCCCAAAACC CTAGTAGTTA ATAGTCACAC AACTAACACT CCTGGTCATG CAGCAGTCGA TTTCGTTGCA GCTATTAACC AGAAAATTAC CGAATATAGT AATGTGTTTG TGACTCCTAG CTTAGGAACA GGTATTAGTA TTGATGTGCA GCACTTTGAC CGGGTGTATG GCATTTTTCA AGGAGTAATT CCTGACTCAG AAGCACGACA AGCCCTAGCG AGAGTTAGGG ATAATGTACC AAGAATTGTC TGGTGTGCTA AACGGGGTAT TGGTTTAATT GGCAGTGGTA GTACCAATTA TCGTTTACTA TCCGATTGGT ATCAGGAGAA TCAAAAAGAA AATCTAGCTT TGCTTAGTCC ATTACACAAA ATAGATGTAG ATTTACCCTT AGTTTATGAC CCTATTCATT TACGAACATG GGCTAAATTA TCCGCCAGAG TAAATGCTTC TGTTCGTATC TATCGCCAAT CGATGGAAGA AGGATTAACT ACAGATGGGC ATCAAATTCG CTTGCGGAGT AATGCCGTTC ACAATAATAT TATTCGAGAT TTACGCTTGG CATTCCTCGC AACAGAGCCA AGTGATTTGA AAGAACGCCA AAGATTAGTT CTAGAAATTG TCAAAGTGCA GAAAGATTGG GTAGAAAAGC GGCATAAAGG TAAAGAAATC AAGCGTCAAA TTAAAAAAAT TAAGCAGCAA AATCAACTCA CTTCTGCCCA TAATGTAGCT GCTGCTAAAG ACATTGATTA TTTAGAATAT GAACATCTTT CAGCCAAGCA TTCTCTAACT GATGAAGAAC GCAATCAAAT TCAAAAATAT AATCTCCGCC AAAGATATGG CATCTTTGTC ACTCCTTCGC TCAAGTTAAG AGATGACCAA GGATATTATA CTCAACTGTT AATTCACTAC TACCTGACCC ATGAAAGTGA GTATTTTCAA ATTAGAGACC AACAGGAATG GCATCAACAA TTATCCTGGG GTAATGGTAA AGTTTTTCTG CCAGATTTGA AAACCTATAC GTTAGAAGTT GAGGCAATGA GAGCCTTAGG TATGCCCCAA TTTATTGATA TAGAACGAGA ATTTACGGAA AATGCCTCTG ACTTAATTTG GCTCAAAGAT GTGGTCTTTC AACATAGTAG ACATATTAAA AGAGTTTTGG GCATTGACTT TATTCGCTGC CAAGAAAGAA TTACAGCAAT TAAGGTTCTT AGCCGTCTAA TGAATTTGTT GGGTTTAAAG CTGAAGCGAG TCGGTGATAT ATATCAAATC GATTCGGAGA CATTTAATGA TGATAGACAA AAAATATTTC CAGTTTGGCA ACAGCGAGAT GAAGTCATAC TCACTCAAAT CAATAATATG AGACGCGAAA AATATAATGT ACTCTCAAAC CAAAATCCAC AAGCGAAAAA TACAAATTCG GTAATCTCTA CTATGGTTTC CCTATTTTAG
|
Protein sequence | MRLLDLHPQH LEELVKDSGI ELHLTQLNFK SLQGVSAYEH LLISEHLPRT NTGMVKSGWL HLYSHVTAGG WWCSGLDPLN NWQGMEWGCF KPNQPRTNQN GKSIKYEHPP STATRIFCLR VTLAIWRQVS GRYNFPIPED ITINSQGEAE GFWQWVMERN IPVIICEGAK KAAALLSQGY AAIAIPGITS GYRVVKDKFG KVTSRQLIPD LAVFTAIKRT FYICFDYETQ QKKIAAVSNA ISQLGCLFQA RKCPVKVIEL PGLEKGVDEL IVAKGASVFE KVYRQSVDLE IYLAQIKPHS ELTIPAAITV NLPYLAEIPF PSSGLVGVKS AKGTGKTTSL QAVVQQAKNI NRSVLLITHR IQLGRFLCEK IGIQWGINHT EGLTKNSDWL KNTETPSLGL CVDSIWKLRP EEWQGAIIIL DEVEQSLWHL LNSNTCKHKR VKILKLFQQL ISLVLSTGGL VIAQDADLSD VSLEYLQGLS GCKITPWVLI NQWKPQRGWE VTFYDSPNPI PLIQQLELDL LAGRKCYVTT DSRSGRYSCE TIERYLKERL EKLRYEFPKT LVVNSHTTNT PGHAAVDFVA AINQKITEYS NVFVTPSLGT GISIDVQHFD RVYGIFQGVI PDSEARQALA RVRDNVPRIV WCAKRGIGLI GSGSTNYRLL SDWYQENQKE NLALLSPLHK IDVDLPLVYD PIHLRTWAKL SARVNASVRI YRQSMEEGLT TDGHQIRLRS NAVHNNIIRD LRLAFLATEP SDLKERQRLV LEIVKVQKDW VEKRHKGKEI KRQIKKIKQQ NQLTSAHNVA AAKDIDYLEY EHLSAKHSLT DEERNQIQKY NLRQRYGIFV TPSLKLRDDQ GYYTQLLIHY YLTHESEYFQ IRDQQEWHQQ LSWGNGKVFL PDLKTYTLEV EAMRALGMPQ FIDIEREFTE NASDLIWLKD VVFQHSRHIK RVLGIDFIRC QERITAIKVL SRLMNLLGLK LKRVGDIYQI DSETFNDDRQ KIFPVWQQRD EVILTQINNM RREKYNVLSN QNPQAKNTNS VISTMVSLF
|
| |