Gene Information |
Locus tag | Ava_4132 |
Symbol | |
ID | 3681208 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | - |
Start bp | 5148212 |
End bp | 5149762 |
Gene Length | 1551 bp |
Protein Length | 516 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 637719478 |
Product | hypothetical protein |
Protein accession | YP_324626 |
Protein GI | 75910330 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR02595] PEP-CTERM putative exosortase interaction domain |
|
|
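The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 5148212
end_bp = 5149762

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted
# in the protein length.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values agree with the Gene Length (1551 bp) and Protein Length (516 aa) fields.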
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0019999 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00477797 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGCCTCCAA ATCGTTGGAA ATCTCCGCCT TTGTTGGGAT TACTCACACC CCTAACCTTA GCTGCTTCAA TGGCTGTAAG TGACACTCCA GCCCAAGCCT TACAGTTTAA TTTCACTTAT GCACCAGGGA CAACCCTTGA CCAAATGCTT GGTTATGAGA TGGCTGGTAG ATATTGGTCA AATTATTTGG CTGATGATGT CACCGTCAAT ATTTTTATTG AATCGACCAA TATATTACCT ACTAATGTGA TTGGTGGGGC GATACCGGGG GTAACTTCAC AAAGTTATAA TAATGTGGTG AGAAGACTAC AAGCAGATAT CACCTCATCT AGCGATCGCA CAGCTTTAAA TACTATATAT AACCAATGTA ATATCAGCCT TACATTCAGC AATGGATCAT GGCAAAATCA ATGCACAGGA TATAAATCTT TAACCAATAC CTATAAATAT GGACTAGCTG ACTTAGAGTT TCAGTCCGAT GTGAGAAATA TTAATCTCAC CCGTGCTAAT GCCAAAGCTT TAGGAATTAT TAGTGCTAAT GATCCTGGTT ACGATGGTTA TATTCTCGTG AGTAACTTAG GCAATATTAC TCGGCCTCTC TCATGGAATT ATTTTTCTGC CACCAACGCT AGTAACACTA TTCCTACTGC CACTGTAGAT TTTTTTAGTG TAGCCGTACA TGAATTAACT CACACCCTCG GCTTTGTCAG TGGTATAGAC TCACCTGAAT ACGCAGATTT ATTGACAAAA AAGTTATGGT TCACTGATAG TGATATGTCT AAATATGGCT ACTTGATGGA TATGTTTCGT TTATCTCAAG ATAGCCGCTT TTGGAATAGG CCTGATATAT CCGTTGGTGT GGATACAGTA TTATCTATTG ATGGTGGTAT GACCAAGTTA GGTAATTTGT CTTCTGGAAC TACCAAATTG CGAGGAGACG GTAATCAAGG CAGTCACTGG AAAAAAGATG GTATCTATGA TGGCATTATG GAAGCTGCTC TGCGTGCAGG CGGTACAAGA AAACTCGTCA CCAATAACGA TCTGACTACT TTAGATATCT TGGGGTGGAA TCTCCAACAG AATAATCAAG ACCTACTCAC TATCTTCAAT AGTGCCAAAT CTGGACTTGC CACTAAGATG GGAGTCACTA CTAGTTGGAT CGATGCTAAT ACCAATCAAG CTGCTTCACT TCTGACACCA CAATATATCG ATGCCAATAA CAATAGCTAT GACGATCGCG GTGAAGCCCT CAATAAAATG ATCACTAGCA GTGGTACTTA TAATTGGGGC TGGAGTGGCT ATTGGTGGGG CTGGAGTGGC TATTGGTGGG GTTGGAGTGG CTATTGGCAA AATACAGATA ATTTAGCTAC GGATGGCTTT TGGCAAAATT TCGCCTGGGA AACATTAGAC GAATTTGATG ATGGCAACTC TGACCATTTG AGTGGGTCTT CTCAAGCGCA ATCTGTACCA GAACCCACTA CTATCTTTGG ATTATTAGGA ATGGCTGTAC TTGGCATTGC TCCCAAACTC AAGCGCCGTT GTGAAAATTA G
|
Protein sequence | MPPNRWKSPP LLGLLTPLTL AASMAVSDTP AQALQFNFTY APGTTLDQML GYEMAGRYWS NYLADDVTVN IFIESTNILP TNVIGGAIPG VTSQSYNNVV RRLQADITSS SDRTALNTIY NQCNISLTFS NGSWQNQCTG YKSLTNTYKY GLADLEFQSD VRNINLTRAN AKALGIISAN DPGYDGYILV SNLGNITRPL SWNYFSATNA SNTIPTATVD FFSVAVHELT HTLGFVSGID SPEYADLLTK KLWFTDSDMS KYGYLMDMFR LSQDSRFWNR PDISVGVDTV LSIDGGMTKL GNLSSGTTKL RGDGNQGSHW KKDGIYDGIM EAALRAGGTR KLVTNNDLTT LDILGWNLQQ NNQDLLTIFN SAKSGLATKM GVTTSWIDAN TNQAASLLTP QYIDANNNSY DDRGEALNKM ITSSGTYNWG WSGYWWGWSG YWWGWSGYWQ NTDNLATDGF WQNFAWETLD EFDDGNSDHL SGSSQAQSVP EPTTIFGLLG MAVLGIAPKL KRRCEN
|
| |
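The sequence fields above are displayed in space-separated blocks of ten characters. The sketch below shows one way to strip that grouping, compute a GC fraction, and translate a coding sequence. It assumes the gene sequence is already given in coding orientation (it begins with ATG and ends with TAG even though the gene lies on the minus strand) and that translation table 11 assigns the same amino acids to codons as the standard code. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and alternative start codons are not handled.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order ("*" marks a stop codon).
# Assumption: table 11 uses the same codon-to-amino-acid assignments
# as the standard code.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the display spacing and normalize to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence shown above.
fragment = clean("ATGCCTCCAA ATCGTTGGAA ATCTCCGCCT")
print(translate(fragment))    # MPPNRWKSPP
print(gc_fraction(fragment))  # 0.5
```

The translated fragment matches the first ten residues of the protein sequence in the record. The same functions can be applied to the full gene sequence to compare against the GC content and Protein Length fields.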