Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ava_C0071 |
Symbol | |
ID | 3677810 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007412 |
Strand | - |
Start bp | 99406 |
End bp | 100593 |
Gene Length | 1188 bp |
Protein Length | 395 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 637715155 |
Product | IS891/IS1136/IS1341 transposase |
Protein accession | YP_320349 |
Protein GI | 75812732 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0675] Transposase and inactivated derivatives |
TIGRFAM ID | [TIGR01766] transposase, IS605 OrfB family, central region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.681722 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAAAAG CCTACCGCTA CCGTTTTTAC CCAACTATCG AGCAAGAATC CCTCTTGCGT CGGACAATTG GTTGTGTGCG GTTGGTGTTT AATCGAGCCT TGGCTGCGAG AACCGAGGCT TGGTATGAAA GGCAAGAACG AGTTGATTAC GTTCAAACCT CTGCAATGCT GACGCAGTGG AAAAAACTTG ATGATCTCCA ATTTTTAAAT GAGGTTAGCT GTGTACCACT GCAACAAGGA TTGAGGCATC TGCAAACGGC TTTTAGTAAC TTCTTTGCGG GTAGAGCAAG ATACCCTAAC TTCAAAAAGA AGCGTAGTGG AGGTAGCGCA GAGTTTACCA AGTCTGCCTT CAAGTGGAAG GATGGGCAAG TCTACTTGGC AAAGTGTTCT CAACCTTTGC CAATTAAATG GTCTAGGCAA CTGCCAAAGG ATTGTAAACC CACAACCATC ACGGTTAAAC TTGACCCCTC TGGACGCTGG TTTGTTAGCT TGCAGATTGA CGATTCAACT GATGAAACCA TGCAGCCAGT TGATAGTGCC GTAGGCTTGG ATGTTGGGGT TAGTTCTCTT GTTACTTTGA GTACGGGTGA AAAGATTGCT AACCCCAAGG CGTTTGACAA GCACTACCAA AAGCTGAGGA GAGCGCAGAA AGCTTTGAGT CGCAAACAGA AAGCCTCTCG CAACAGAGAT AAAGCTAGAC TCAAAGTTGC TCGGATTCAA GCTAAGATTT CTGATTCTAG AAAAGACCAT CTGCACAAAC TGACAACTCG ACTGATTCGT GAAAACCAAA CGATAGTAGT TGAGGATTTG GCAGTTAAGA ACATGGTCAA AAATCCCAAA CTTGCCCGTG CCATCAATGA TGCTGCATGG GGTGAGTTAG TGAGACAACT GGAATATAAA GCCAAGTGGT ATGGTCGAGC CTTGGTAAAA ATTGACCGAT GGTTTCCTAG CTCTAAACGC TGTGGACATT GCGGTCACGT TGTTGAAAAG CTGCCGTTGA GTGTCAGAGA GTGGGATTGT CCCAACTGCG GAACACACCA CGACCGGGAT ATCAACGCTA GTCGTAATAT TTTGGCGGTG GGACACACCG TTACAGTCTG TGGAGCGAAC GTAAGACCTG ATAGGCATAT GTCTAAAGGG CAGTTGCAAA AATCCAGCAA TGGAAGGAAA CAGAAACCTA AATCGTGA
|
Protein sequence | MEKAYRYRFY PTIEQESLLR RTIGCVRLVF NRALAARTEA WYERQERVDY VQTSAMLTQW KKLDDLQFLN EVSCVPLQQG LRHLQTAFSN FFAGRARYPN FKKKRSGGSA EFTKSAFKWK DGQVYLAKCS QPLPIKWSRQ LPKDCKPTTI TVKLDPSGRW FVSLQIDDST DETMQPVDSA VGLDVGVSSL VTLSTGEKIA NPKAFDKHYQ KLRRAQKALS RKQKASRNRD KARLKVARIQ AKISDSRKDH LHKLTTRLIR ENQTIVVEDL AVKNMVKNPK LARAINDAAW GELVRQLEYK AKWYGRALVK IDRWFPSSKR CGHCGHVVEK LPLSVREWDC PNCGTHHDRD INASRNILAV GHTVTVCGAN VRPDRHMSKG QLQKSSNGRK QKPKS
|
| |