Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ava_2451 |
Symbol | |
ID | 3682841 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | + |
Start bp | 3041514 |
End bp | 3043358 |
Gene Length | 1845 bp |
Protein Length | 614 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 637717794 |
Product | integrase catalytic subunit |
Protein accession | YP_322961 |
Protein GI | 75908665 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.0255149 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTAGAAA AGAAAGTTAG CCAAGGTGAA CAAAATAATC TGTTTGAGGT GACACCCATA GGCATGGAGT TGACGGCTCC CACTCAACAA GACCACAAAA TCTTGATTGA GCAAATTGAG GATGCCCAGA TGAAAAGAGA AATTCTCTTG AAAATGGATG CGATAGAGGA TATCCAGACT AATTCTGATA ATGGCAAACA GGAACGTATC AGACAGTGGG CGCAAAAATT AGGGAAGCAT CCAAGAACCA TTACCCGTAT GTTGTCTAAG GCAGACATTG AAGGATTAGC AGCAATAATC AAAACCAAAC GCGCGGATGC TGGTAAACGT AGAGGGAAAA AGCAGTGGCA GCCCAGCGTC GAGTATTGGG TGAATTTTAT CGAAAAAACT TATAGAGATG GCAATAAAAA TAGCCGTCGC ATGAATCGCA ACCAAGTTTA CAACCAGGTG AAGGGACACG CCGAATTGGA ATTAGGGTTG AAAGAGAGTG AATACCCAAG CCATGTGTTT GTTTATCAAG TGCTAGCTCC CTTAGTCGAA AAGAAAAAAG TCCGACATCC AGGGCAAGGT TCTCGGATTG TAATTAAGAC AACGGCAGGA GAGTTAGTAG TTGAGCGTAG CAATCAAGTC TGGCAAATAG ACCATACTCG TTTAGACACT TTATTAGTAG ATGAGAATTT AGAACTAGCT GGCAGTCTCT ACATTACTGT GGTCATCGAT AGCTATTCCG GGTGTGCAAT GGGGTTCTAT CTGGGCTTTG AGGCAGCAGG GTCTCATGAA GTAGCACTAG CCCTGCGTCA TGCGATTTTA GCCAAACAGT ATCCGCCTGA TTACCAAATA CAACATGAGT GGATGCTTGC GGGACTGCCT GAATATATTG TCACAGACCG AGCCAAGGAA TTTAGATCGG GGCATTTGCG ACGAATTGCA ATGGATTTGA ATATTCAACT ACGGTTACGT GCTTATCCAC AGCAAGGAGG TCTGATTGAA AGCCTGTTTG ATAAAGCAAA TAAAGAAGTT TTGTCAATGC TACCTGGCTA TAAAGGCTCC AATGTCCAAA AAAGACCATT AGATGCCGAA AAATATGCCT GTATAACCTA CGAAGAATTT GAGAAAATAT TAACTCGATA TTTTGTTGAC CACTACAATC AACATCTCTA TCCTAGAGTC AAAAATCAGA CACGAATTCA ACGGTGGTGG GCAGGGTTAA TTGGCAAACA ACCTAAGCTA TTAGAAGAAC GCGAGTTAGA TATATGCTTA ATGAAGACAG TACCGCGCTG TGTGCAAGCG TATGGCTGTG TACAGTTTGA GTGTCTAATC TATAGTGCCA CTTGGCTACA AAAGTTTGAG GGACAGCAAG TTACTCTCAG ATACAATCCC AGCAATATTG TTACTCTCCT GGTTTATAGC GTGGAGAAGA ACAATCAAGC GTCTGTTTTT CTAGGTACGG TAAAAGCCAG AGATTTAGAT GAAGAACGTC TGTCCTTGAA GGAGTGGAAG GCAATAAAGC AAAAGGTTCG TTCTTGTGGT AAAACTATTG ACCAGTCGTC AATTTTATCA GAGCGACTAG CTTTAAATGA GTTTGCTCAA GAGAAAATTA GGACTCTTAA ACAACGACGA GCATCAGAAC AAAAACGCAT TAACCGCAAA TCGGTTCAAA GCAAAGTCAT TGAATTATTT CCTGAAGAAA ACGAGACTAC TGTTGTTTTA GAAAAGGAAA CAGACGCTCC TGAGCTATCT CAACAATCAG CAATAAATTT GGTCAGTGAG CCTAAACAAC AATGCGCTAA ATCGAGTCAA AGCATTGCTT ATGACTGGAA CCAAATTATT GAAGAGAATT GGTAG
|
Protein sequence | MLEKKVSQGE QNNLFEVTPI GMELTAPTQQ DHKILIEQIE DAQMKREILL KMDAIEDIQT NSDNGKQERI RQWAQKLGKH PRTITRMLSK ADIEGLAAII KTKRADAGKR RGKKQWQPSV EYWVNFIEKT YRDGNKNSRR MNRNQVYNQV KGHAELELGL KESEYPSHVF VYQVLAPLVE KKKVRHPGQG SRIVIKTTAG ELVVERSNQV WQIDHTRLDT LLVDENLELA GSLYITVVID SYSGCAMGFY LGFEAAGSHE VALALRHAIL AKQYPPDYQI QHEWMLAGLP EYIVTDRAKE FRSGHLRRIA MDLNIQLRLR AYPQQGGLIE SLFDKANKEV LSMLPGYKGS NVQKRPLDAE KYACITYEEF EKILTRYFVD HYNQHLYPRV KNQTRIQRWW AGLIGKQPKL LEERELDICL MKTVPRCVQA YGCVQFECLI YSATWLQKFE GQQVTLRYNP SNIVTLLVYS VEKNNQASVF LGTVKARDLD EERLSLKEWK AIKQKVRSCG KTIDQSSILS ERLALNEFAQ EKIRTLKQRR ASEQKRINRK SVQSKVIELF PEENETTVVL EKETDAPELS QQSAINLVSE PKQQCAKSSQ SIAYDWNQII EENW
|
| |