Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ava_0539 |
Symbol | |
ID | 3682369 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | - |
Start bp | 680088 |
End bp | 681767 |
Gene Length | 1680 bp |
Protein Length | 559 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 637715867 |
Product | transposase IS4 |
Protein accession | YP_321058 |
Protein GI | 75906762 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3666] Transposase and inactivated derivatives |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.892761 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAGCGTAG CATCTACAAA GATTATGACC CTGCACCCGC GTGATATGTC GCAGATTCCT GAAACAACAG CGCAAGTAGC CCGGAATTCA TTTCCCAAAG GGAACATATA TATGAAGATG CGGGATGAAA TAGGAGTGTT ATATAAGGAT GAGGATTTTG TCAAACTTTA CCGCGCAGAT TGTGGTCAAA GTGGAATATC AGCAGGACAA CTGGCATTAG TGACAGTAAT GCAATTTATC GAAGGTTTAA CGGATAGACA AGCGGCGGAT GCAGTGAGGG GTCATATTGA TTGGAAATAC GCACTATCGT TGGAATTAAA TGACCCAGGG TTTGATTATT CAGTACTTTC AGAATTTCGT CAGCGATTAA TCAAAGCAGG ACGAGAGCGA GAGTTACTCA ACCAAATGCT AGCTCGTTTC CAAGAACTAG GTTGGCTCAA AAATCGCGGC CGTGTCAGAA CTGATTCAAC TCACGTATTA GCCGCAGTAC GACAGTTAAA TCGTTTGGAA TTAGTGGGAG AAACTTTACG TCATACCTTA AATGACTTGG CTTATTTTGC CCCTGATTGG CTCAAATCGA GAGTTGACGT TGATTGGTTT GAACGTTACT CCCTGAGATT TGAGCAATAC CGCTTGCCCA AATCAAAAGC CGAACGTGAG AAATTGAGGC GAAAAATTGG TGAGGATGGT CATCATTTGC TATCCGCTTT GTATGCAGAC TCAACTTGTA ATTGGCTGTG GCAGATTCCA TCAGTGGAAA CATTACGTAT AGTTTGGGTG CAACAATACT ATATTCAATT GCAACAAGTC TATTGGCGAG AACAAGATAA CTTACCACCA AATAGACTAC AGATTGAATC TCCTTACGAT GTTGATGCAC GCAATTCCAG CAAGCGAGAA ATCAACTGGA CTGGTTATAA TCTGCATCTG ACAGAAATTT GTCACCCCAT ACTGCCAAAC TTAATTATCA ATGTGGAAAC GTCCGTGGCC ACAAGTGCGG ATGTTGAGAT GACACCAGTA ATTCATTCTC GTTTAAACCA GAACAATCTT TTGCCACAAG AACATGTTGT CGATACTGGC TATGTCAATG CTCAAAACTT AGTCGATAGT CAATCCCATT TTCATGTTGA TTTAGTAGGA AAAGTTCCCC CCGGAACTAG TTGGCAAGCA ACAGCACAAT CCGGCTTTGA GCAAAATTGC TTCACTATTC ATTGGGATTT GATGCGTGTT GATTGCCCAA TGGGTAAACA AAGTAAGTCC TGGCGTACAA CTGTCGATAG CCATGACAAT CCAGTAGTCA AAATACAATT TGACAAATCC GATTGTTCGC TTTGTTCAAG TCGCTCAAAA TGCACTCGCT CCAAAAAACT ACCGCGTCTT CTGACCCTCA AACCACAGGA ACTACATCTT GCATTACATG ATGCTCGCAT TCGCCAAAAA ACTGAATCTT TTCAACAAAT TTATCACCAA CGTGCTGGCG TTGAAGGCTT GATTTCCCAA GCTACTGGTC GCTACCAATT ACGCCGTTGT CGCTACATTG GTCTTGCCAA AACTCTCTTG CAGCATGTCA TTACTGCTGC TGCTATCAAC TTCAGTCGGA TGTGGGATTG GTGGCAACAT GTCCCACGCA GTCAGACTCG CGTTTCTCAC TTTGCTCGAA TTGCTCCCAC TGCCTCATAG
|
Protein sequence | MSVASTKIMT LHPRDMSQIP ETTAQVARNS FPKGNIYMKM RDEIGVLYKD EDFVKLYRAD CGQSGISAGQ LALVTVMQFI EGLTDRQAAD AVRGHIDWKY ALSLELNDPG FDYSVLSEFR QRLIKAGRER ELLNQMLARF QELGWLKNRG RVRTDSTHVL AAVRQLNRLE LVGETLRHTL NDLAYFAPDW LKSRVDVDWF ERYSLRFEQY RLPKSKAERE KLRRKIGEDG HHLLSALYAD STCNWLWQIP SVETLRIVWV QQYYIQLQQV YWREQDNLPP NRLQIESPYD VDARNSSKRE INWTGYNLHL TEICHPILPN LIINVETSVA TSADVEMTPV IHSRLNQNNL LPQEHVVDTG YVNAQNLVDS QSHFHVDLVG KVPPGTSWQA TAQSGFEQNC FTIHWDLMRV DCPMGKQSKS WRTTVDSHDN PVVKIQFDKS DCSLCSSRSK CTRSKKLPRL LTLKPQELHL ALHDARIRQK TESFQQIYHQ RAGVEGLISQ ATGRYQLRRC RYIGLAKTLL QHVITAAAIN FSRMWDWWQH VPRSQTRVSH FARIAPTAS
|
| |