Gene Information |
Locus tag | Cyan8802_4585 |
Symbol | |
ID | 8393869 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8802 |
Kingdom | Bacteria |
Replicon accession | NC_013163 |
Strand | - |
Start bp | 2924 |
End bp | 4579 |
Gene Length | 1656 bp |
Protein Length | 551 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 644984651 |
Product | transposase IS4 family protein |
Protein accession | YP_003142302 |
Protein GI | 257062244 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3666] Transposase and inactivated derivatives |
TIGRFAM ID | |
|
|
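The length fields above follow from the coordinates by simple arithmetic. The sketch below is a consistency check only, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length counts the stop codon while the protein length does not. The record's numbers fit these assumptions, but the record does not state them. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


# Values taken from the Gene Information table above.
length = gene_length_bp(2924, 4579)
print(length)                     # 1656
print(protein_length_aa(length))  # 551
```

Both results match the Gene Length (1656 bp) and Protein Length (551 aa) fields.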
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.34521 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCCTAC ATCCCAAACC AATTCCGCCG ATTCCCGAAG ATACAGTACG AGTAGCCCGT GCAGCTTTCC GGTCAGGCAA TATTTATCTA CAACTACGAG ATACTTTGGA AACTATCTAT GTTGATGAAG ATTTCGCTGA TTTATTCTCT GTCAAAGGAC AACCAGCCCA ATCCCCTTGG CGACTAGCTT TAATCTGTAT AATGCAATAT ATGGAGAATC TCTCAGATCG TCAAGTAGCA GAAGCAGTAC GAGGGCGAAT TGATTGGAAA TATGTACTTT CTTTACCCTT AGAAGATAGT GGGTTCGATT ACTCTGTATT AAGTGAGTTT CGTAAACGTT TAATTGAAGG AGGTTCTGAA GAATTATTAC TTAATAAAAT CCTCGAAAAG TTTAAAGAAA AAGGAATCTT AAAAAAACCC AAACAACAAA GAACTGATTC AACTCATATC CTCGCAGCAA TTCGGCCTTT AAACCGTTTA GAAACCTTGG GGGAGACTAT GAGGGCAGCC CTAAATAGTT TATCCGTAGC TGCCCCTGAT TGGTTAAGAA GAAATCTGCT CAAAGATTGG TATGATTTTT ATGGAAGAAG GATTGAAAAT TACCGTCTGC CAAAACTTGA TTCAGAACGA ACCCAACTCG GCGAGAAAAT CGGAAAAGAT GGTTTCAATT TACTCAATCA AATTTATCAT CATGATAGTC CTGACTGGTT ACGTCATCTC AGAGCAGTAG AAACATTAAG ACAAGTCTGG ATTCAACAAT TCTATGCCCC AGAAATAGAC AGAGTTCAAT TAAGACCTCC AAAGGATATG CCTCCCTCTA CCATAGCCAT CCATTCTCCC TATGACTTGG AGGCTCATTA TTCCTCTAAA CGTAGCGTGA ATTGGGTGGG TTACAAAGTT CATTTAACGG AGATTTGTGA CGAAGATTCT CCTCACTTTA TTACTCAGGT TACTACCACT TTATCTACGG TGACAGATGA AGTAGTAGTT CCATCTATTC ATGAAGCTTT AGAAGAGCAA TCGTTGCTGC CTAATCAGCA TTTAGTTGAT TTAGGTTATA CCCCGGCTGA AAATTTAATT TCATCTCAAA GAGACTACGA TTTAGAATTA ATTGGACCCG TGCGGAGTGA CCCTTCGTGG CAAAGTCGAA ATCACCCAAA ATTTGCTGCT GAAAATTTCA CAATTGATTG GGAGAAAAAA GTGGCGACTT GTCCAAAAGG TCATCAAAGT ATAACATGGA CACAAAAAAA AGATGTGGGA GGACAGCCGA TAATTAGTAT CAGATTTTCG ACCTTCTCTT GTAATAATTG TCGCTCTCGT TCTCGATGTA CCCGTGCTAA AACTGAACCC AGAAAATTAA CCATTCGAGA TCAAAATGAG TATCTGGCTT TAAAAAATCG TAGAGCCGTT CAAAACACTC GTGAATTTCA AGATATTTAT CGAAAAAGAG CCGGAATAGA AGGAACTTTA TCCCAGGGAA TTAGAAAATC CGGTTTACGC CAATCTCGAT ATGTCGGAGA GGCGAAAACT CATTTACAGC ACATTTTTAC GGCAGTTGCG ATTAATTTAT ATCGTCTTGA TAATTGGTTA AATGATATTC CCCTAGCTTC TACTCGTTAT TCTCGTTTTT GTTTTCTTAA GTCCAAAACA GGCTAG
|
Protein sequence | MSLHPKPIPP IPEDTVRVAR AAFRSGNIYL QLRDTLETIY VDEDFADLFS VKGQPAQSPW RLALICIMQY MENLSDRQVA EAVRGRIDWK YVLSLPLEDS GFDYSVLSEF RKRLIEGGSE ELLLNKILEK FKEKGILKKP KQQRTDSTHI LAAIRPLNRL ETLGETMRAA LNSLSVAAPD WLRRNLLKDW YDFYGRRIEN YRLPKLDSER TQLGEKIGKD GFNLLNQIYH HDSPDWLRHL RAVETLRQVW IQQFYAPEID RVQLRPPKDM PPSTIAIHSP YDLEAHYSSK RSVNWVGYKV HLTEICDEDS PHFITQVTTT LSTVTDEVVV PSIHEALEEQ SLLPNQHLVD LGYTPAENLI SSQRDYDLEL IGPVRSDPSW QSRNHPKFAA ENFTIDWEKK VATCPKGHQS ITWTQKKDVG GQPIISIRFS TFSCNNCRSR SRCTRAKTEP RKLTIRDQNE YLALKNRRAV QNTREFQDIY RKRAGIEGTL SQGIRKSGLR QSRYVGEAKT HLQHIFTAVA INLYRLDNWL NDIPLASTRY SRFCFLKSKT G
|
| |
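The gene and protein sequences above are displayed in space-separated blocks of ten characters. The sketch below shows one way to strip that formatting, compute a GC fraction, and translate the coding sequence. It is a minimal illustration, not the database's own method. The helper names are hypothetical, and the codon table is built from the standard NCBI amino-acid string, whose codon assignments are shared by translation table 11 (table 11 differs from the standard code in which start codons it permits). The sequence as listed reads 5' to 3' in the coding orientation (it begins with ATG and ends with TAG), so no reverse complement is applied here.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order for codons enumerated over T, C, A, G.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used for display and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 21 bases of the gene sequence listed above.
print(translate("ATGTCCCTAC ATCCCAAACC A"))  # MSLHPKP
```

The translated prefix matches the first seven residues of the listed protein sequence. Applying `gc_fraction` to the full gene sequence gives a value to compare against the GC content field.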