Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PCC8801_2997 |
Symbol | |
ID | 7104489 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8801 |
Kingdom | Bacteria |
Replicon accession | NC_011726 |
Strand | + |
Start bp | 3101442 |
End bp | 3103424 |
Gene Length | 1983 bp |
Protein Length | 660 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 643476025 |
Product | hypothetical protein |
Protein accession | YP_002373140 |
Protein GI | 218247769 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01766] transposase, IS605 OrfB family, central region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCCACA TAACCGTTGT CCAATGTCGC CTAATCGCCC CAGAAAGCAC CCTACAACAC ATCTGGAAAA TGATGGCACA GCAACAAACC CCACTCATTA ACCAACTACT CCACGACATT AACACCCATC CTGACATCAA CACCTGGTTA ACCGCCAACC AACTCCCCTC AAAACTCGTT GAAACCCTTG CCCAACCCCT GAAAACCCAA TCCCCCTACC AAGGACTACC AGGACGTTTC ATCACCTCCG CTATTATCCT TGTCAAAGAA ATGTACGCCT CATGGTTTGC TATTCAAACC CAAAAACGTC TTTCTCTAGA GGGGAAAAAA CGCTTCCTGA CCATTCTCAA AAGTGACAAA CAATTAATAC AAGACAGTCA AACCGACTTT CTAACCTTAT GTTATAAAGC CCAACAACTG CTCAAACGAA CCCAGAACAA ACTTAAACTC GACGAACCTC AACATAGTGA AAAAGCCCAT TGGTCAATCA TTAACGCCCT TTATCCCGCC TACAACAACG CTAAAACCCC TATATCTCGC GCAGCTTTTG CCCTTCTTAT CAAAAATAAC GGTCAAGTTC CCGACACCCC GGAAAACCCC GACTATTACC AACAACGCCG TAAACGCAAA GAAATCCAAA TTAGACGCTT AGAAGAACAA CTCAAAGCCT CACTCCCCAA AGGTCGTATC CTTGACTCAA AACACTGGGA AAATACTCTT AAATTAGCCC AAACTCCTAT TACCACTATC GAAGAAATTA CCTCTCTCCA AACCCAACTT TTACAAAAAT ATTCTCATCT TCCCTTTCCC GTTTTCTATG GAACCAACAC CGACTTAACT TGGTTTAAAA ACCCTCAAGG TCGCATCTGT GTTAAATTCA ACGGACTCAA TCAATATCCT TTTCAAATTG CTTGTAATAA ACGACAATAT CCTTGGTTTC AACGCTTTTT TACGGATTAT CAAAGTTATA AATCCCATAA ACAACAAGTT CCCACAGGAT TAATGGTATT ACGTTCAGCC CGTCTTCTTT GGCAACCCAC TAATGGTCAA GGAGAACCTT GGAACACCCA TCATCTTAGC CTTCATTGTG CCATTGATAA CGACCTTTGG ACTATCTCAG GTATTCAACA AGTTAAACAG CAAAAAATTC TTCAAACCGA GCAAAAAATC GCTAATTTCC ATAGTAAAGC CTTAGAAAAA GAATTAACCC CTAACCAACA ACAACGACTT AAAGCCAGTC AAACCTCTCT TAACCTATTA AAAACCTTCG ATATTAATGA ATTTTTTCCC TCAAAATGTT CCCTCTATCA AGGTTCTCCT GATATCATTT TAGGGGTAAG TATTGGTTTA GAAAACCCTG CTACCATAGC TATTATCAAT ATTTCTACAC AAGAAATTCT GACCTATCGC ACCACCAAGC AACTCTTAAG TCGAACTCGA AAAGTTCGCA ATAAAAAGCC TAACTCAAAT AACTCTAATC AAAGTTTATC TTCAGCCTAT AAACAGATTT CTAATTATGA ATTATTCTTA CAATATCAAC AACAAAAACA TCATAATCAA CATCAACGAC ATAACGCCCA AATTAATGAT GCAAATAATA ATTACGGTGA AGCAAACTTA GGATTATATC TTAACCGACT TTTAGCCAAA GCGATTCTTG AACTTGCTCA ACAATATCAA GTTAGTTTAA TTATTCTTCC CTCATTAAAA AATAAGCGTG AACTCATTGA AAGTGAAATT CGTGCTAAAG CTGAACTAAA ATATCCTGGT TGTAAGGAAA AACAAGACAG TTACGCAAAA GATTATCGTA CTAACGTTCA TCAATGGAGT TATCAACAAC TTATCAAATG TATTGAGTCC AAAGCTGCTC AAATTGGGAT TGATACAGCC ACAGGCAAGC AGATGAATTT AGAAACTTCT CAAGACCAAG CCAGAAATTT AGTCCTTAAT TTTTGTCAAA AATTCTCCCC AACTCAGGTA TAA
|
Protein sequence | MTHITVVQCR LIAPESTLQH IWKMMAQQQT PLINQLLHDI NTHPDINTWL TANQLPSKLV ETLAQPLKTQ SPYQGLPGRF ITSAIILVKE MYASWFAIQT QKRLSLEGKK RFLTILKSDK QLIQDSQTDF LTLCYKAQQL LKRTQNKLKL DEPQHSEKAH WSIINALYPA YNNAKTPISR AAFALLIKNN GQVPDTPENP DYYQQRRKRK EIQIRRLEEQ LKASLPKGRI LDSKHWENTL KLAQTPITTI EEITSLQTQL LQKYSHLPFP VFYGTNTDLT WFKNPQGRIC VKFNGLNQYP FQIACNKRQY PWFQRFFTDY QSYKSHKQQV PTGLMVLRSA RLLWQPTNGQ GEPWNTHHLS LHCAIDNDLW TISGIQQVKQ QKILQTEQKI ANFHSKALEK ELTPNQQQRL KASQTSLNLL KTFDINEFFP SKCSLYQGSP DIILGVSIGL ENPATIAIIN ISTQEILTYR TTKQLLSRTR KVRNKKPNSN NSNQSLSSAY KQISNYELFL QYQQQKHHNQ HQRHNAQIND ANNNYGEANL GLYLNRLLAK AILELAQQYQ VSLIILPSLK NKRELIESEI RAKAELKYPG CKEKQDSYAK DYRTNVHQWS YQQLIKCIES KAAQIGIDTA TGKQMNLETS QDQARNLVLN FCQKFSPTQV
|
| |