Gene PCC8801_2997 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_2997 
Symbol 
ID7104489 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp3101442 
End bp3103424 
Gene Length1983 bp 
Protein Length660 aa 
Translation table11 
GC content39% 
IMG OID643476025 
Producthypothetical protein 
Protein accessionYP_002373140 
Protein GI218247769 
COG category 
COG ID 
TIGRFAM ID[TIGR01766] transposase, IS605 OrfB family, central region 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCCACA TAACCGTTGT CCAATGTCGC CTAATCGCCC CAGAAAGCAC CCTACAACAC 
ATCTGGAAAA TGATGGCACA GCAACAAACC CCACTCATTA ACCAACTACT CCACGACATT
AACACCCATC CTGACATCAA CACCTGGTTA ACCGCCAACC AACTCCCCTC AAAACTCGTT
GAAACCCTTG CCCAACCCCT GAAAACCCAA TCCCCCTACC AAGGACTACC AGGACGTTTC
ATCACCTCCG CTATTATCCT TGTCAAAGAA ATGTACGCCT CATGGTTTGC TATTCAAACC
CAAAAACGTC TTTCTCTAGA GGGGAAAAAA CGCTTCCTGA CCATTCTCAA AAGTGACAAA
CAATTAATAC AAGACAGTCA AACCGACTTT CTAACCTTAT GTTATAAAGC CCAACAACTG
CTCAAACGAA CCCAGAACAA ACTTAAACTC GACGAACCTC AACATAGTGA AAAAGCCCAT
TGGTCAATCA TTAACGCCCT TTATCCCGCC TACAACAACG CTAAAACCCC TATATCTCGC
GCAGCTTTTG CCCTTCTTAT CAAAAATAAC GGTCAAGTTC CCGACACCCC GGAAAACCCC
GACTATTACC AACAACGCCG TAAACGCAAA GAAATCCAAA TTAGACGCTT AGAAGAACAA
CTCAAAGCCT CACTCCCCAA AGGTCGTATC CTTGACTCAA AACACTGGGA AAATACTCTT
AAATTAGCCC AAACTCCTAT TACCACTATC GAAGAAATTA CCTCTCTCCA AACCCAACTT
TTACAAAAAT ATTCTCATCT TCCCTTTCCC GTTTTCTATG GAACCAACAC CGACTTAACT
TGGTTTAAAA ACCCTCAAGG TCGCATCTGT GTTAAATTCA ACGGACTCAA TCAATATCCT
TTTCAAATTG CTTGTAATAA ACGACAATAT CCTTGGTTTC AACGCTTTTT TACGGATTAT
CAAAGTTATA AATCCCATAA ACAACAAGTT CCCACAGGAT TAATGGTATT ACGTTCAGCC
CGTCTTCTTT GGCAACCCAC TAATGGTCAA GGAGAACCTT GGAACACCCA TCATCTTAGC
CTTCATTGTG CCATTGATAA CGACCTTTGG ACTATCTCAG GTATTCAACA AGTTAAACAG
CAAAAAATTC TTCAAACCGA GCAAAAAATC GCTAATTTCC ATAGTAAAGC CTTAGAAAAA
GAATTAACCC CTAACCAACA ACAACGACTT AAAGCCAGTC AAACCTCTCT TAACCTATTA
AAAACCTTCG ATATTAATGA ATTTTTTCCC TCAAAATGTT CCCTCTATCA AGGTTCTCCT
GATATCATTT TAGGGGTAAG TATTGGTTTA GAAAACCCTG CTACCATAGC TATTATCAAT
ATTTCTACAC AAGAAATTCT GACCTATCGC ACCACCAAGC AACTCTTAAG TCGAACTCGA
AAAGTTCGCA ATAAAAAGCC TAACTCAAAT AACTCTAATC AAAGTTTATC TTCAGCCTAT
AAACAGATTT CTAATTATGA ATTATTCTTA CAATATCAAC AACAAAAACA TCATAATCAA
CATCAACGAC ATAACGCCCA AATTAATGAT GCAAATAATA ATTACGGTGA AGCAAACTTA
GGATTATATC TTAACCGACT TTTAGCCAAA GCGATTCTTG AACTTGCTCA ACAATATCAA
GTTAGTTTAA TTATTCTTCC CTCATTAAAA AATAAGCGTG AACTCATTGA AAGTGAAATT
CGTGCTAAAG CTGAACTAAA ATATCCTGGT TGTAAGGAAA AACAAGACAG TTACGCAAAA
GATTATCGTA CTAACGTTCA TCAATGGAGT TATCAACAAC TTATCAAATG TATTGAGTCC
AAAGCTGCTC AAATTGGGAT TGATACAGCC ACAGGCAAGC AGATGAATTT AGAAACTTCT
CAAGACCAAG CCAGAAATTT AGTCCTTAAT TTTTGTCAAA AATTCTCCCC AACTCAGGTA
TAA
 
Protein sequence
MTHITVVQCR LIAPESTLQH IWKMMAQQQT PLINQLLHDI NTHPDINTWL TANQLPSKLV 
ETLAQPLKTQ SPYQGLPGRF ITSAIILVKE MYASWFAIQT QKRLSLEGKK RFLTILKSDK
QLIQDSQTDF LTLCYKAQQL LKRTQNKLKL DEPQHSEKAH WSIINALYPA YNNAKTPISR
AAFALLIKNN GQVPDTPENP DYYQQRRKRK EIQIRRLEEQ LKASLPKGRI LDSKHWENTL
KLAQTPITTI EEITSLQTQL LQKYSHLPFP VFYGTNTDLT WFKNPQGRIC VKFNGLNQYP
FQIACNKRQY PWFQRFFTDY QSYKSHKQQV PTGLMVLRSA RLLWQPTNGQ GEPWNTHHLS
LHCAIDNDLW TISGIQQVKQ QKILQTEQKI ANFHSKALEK ELTPNQQQRL KASQTSLNLL
KTFDINEFFP SKCSLYQGSP DIILGVSIGL ENPATIAIIN ISTQEILTYR TTKQLLSRTR
KVRNKKPNSN NSNQSLSSAY KQISNYELFL QYQQQKHHNQ HQRHNAQIND ANNNYGEANL
GLYLNRLLAK AILELAQQYQ VSLIILPSLK NKRELIESEI RAKAELKYPG CKEKQDSYAK
DYRTNVHQWS YQQLIKCIES KAAQIGIDTA TGKQMNLETS QDQARNLVLN FCQKFSPTQV