Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PCC8801_1533 |
Symbol | |
ID | 7104171 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8801 |
Kingdom | Bacteria |
Replicon accession | NC_011726 |
Strand | + |
Start bp | 1607712 |
End bp | 1608992 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 643474606 |
Product | hypothetical protein |
Protein accession | YP_002371743 |
Protein GI | 218246372 |
COG category | [L] Replication, recombination and repair [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG0758] Predicted Rossmann fold nucleotide-binding protein involved in DNA uptake |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTTTCCC CTGACACTCA AGCTATTTTA CTTCTTTGTG CAAGTTTTGG TCAAAATCGT CAAACCGAAC CCTTTCCTTT AACTCTTGGT GAATATAATA CCCTTGCTGG TTGGTTAAGA GCAGAAAATA TGTTTCCTCA AGACCTACTT AACCCTAATT TTATCAGTCG TCTTTCTCAG TTAACCATAG GTAAATTAGA TTCTAAACGA TTAGTTGCAT TGCTACAAAG AGGGGGATTA TTAGCTTTGA CTGTTGAAAA ATGGCTTAAT CAAGGTTTAT GGATTATTAG TCGTGGAGAT GCTGACTATC CCCTGCGATT AAAACAACAA TTAAAATATT TAGCTCCTCC TATTTTATAT GGAATTGGGA ACAAAGATTT ATTATCAAAA GGGGGGTTAG CTGTTGTTGG TTCTCGTAAT GTGGATCAAG AAGGATTAGA CTATACTTAT CACGTTGTAG AAGCTTGTGC AGAACAAAAT ATTCAAGTGA TTTCAGGAGG TGCAAAAGGG GTTGATCAAG CTTCGATGTT AGGAACGTTA AAAGTAGGAG GTACAGTGAT TGGCGTATTA GCTAATAACT TACTTAAAGC ATCTGTTGAT GGAAAATATC GTACCAGTAT TAAAGAAGGA AAACTAACTT TAATTTCTGC GGTTGATCCT AATGCTTCCT TTCATGTTGG TAACGCTATG AGACGTAATA AATATATCTA TGCTTTGGCT AATTATGGGT TAGTTATTAG TGCTGACTAT AACACAGGTG GAACATGGGC AGGAGCAACA GAAGCTTTAA ATACAATTAA GGATGTCCCT ATTTTAGTGC GAATACAGGG AACAATATCA GAAGGCAATC AACATTTATT AAAACAAGGT GCAAAACCTT TTCCTGAAAC TCCTTGGAAT CGTCCGATTA AAGAATTAAT TGAAACTACT GTATCAGAAT ATAAAAGCAT AGAATTTCGT CAAAATAATA CTCAATTGAA TTTATTTAGT CAGGATAATC ATTCTGTTGT TTCTGACAAT AAAGATGAAC TTACACCGCA AGATCCTGAT ATCTCTTCCC GTGATGATGC TTTGAAATCG GCCTCAGAAA GACTTTATTA TGCTGTTTTA CCTATCATTC TTCAAGAACT AAACCAACCA CAAGATCCGA AATCTTTAGC AACTAATCTA GATGTTCAAG TTGGTCAACT AAGCAAATGG CTAAAAAAAG CAGTTACAGA TAAAAAAGTT ATCAAACAGA CTAAAAATAA CCAAGTTATT TATAAATCAA ATAAAGTATA A
|
Protein sequence | MLSPDTQAIL LLCASFGQNR QTEPFPLTLG EYNTLAGWLR AENMFPQDLL NPNFISRLSQ LTIGKLDSKR LVALLQRGGL LALTVEKWLN QGLWIISRGD ADYPLRLKQQ LKYLAPPILY GIGNKDLLSK GGLAVVGSRN VDQEGLDYTY HVVEACAEQN IQVISGGAKG VDQASMLGTL KVGGTVIGVL ANNLLKASVD GKYRTSIKEG KLTLISAVDP NASFHVGNAM RRNKYIYALA NYGLVISADY NTGGTWAGAT EALNTIKDVP ILVRIQGTIS EGNQHLLKQG AKPFPETPWN RPIKELIETT VSEYKSIEFR QNNTQLNLFS QDNHSVVSDN KDELTPQDPD ISSRDDALKS ASERLYYAVL PIILQELNQP QDPKSLATNL DVQVGQLSKW LKKAVTDKKV IKQTKNNQVI YKSNKV
|
| |