Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PCC8801_2378 |
Symbol | |
ID | 7104646 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8801 |
Kingdom | Bacteria |
Replicon accession | NC_011726 |
Strand | + |
Start bp | 2447540 |
End bp | 2450590 |
Gene Length | 3051 bp |
Protein Length | 1016 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 643475417 |
Product | WD-40 repeat protein |
Protein accession | YP_002372545 |
Protein GI | 218247174 |
COG category | [R] General function prediction only |
COG ID | [COG2319] FOG: WD40 repeat |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGAATA ACTCTTCTAA AAGCAGTGTT TTTATTTCCT ATTCTCGTCG AGATAAAACC TTTGTTCAAA AACTCCATCA CGCTTTAAAT GAAGGAAAAC GCGATATCTG GGTAGACTGG GAAGATATTG CCCCGACGGA AGATTGGCGA GAAGCGATCG CTCAAGGGAT TCAATCTGCT GACAATTTTT TATTTATTAT TAGTCCTGAT TCGGTTAAGT CTATAGAATG TAATAAGGAA ATTGATCATG CTATTAAATG CAATAAGCGA TTAGTCCCGA TTTTATATCG AGAGGTTGAG GATAATAGTG TTCGACCAGA ACTAGCCAAA CATAACTATA TTTTTTTTCA AAATGAAGAG AGTTTTTTTG AAAATTTAGA GAAACTAGAG CAAGCTTTAG ATACTGATTT AGCTTATATA AAAGAACATA CTTATCTGTT GAGTCGAGCT ATTTTATGGC AACAAAAACA GCGAGATCCT AGTTATTTAT TGCAAGGAAC CGCGCTCAAA GAAGCTCAAG AATGGATTAC CCATAGTCTA AATCAAACGC CTAGACCGAC CCAACTACAC AATGATTATA TTATTACCAG TATTCAAAAA AGTAAACAGT TTTTACGCAG AATTGCTATT GTCGTTGGGG CATTAGGATT AATTGCTTTT GCTTCTTTTT TAGTGGCTTT GTCAGAACGA AATCAGGCTA AAGAAGCAGA ATTAAAAGCT AAGAGTGAAG AAGTTAAAGC CTTAACAGGA TGGGCTCAAG CAAGGCTTTT AAGACATGAA CAATTAGATG CGTTAATTAA TATTATTAGA GCGGTTGATA AACTGAAGGA TTTACCGCAA TCTTCTTCTG AAGAAACCTT TTCTGATTTT GCTCAAATAC CAATTCATGA TCAGGTTGAA CAAACCCTAA GACAAGTGGT TTATACCTTG CAAGAATTAA ATCATTTACA AGCTGATCAA AAAACCATTT ATGATGTACA ATTTTCCCCA GATCATCGAT GGATTGCTTC TGCTAGTGCA GATACTAAAG TCAATCTCTG GAAAAATAAT CAACGACAAA CTAGCTTGCT TCATCAAGGC GTTGTGTGGC GGATTGGGTT TTCTCCTGAT AGTCAAATGA TGGTTTCTGC TAGTGAAGAT AAAACCGTTA AATTGTGGCA ACTTAATCCT CAAGGCAATT GGACTCTTAA GCAAACTTTA ATCCATCCTG TTCCTGTGAG ATCTGTTACC TTTACTTTTA CGGATCAATG TTCACAAACT GGTCAAAAGA TTGCTTCTGC GGGAACTGAT GGTCTGATCC GAATATGGAA TTTAGAAGGA AAATTACAAA GAACCTTTCA AGCGCATACA GGAACAATTA ATGATCTCAA AATTTCTCCC AATTGTCAAA CCCTTGCTAG TGCTAGTGAA GATAGAACGG CTAAATTATG GACGTTGGAT GGACAAAAAA AAGCGACTCT TCTTGGACAT GAAAATCAAG TTTGGACTAT CAATTTTTCT CCCGATGGTC AACGAATTGT GACGGGGAGT TTTGATACGA CGATTAAGCT GTGGGATCAA ACGGGACAAC TACTAAAAAC CCTCGAAGGA CACGCTAATT GGGTCATGAG TGTTATTTTC TCCCGTAATA GTCAAGAAAT TGTTTCCGGT GGTGAGGATG CTATGCTGAA ATTCTGGAGT CGAGAGGGAG ACTTATTTGC CTCTTTATTA AGTCCCCATG GGGATATCGG AAGTATTAAT ATTTCGGCGG ACAATCAATA TTTAGTCTTT ACTGGAGATA GTGGCAAAAT GAGTCTATGG CAGCAGGGAG GAAGTGTCAT TGAAATTCTA CGCGGCCATA CTTCTGGTGT CACGGGGGTT CATTTTTCGC CTGATGGACA ATTAATGGCT TCAGTGAGTA ATGATCAAAC GGTAAAATTG TGGCAATTTG ATCCCCAAGC AAAGCGCATG GAATTGCAGC AAACGCTGGA ATATCGCAAA GGAGAACCCG AAGGAGGACT GAAAAATGTT AATTTCACTC CCGATGGTCA ATATTTGATT ACGACAAGTT ATGACAACAC TTTACAATCT TGGAATGTTA AAAAAGCTTT AACCCATTCT TCTATTCAAG GAGAAATTAT TGCCAAAAAT AATACGGTTG TTAATCGTTT TAGGATTTCA TCTGATGGTA AACGATTGGC GTTGGCAAGT GCAGATGGGA CGATTAAACT GTGGGATCTC AAGTCTCAAA AATTGTTAAA AATTTTGACA ACTAATCAAT CTCCATCCTT GACGAATAAT GGGATCAATC AATGTCAGAA AATTCAGCAA GGATATCCTC CTCAGTCTAC AGATGTGGCG TTTTCAAAGA ATAATCAATA TTTGGTTGCT TCTTATTCTG ATGGTTGCCT AAAACTTTGG AATCTTGAGG GTCAATTGAT TCAAGAATTT CGGGGTCATC CACAATGGAT TAATGCGTTA AGATTTAGTC CTGATGGCCA GTTATTAGCG ACCACGAGTC GAGATAATAC GATTAAGCTT TGGCAGTGGG AAAAAACCCA ATTTAAGATC GATCAACCGA CTAAAATTTT GAAAGGTCAT CAAGACTGGG TTTGGAATGT GGCTTTTACG TCTGATGGAA AGAAATTAGC GTCAGGGGGA AAAGATAACA CGGTTAAACT TTGGAATATT ACTACTCAAT CACCATCGGA TCAATCGGAT CTTATTGTTA CACTTCAAAG TCACATTGAT TGGGTAACAT CCGTTGATTT TAGTCCCTGT AATCAGGATA ATAAAGATTA TCCTAATTGT CATCAAAGGC TTCAATTAGC CTCAGCAAGT GCCGATCAAA CAATTATTTT TTGGAAGATG GAAGAGGTAT TACGGATTGA AACAAAAGAT AATCATGAAA CAGCCTTACA ATCGTTGTTT AAAAAAGGGT GTCAATGGCT TTCTGTGTAC CTGGAAACCA ATCCAGATAC ACCAGAAGCG AGTGATATTC GTTCCGCTTG TGGAGAGACT AAACCTCCGT CGGATCAACC GGGAAATAAA ATTCTATCCC CTGATCAATG A
|
Protein sequence | MTNNSSKSSV FISYSRRDKT FVQKLHHALN EGKRDIWVDW EDIAPTEDWR EAIAQGIQSA DNFLFIISPD SVKSIECNKE IDHAIKCNKR LVPILYREVE DNSVRPELAK HNYIFFQNEE SFFENLEKLE QALDTDLAYI KEHTYLLSRA ILWQQKQRDP SYLLQGTALK EAQEWITHSL NQTPRPTQLH NDYIITSIQK SKQFLRRIAI VVGALGLIAF ASFLVALSER NQAKEAELKA KSEEVKALTG WAQARLLRHE QLDALINIIR AVDKLKDLPQ SSSEETFSDF AQIPIHDQVE QTLRQVVYTL QELNHLQADQ KTIYDVQFSP DHRWIASASA DTKVNLWKNN QRQTSLLHQG VVWRIGFSPD SQMMVSASED KTVKLWQLNP QGNWTLKQTL IHPVPVRSVT FTFTDQCSQT GQKIASAGTD GLIRIWNLEG KLQRTFQAHT GTINDLKISP NCQTLASASE DRTAKLWTLD GQKKATLLGH ENQVWTINFS PDGQRIVTGS FDTTIKLWDQ TGQLLKTLEG HANWVMSVIF SRNSQEIVSG GEDAMLKFWS REGDLFASLL SPHGDIGSIN ISADNQYLVF TGDSGKMSLW QQGGSVIEIL RGHTSGVTGV HFSPDGQLMA SVSNDQTVKL WQFDPQAKRM ELQQTLEYRK GEPEGGLKNV NFTPDGQYLI TTSYDNTLQS WNVKKALTHS SIQGEIIAKN NTVVNRFRIS SDGKRLALAS ADGTIKLWDL KSQKLLKILT TNQSPSLTNN GINQCQKIQQ GYPPQSTDVA FSKNNQYLVA SYSDGCLKLW NLEGQLIQEF RGHPQWINAL RFSPDGQLLA TTSRDNTIKL WQWEKTQFKI DQPTKILKGH QDWVWNVAFT SDGKKLASGG KDNTVKLWNI TTQSPSDQSD LIVTLQSHID WVTSVDFSPC NQDNKDYPNC HQRLQLASAS ADQTIIFWKM EEVLRIETKD NHETALQSLF KKGCQWLSVY LETNPDTPEA SDIRSACGET KPPSDQPGNK ILSPDQ
|
| |