Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PCC8801_4533 |
Symbol | |
ID | 7095912 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8801 |
Kingdom | Bacteria |
Replicon accession | NC_011723 |
Strand | + |
Start bp | 23089 |
End bp | 27300 |
Gene Length | 4212 bp |
Protein Length | 1403 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 643467513 |
Product | hypothetical protein |
Protein accession | YP_002364809 |
Protein GI | 218203956 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 0.0371536 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACTGGA AAAGATTTAC CCGCCGCGAT GCTTGCCCCG TCTGTAACGG AGATCGCCAT GACTGTCGGC AAAATCTCGA CACCCACCTG ATCCACTGTC GAAGCCTCGA AGCCAACCCC CTAGACTACA TTTACCGTGG CCAGGACTCC CTAGGCTTCG GAATGTGGGC ATACAAGCCT GATGCTGACG CTTGGAGTGA AGAACGACGC GAAGAGTGGG AAAAGGAAAA AGAAGCCAGG AAACGGGAAC GGGAAAGGCA AAATCAAGAA GAACTTAAGA AGCTGCTACC GCTCAAAGAA CGGGATCTAG TGATTCGGTC AATCTTAGGT CAGTTAGAAT TAAGCGATTG CCACCGCCAA AGATTAAAAC AAAGGGGGTT AACAGACAGC CAAATTGACG AAGCTAACTA TCGTTCCGTT AAACAATGGC AAAAACTCGA TTATCCTGTC GATAACCGAC TCTCAGGGGT TAATCAATGG GGCAACGGGT TAACTAACAA CACAGACGGG ATTTTAATCC CCATTCCCAA TGAAGACGGA CTATATACAG CATTAAGGGT TAATAACCTT GATACAGATA TTAATGGCCT TGGAAAATAT CTATGGGTAT CATCCAAAAA TAGTCGAGGC ATACCCATCG ATCTCCCCAA TGGAGAACTA CCATTGGCAG TTTATTTCCC CGACAAGCCC TGTCAAACTA ATCAAATAGG TCTTTGTGAG GGAGTAGAGT ATAAGCCTCG AATAGCTGCC AATAGACTAG GGATTCCAGT GATTGGATTT CCTGGGTCGA ATTTTTCTAT TAGCCCAAAA ACTCTTGAAG CATTAATCAA AAAAATATGG ACGCGATGGA TATGTACGAA CTCATCGACG AAATCAAAGG GCGAATCGAA CTCTTTGAGT ATTACGAAAA CGGAAATTTT GCCTTTAACA CCCACGAATG TGGATGGGAA CCAGAACATT TATTCGCACT CTGGGAATCA GAAATCCGAT GGCATTACCC TTCTTGGAGA GCCAGTCTCA CCATTTTCGG ACTATGGTTT GAGCGAAAAG AATTATACAG AGCAATGCTC AATCATCCTG ATTGCTGATG CAGGAGTAGC CATTAATCCA CAAATCTCAA CTAGCCATAC TTCAACTCTG AAAATGATTG AGGGTTGGGG ATATAATGTT TCTTTGATGG ATTGGGGACA ATTAACCAAC AAAGAAGGAT TAGATATTGA TGAGATTGAC CATGAAACCC TAAAAAACAT CAAACTTATC TCTTTAGATA AATTCCGTGA AAAAGTTCGC TTTGAATCCC GAAAATCCTA CCCCAATCAG CTAATTCAAC TGCAAGAAAA GCTGAAAAAC CTAACATATA AACCCGACAT CTTATTGACC CAAGACGACT TAATTGACGG TAAATATCTA CCCACAGAAC TCCTTTTACA GTTAATCCCT AAAACCGGAA TCATTAACCT CAAGTCCTGT AAATCCAGTG GTAAAAGTCA CTTTAACAAA GAATTAATCC AACAAAAAAG ACAAGAAGGC TACAAAATTA TCTCCCTCGT CCCTCGAATT GTTTTAGGGA GAGGACAAGC CAAAGAATGG GGCATACAAT GGGACATTTC AGTAGAAGAT CCCCTCCTGA AAAAAGTCTC TCGTCTGACC CTCTATGAAA ACCAAGAAAC ATTAGGAATA TGTTTTGACT CCCTTTGGAA ATTAGCTGAT AGGGATTTTT CAAAAACCCT AATTATCATT GATGAAGCCG AACTAGGACT CCCCCATTTA TTAACTTCAT CAACTTGTAA AGATAACCGA CCCAAACTCC TAAACACCTT TGACAAGCTC CTCTATAACT CATTAAAATA TCAAGGACTT GTCCTCTTAT CGGATGCAGA CTTAAGCGAC ATCAGTGTTA ATTATGTTAA AGCCTTAGCC CCCGAAAATA CCCCCATTTT TACCATTGTT AATGAAGCCC AAACCGTCAG TTATGACGCG AGTATTTTCT CCCAGAAAAA ATGGATTAAA CAGGAAATCC TCAACGCCAT TGATAATAAT GAAAAGATTC ATATCACCAC CGATTCCCAG AAAGAAGCCG AACAAATGGA CAGGGAATTA AGCAAACAGT ATCCCCAGAA AAAAGTGATC CGAATTGATA GTAAAACCAC CCAAGAAGAT TGGGGAAGAG ACTTTGTAGA AAGGATCAAC GAGTCCCTTA AAAAAGAACG CCCTGACATC TTAATTACTA CCCCCTCAAT GGCAACAGGG ACAAGTATTG ATGGAAAAAT TAATAGTTTT TCATCTTTAG AAGGAACAAA TTTAAGTCAA GAAGGAACAA AGTTAAGTCA GCAAGAGACT CAGTTTAACT TAACCACAAT CCCTAACCCA GAAAGACAAT TAGATGAAAA TCAACCATCA GAAAATAGTC CTTCCTCCTC CTTAAATGAG TCCATTGATT TAGAAGTCAA ACATTGGTTT GATAAAGTCT TTGGTATCTT TTTAGGAGTC TTAACCCCCT CCCAATGCCG TCAAGCATTA ATGAGATATC GCCAACCCGT CCCCCGATAT ATCTATATTA AATCAGTCGG AATGCTCTCA GGTTGTCGCT CATTTTATCC CCAAGAAATC AACCAAAGCT TTCATGAATA CCATGATGAA GGATTAGCCA TAACCGATAT TCTACAAGAC ATAGAGGCTA GTGATCCCTT AGAATTTGCC TTGAAGATCC AAGCCATGGT TAACCCTGAA ACCAAACAAT GGATTAACCC CCATATTGAC CATTATTGTC AATTTAAAGC CAGAGATAAC TATGGATTAG CCAACCTAAG AGAAATCTTC ATAGAAGAAC TCACCCAAGA AGGCCATAGC GTGAGTATAA TAGACATTGA AGACCGAAAA ATTAACTCCA AAACCCAAGA ATTTATCGCA GGAGAAAAAC AGGCAGGAGT TGAAATTGAT CTCGAAGAAG CCACTGCCAT TTCAACCGCA GAAGATATTC CAATAGAACA AGCCCAAGCT ATCAGTAAAA AAGCCAATCC TAGCACCAAA GAACTGCATC AAGCAGCCAA AGCCTTCCTA TTAGAAGAAC TCCCAGGAGT GAAATTAACC CCCGACTTTG TTCTTAAAGC CGTCGTAAAA GATCATCGTC GTTGGTTAAA TCAGGTCAAA CTGTATTGGT ACTTAATGAA TCCCAACGCA GCCATTTACC ATGATAAAAA ACATTATCAG CATAAATTTA AGCAATTTGC GAGCCATGAC TTGGTATTTC TCCCAGACCT GAGAAGTTAC TCCCCTTTAC TCAACGAAAT TGAAACCTTG GGACTATTTA ACGTAATTGA CTTAAATAAC CCTGACCAAG AATATCGCTC AGATGACCCC AAACTTCAAG AATTTAAGCG AAAATGCTGC TACAGAAAAA GACGACTCTC TAATTTATTC AACATAACCG TCAGCAAAGA CTCCCATCCC ATCCATCTTG TCAACCGTTT CCTAGGAAAA TTTGGTTTGG TCTTGAAAGG ACAGCAGAAG CGAATAGAGG GGAAACAAGT ATGGAGCTAT AAACTTGACC TAGACTCCTT AAAAGATCCA GACCGTTTAG CCGTAGAAAA AGCCCTCGAC GCAAAATGGG CTAAAATCCA GACCAAGCAT GGCTTACAGC CTGTCACAGA ATCCCCTCAA TTAACTTATA TAAATACGGA AAGTTCTGTG ACACAAGATC TCCAATTGGA AAATCTACTG GAAAATCTAC TGGAAAATCA GACACAAGAT CTCCTCTTGG GAAATTTACG GGAAAATCAG ACACGAGATC TCCGATTAGG AAATCTACGG GAAAATCAGA TATCGGAGTC GACGGCTTTT AACCCTGAGA CCGTCACTTT CTCCCCCAGG TCAAATCCCA TTCAGAGTGA TAGGGGTCAA ACTCAGTCAC TGACTAACTT AGAGCAAGTA ACAGAACCCA ACGAAATAAC TAATATAAAA CCATGCAGTT CTGTGACAGG TTCTACCAAC GAACCGACCG TTAATCATGA CGTAACTTTC TTTATGGACT TACTCACCAT CTGGGAAAAT GCCAAAATTC GACGATTTAA CACATTTGAG CAGTTACTTG AATTGATTAA TCGCTTAGAA GTTCGATTAA ATCGTTGTTA CCATCAATTA CAACAGCTTT GTCCCGATTT TATCGAGCGA TGGTACAATG CCATCCTTCG ACAAGAGCAA TTAATTGGCT GA
|
Protein sequence | MNWKRFTRRD ACPVCNGDRH DCRQNLDTHL IHCRSLEANP LDYIYRGQDS LGFGMWAYKP DADAWSEERR EEWEKEKEAR KRERERQNQE ELKKLLPLKE RDLVIRSILG QLELSDCHRQ RLKQRGLTDS QIDEANYRSV KQWQKLDYPV DNRLSGVNQW GNGLTNNTDG ILIPIPNEDG LYTALRVNNL DTDINGLGKY LWVSSKNSRG IPIDLPNGEL PLAVYFPDKP CQTNQIGLCE GVEYKPRIAA NRLGIPVIGF PGSNFSISPK TLEALIKKIW TRWICTNSST KSKGESNSLS ITKTEILPLT PTNVDGNQNI YSHSGNQKSD GITLLGEPVS PFSDYGLSEK NYTEQCSIIL IADAGVAINP QISTSHTSTL KMIEGWGYNV SLMDWGQLTN KEGLDIDEID HETLKNIKLI SLDKFREKVR FESRKSYPNQ LIQLQEKLKN LTYKPDILLT QDDLIDGKYL PTELLLQLIP KTGIINLKSC KSSGKSHFNK ELIQQKRQEG YKIISLVPRI VLGRGQAKEW GIQWDISVED PLLKKVSRLT LYENQETLGI CFDSLWKLAD RDFSKTLIII DEAELGLPHL LTSSTCKDNR PKLLNTFDKL LYNSLKYQGL VLLSDADLSD ISVNYVKALA PENTPIFTIV NEAQTVSYDA SIFSQKKWIK QEILNAIDNN EKIHITTDSQ KEAEQMDREL SKQYPQKKVI RIDSKTTQED WGRDFVERIN ESLKKERPDI LITTPSMATG TSIDGKINSF SSLEGTNLSQ EGTKLSQQET QFNLTTIPNP ERQLDENQPS ENSPSSSLNE SIDLEVKHWF DKVFGIFLGV LTPSQCRQAL MRYRQPVPRY IYIKSVGMLS GCRSFYPQEI NQSFHEYHDE GLAITDILQD IEASDPLEFA LKIQAMVNPE TKQWINPHID HYCQFKARDN YGLANLREIF IEELTQEGHS VSIIDIEDRK INSKTQEFIA GEKQAGVEID LEEATAISTA EDIPIEQAQA ISKKANPSTK ELHQAAKAFL LEELPGVKLT PDFVLKAVVK DHRRWLNQVK LYWYLMNPNA AIYHDKKHYQ HKFKQFASHD LVFLPDLRSY SPLLNEIETL GLFNVIDLNN PDQEYRSDDP KLQEFKRKCC YRKRRLSNLF NITVSKDSHP IHLVNRFLGK FGLVLKGQQK RIEGKQVWSY KLDLDSLKDP DRLAVEKALD AKWAKIQTKH GLQPVTESPQ LTYINTESSV TQDLQLENLL ENLLENQTQD LLLGNLRENQ TRDLRLGNLR ENQISESTAF NPETVTFSPR SNPIQSDRGQ TQSLTNLEQV TEPNEITNIK PCSSVTGSTN EPTVNHDVTF FMDLLTIWEN AKIRRFNTFE QLLELINRLE VRLNRCYHQL QQLCPDFIER WYNAILRQEQ LIG
|
| |