Gene PCC8801_4533 details


Gene Information

Locus tag: PCC8801_4533
Symbol:
ID: 7095912
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 8801
Kingdom: Bacteria
Replicon accession: NC_011723
Strand:
Start bp: 23089
End bp: 27300
Gene Length: 4212 bp
Protein Length: 1403 aa
Translation table: 11
GC content: 41%
IMG OID: 643467513
Product: hypothetical protein
Protein accession: YP_002364809
Protein GI: 218203956
COG category:
COG ID:
TIGRFAM ID:
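
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; both assumptions are consistent with the values listed here but are not stated by the record itself. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming the CDS ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(23089, 27300)
print(length)                  # 4212
print(protein_length(length))  # 1403
```

Both results agree with the Gene Length and Protein Length fields of the record.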


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value: 0.0371536
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 44
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACTGGA AAAGATTTAC CCGCCGCGAT GCTTGCCCCG TCTGTAACGG AGATCGCCAT 
GACTGTCGGC AAAATCTCGA CACCCACCTG ATCCACTGTC GAAGCCTCGA AGCCAACCCC
CTAGACTACA TTTACCGTGG CCAGGACTCC CTAGGCTTCG GAATGTGGGC ATACAAGCCT
GATGCTGACG CTTGGAGTGA AGAACGACGC GAAGAGTGGG AAAAGGAAAA AGAAGCCAGG
AAACGGGAAC GGGAAAGGCA AAATCAAGAA GAACTTAAGA AGCTGCTACC GCTCAAAGAA
CGGGATCTAG TGATTCGGTC AATCTTAGGT CAGTTAGAAT TAAGCGATTG CCACCGCCAA
AGATTAAAAC AAAGGGGGTT AACAGACAGC CAAATTGACG AAGCTAACTA TCGTTCCGTT
AAACAATGGC AAAAACTCGA TTATCCTGTC GATAACCGAC TCTCAGGGGT TAATCAATGG
GGCAACGGGT TAACTAACAA CACAGACGGG ATTTTAATCC CCATTCCCAA TGAAGACGGA
CTATATACAG CATTAAGGGT TAATAACCTT GATACAGATA TTAATGGCCT TGGAAAATAT
CTATGGGTAT CATCCAAAAA TAGTCGAGGC ATACCCATCG ATCTCCCCAA TGGAGAACTA
CCATTGGCAG TTTATTTCCC CGACAAGCCC TGTCAAACTA ATCAAATAGG TCTTTGTGAG
GGAGTAGAGT ATAAGCCTCG AATAGCTGCC AATAGACTAG GGATTCCAGT GATTGGATTT
CCTGGGTCGA ATTTTTCTAT TAGCCCAAAA ACTCTTGAAG CATTAATCAA AAAAATATGG
ACGCGATGGA TATGTACGAA CTCATCGACG AAATCAAAGG GCGAATCGAA CTCTTTGAGT
ATTACGAAAA CGGAAATTTT GCCTTTAACA CCCACGAATG TGGATGGGAA CCAGAACATT
TATTCGCACT CTGGGAATCA GAAATCCGAT GGCATTACCC TTCTTGGAGA GCCAGTCTCA
CCATTTTCGG ACTATGGTTT GAGCGAAAAG AATTATACAG AGCAATGCTC AATCATCCTG
ATTGCTGATG CAGGAGTAGC CATTAATCCA CAAATCTCAA CTAGCCATAC TTCAACTCTG
AAAATGATTG AGGGTTGGGG ATATAATGTT TCTTTGATGG ATTGGGGACA ATTAACCAAC
AAAGAAGGAT TAGATATTGA TGAGATTGAC CATGAAACCC TAAAAAACAT CAAACTTATC
TCTTTAGATA AATTCCGTGA AAAAGTTCGC TTTGAATCCC GAAAATCCTA CCCCAATCAG
CTAATTCAAC TGCAAGAAAA GCTGAAAAAC CTAACATATA AACCCGACAT CTTATTGACC
CAAGACGACT TAATTGACGG TAAATATCTA CCCACAGAAC TCCTTTTACA GTTAATCCCT
AAAACCGGAA TCATTAACCT CAAGTCCTGT AAATCCAGTG GTAAAAGTCA CTTTAACAAA
GAATTAATCC AACAAAAAAG ACAAGAAGGC TACAAAATTA TCTCCCTCGT CCCTCGAATT
GTTTTAGGGA GAGGACAAGC CAAAGAATGG GGCATACAAT GGGACATTTC AGTAGAAGAT
CCCCTCCTGA AAAAAGTCTC TCGTCTGACC CTCTATGAAA ACCAAGAAAC ATTAGGAATA
TGTTTTGACT CCCTTTGGAA ATTAGCTGAT AGGGATTTTT CAAAAACCCT AATTATCATT
GATGAAGCCG AACTAGGACT CCCCCATTTA TTAACTTCAT CAACTTGTAA AGATAACCGA
CCCAAACTCC TAAACACCTT TGACAAGCTC CTCTATAACT CATTAAAATA TCAAGGACTT
GTCCTCTTAT CGGATGCAGA CTTAAGCGAC ATCAGTGTTA ATTATGTTAA AGCCTTAGCC
CCCGAAAATA CCCCCATTTT TACCATTGTT AATGAAGCCC AAACCGTCAG TTATGACGCG
AGTATTTTCT CCCAGAAAAA ATGGATTAAA CAGGAAATCC TCAACGCCAT TGATAATAAT
GAAAAGATTC ATATCACCAC CGATTCCCAG AAAGAAGCCG AACAAATGGA CAGGGAATTA
AGCAAACAGT ATCCCCAGAA AAAAGTGATC CGAATTGATA GTAAAACCAC CCAAGAAGAT
TGGGGAAGAG ACTTTGTAGA AAGGATCAAC GAGTCCCTTA AAAAAGAACG CCCTGACATC
TTAATTACTA CCCCCTCAAT GGCAACAGGG ACAAGTATTG ATGGAAAAAT TAATAGTTTT
TCATCTTTAG AAGGAACAAA TTTAAGTCAA GAAGGAACAA AGTTAAGTCA GCAAGAGACT
CAGTTTAACT TAACCACAAT CCCTAACCCA GAAAGACAAT TAGATGAAAA TCAACCATCA
GAAAATAGTC CTTCCTCCTC CTTAAATGAG TCCATTGATT TAGAAGTCAA ACATTGGTTT
GATAAAGTCT TTGGTATCTT TTTAGGAGTC TTAACCCCCT CCCAATGCCG TCAAGCATTA
ATGAGATATC GCCAACCCGT CCCCCGATAT ATCTATATTA AATCAGTCGG AATGCTCTCA
GGTTGTCGCT CATTTTATCC CCAAGAAATC AACCAAAGCT TTCATGAATA CCATGATGAA
GGATTAGCCA TAACCGATAT TCTACAAGAC ATAGAGGCTA GTGATCCCTT AGAATTTGCC
TTGAAGATCC AAGCCATGGT TAACCCTGAA ACCAAACAAT GGATTAACCC CCATATTGAC
CATTATTGTC AATTTAAAGC CAGAGATAAC TATGGATTAG CCAACCTAAG AGAAATCTTC
ATAGAAGAAC TCACCCAAGA AGGCCATAGC GTGAGTATAA TAGACATTGA AGACCGAAAA
ATTAACTCCA AAACCCAAGA ATTTATCGCA GGAGAAAAAC AGGCAGGAGT TGAAATTGAT
CTCGAAGAAG CCACTGCCAT TTCAACCGCA GAAGATATTC CAATAGAACA AGCCCAAGCT
ATCAGTAAAA AAGCCAATCC TAGCACCAAA GAACTGCATC AAGCAGCCAA AGCCTTCCTA
TTAGAAGAAC TCCCAGGAGT GAAATTAACC CCCGACTTTG TTCTTAAAGC CGTCGTAAAA
GATCATCGTC GTTGGTTAAA TCAGGTCAAA CTGTATTGGT ACTTAATGAA TCCCAACGCA
GCCATTTACC ATGATAAAAA ACATTATCAG CATAAATTTA AGCAATTTGC GAGCCATGAC
TTGGTATTTC TCCCAGACCT GAGAAGTTAC TCCCCTTTAC TCAACGAAAT TGAAACCTTG
GGACTATTTA ACGTAATTGA CTTAAATAAC CCTGACCAAG AATATCGCTC AGATGACCCC
AAACTTCAAG AATTTAAGCG AAAATGCTGC TACAGAAAAA GACGACTCTC TAATTTATTC
AACATAACCG TCAGCAAAGA CTCCCATCCC ATCCATCTTG TCAACCGTTT CCTAGGAAAA
TTTGGTTTGG TCTTGAAAGG ACAGCAGAAG CGAATAGAGG GGAAACAAGT ATGGAGCTAT
AAACTTGACC TAGACTCCTT AAAAGATCCA GACCGTTTAG CCGTAGAAAA AGCCCTCGAC
GCAAAATGGG CTAAAATCCA GACCAAGCAT GGCTTACAGC CTGTCACAGA ATCCCCTCAA
TTAACTTATA TAAATACGGA AAGTTCTGTG ACACAAGATC TCCAATTGGA AAATCTACTG
GAAAATCTAC TGGAAAATCA GACACAAGAT CTCCTCTTGG GAAATTTACG GGAAAATCAG
ACACGAGATC TCCGATTAGG AAATCTACGG GAAAATCAGA TATCGGAGTC GACGGCTTTT
AACCCTGAGA CCGTCACTTT CTCCCCCAGG TCAAATCCCA TTCAGAGTGA TAGGGGTCAA
ACTCAGTCAC TGACTAACTT AGAGCAAGTA ACAGAACCCA ACGAAATAAC TAATATAAAA
CCATGCAGTT CTGTGACAGG TTCTACCAAC GAACCGACCG TTAATCATGA CGTAACTTTC
TTTATGGACT TACTCACCAT CTGGGAAAAT GCCAAAATTC GACGATTTAA CACATTTGAG
CAGTTACTTG AATTGATTAA TCGCTTAGAA GTTCGATTAA ATCGTTGTTA CCATCAATTA
CAACAGCTTT GTCCCGATTT TATCGAGCGA TGGTACAATG CCATCCTTCG ACAAGAGCAA
TTAATTGGCT GA
 
Protein sequence
MNWKRFTRRD ACPVCNGDRH DCRQNLDTHL IHCRSLEANP LDYIYRGQDS LGFGMWAYKP 
DADAWSEERR EEWEKEKEAR KRERERQNQE ELKKLLPLKE RDLVIRSILG QLELSDCHRQ
RLKQRGLTDS QIDEANYRSV KQWQKLDYPV DNRLSGVNQW GNGLTNNTDG ILIPIPNEDG
LYTALRVNNL DTDINGLGKY LWVSSKNSRG IPIDLPNGEL PLAVYFPDKP CQTNQIGLCE
GVEYKPRIAA NRLGIPVIGF PGSNFSISPK TLEALIKKIW TRWICTNSST KSKGESNSLS
ITKTEILPLT PTNVDGNQNI YSHSGNQKSD GITLLGEPVS PFSDYGLSEK NYTEQCSIIL
IADAGVAINP QISTSHTSTL KMIEGWGYNV SLMDWGQLTN KEGLDIDEID HETLKNIKLI
SLDKFREKVR FESRKSYPNQ LIQLQEKLKN LTYKPDILLT QDDLIDGKYL PTELLLQLIP
KTGIINLKSC KSSGKSHFNK ELIQQKRQEG YKIISLVPRI VLGRGQAKEW GIQWDISVED
PLLKKVSRLT LYENQETLGI CFDSLWKLAD RDFSKTLIII DEAELGLPHL LTSSTCKDNR
PKLLNTFDKL LYNSLKYQGL VLLSDADLSD ISVNYVKALA PENTPIFTIV NEAQTVSYDA
SIFSQKKWIK QEILNAIDNN EKIHITTDSQ KEAEQMDREL SKQYPQKKVI RIDSKTTQED
WGRDFVERIN ESLKKERPDI LITTPSMATG TSIDGKINSF SSLEGTNLSQ EGTKLSQQET
QFNLTTIPNP ERQLDENQPS ENSPSSSLNE SIDLEVKHWF DKVFGIFLGV LTPSQCRQAL
MRYRQPVPRY IYIKSVGMLS GCRSFYPQEI NQSFHEYHDE GLAITDILQD IEASDPLEFA
LKIQAMVNPE TKQWINPHID HYCQFKARDN YGLANLREIF IEELTQEGHS VSIIDIEDRK
INSKTQEFIA GEKQAGVEID LEEATAISTA EDIPIEQAQA ISKKANPSTK ELHQAAKAFL
LEELPGVKLT PDFVLKAVVK DHRRWLNQVK LYWYLMNPNA AIYHDKKHYQ HKFKQFASHD
LVFLPDLRSY SPLLNEIETL GLFNVIDLNN PDQEYRSDDP KLQEFKRKCC YRKRRLSNLF
NITVSKDSHP IHLVNRFLGK FGLVLKGQQK RIEGKQVWSY KLDLDSLKDP DRLAVEKALD
AKWAKIQTKH GLQPVTESPQ LTYINTESSV TQDLQLENLL ENLLENQTQD LLLGNLRENQ
TRDLRLGNLR ENQISESTAF NPETVTFSPR SNPIQSDRGQ TQSLTNLEQV TEPNEITNIK
PCSSVTGSTN EPTVNHDVTF FMDLLTIWEN AKIRRFNTFE QLLELINRLE VRLNRCYHQL
QQLCPDFIER WYNAILRQEQ LIG
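
As a consistency check between the two sequences, the gene sequence can be joined from its 10-character blocks and translated. The sketch below is a minimal illustration, not the database's own method. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in their permitted start codons), and it translates only the first line of the gene sequence. The helper names `join_blocks` and `translate` are hypothetical.

```python
from itertools import product

# Standard codon-to-amino-acid assignments in TCAG order; translation
# table 11 uses the same assignments ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def join_blocks(text: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(text.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_line = "ATGAACTGGA AAAGATTTAC CCGCCGCGAT GCTTGCCCCG TCTGTAACGG AGATCGCCAT"
print(translate(join_blocks(first_line)))  # MNWKRFTRRDACPVCNGDRH
```

The output matches the first 20 residues of the protein sequence listed in this record. Applying the same functions to the full gene sequence would allow the whole protein to be compared.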