Gene PCC8801_4558 details

Gene Information

Locus tag: PCC8801_4558
Symbol:
ID: 7106058
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 8801
Kingdom: Bacteria
Replicon accession: NC_011727
Strand:
Start bp: 737
End bp: 4258
Gene Length: 3522 bp
Protein Length: 1173 aa
Translation table: 11
GC content: 46%
IMG OID: 643477428
Product: hypothetical protein
Protein accession: YP_002374526
Protein GI: 218249156
COG category:
COG ID:
TIGRFAM ID:
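
The coordinate and length fields above are related by simple arithmetic, which can serve as a sanity check on the record. The following is a minimal sketch in Python, assuming the start and end coordinates are 1-based and inclusive and that the protein length does not count the stop codon. Both are assumptions about this database's conventions, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of codons minus one, assuming the stop codon is not counted."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(737, 4258)
print(length, protein_length_aa(length))  # 3522 1173
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.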


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGACCAATC CCACCTCAAT CCGTACTGAG TTCATCACCG AGTCGAAGAT CGCCCCTAAC 
CTCTTCGAGA AAGCGGTGAC GTTCCTCCCC GACCTGGAAA TCAACCACGT CACCCACGAA
GTCGAGGGAA CGCCCCTTTA CGATGCCCTG GGCTGGCCAT ACACCCGCTT TGGTCATCAA
GCCAAACCGA ATCTATTGGG AGCAGCCTTT ATTCAAGAAA CGGGGGAACC CTGGCAATGC
AAAATCTATG GAGAGTTAAA TAAACCCCGT CAGGACAAGG AGACAAGGGG ACAAGGGGGA
CAAGGAGACA AGGAGGACAA GAAGGACTTT TGTGGAGAAC AATCTCCCGT GTCTCCCGTG
TCTCCCGTGT CTCCCCCATC CGAAAAAAGA ACAGGACAAT ACTACGCCCC CAAAGGAATC
GGAGACGTTC CTTACCTCCC CCCCGTCCCC AGAGAATTTA TTGAGAAGCT CGCCCTAGAA
TACGACTTAC CCCTACCCCC CGAAAATACC TCCTTCTGGC AGTGGTTTCA GGAAAATAAG
GTTATCCCCC TACTCCTAAC CGAAGGGGGC AAGAAGAGCC TTTCAGCGTT GTCATCAGAC
ATCGTTGCCA TTGCCCTCTA TGGCTGTCTA TGTGGCGTTG ATTCAGAAAC CGGAGCCGTT
AAAGAATCTT TACGTCCGTA CATTGAAGGG CGGCGGGTCT ACATTGCCTT TGACCAAGAC
AAAAAACCCA AAACACGCCA AACCGTGGCC AAAGCCACCC AGAAACTCGC TTTAGCCATC
ACTAAAGCCT CGGGCGAGCC TTACCAAGTC CTCTGGAATA CCGAAGACGG CAAAGGCGTT
GACGATGTTA TTAAAAATCA AGGCGGAGAT TACTTCAAAC ACCGGATTAA ATTAGCCCAA
CTCATTGACC CCATTCAACA ACAATTAAAC ACCCTTACCC GTGACATTGA CTTAACCGTT
AATCAAGCCA ATTTAGAAGG AATCCTCAAA CACCTTCCCC GCACCGGAAA ACTCCACATC
AGTTCCCCTA AATGTACCCG CAAATCCTCA GCCATCATTG AGCCATTAGT CAAAGAATGG
AAACAGAAAA AACAATTAGT CATCTCCATC GTTCCCCGCA TCCTCTTAGG CAAAGAACAA
GCCGTCAGGT GGGACATTAA CTGGATAGAC GAGTATGGCA AGGCTCATTT TGAATCTTAC
GATACCATTG GGCTCTGTTT TGATTCCCTT GGGAAACTCT CCTCTAAAAA TTGGTCAGGA
GCTTTAATTC TCTTTGATGA GATTCGCCAA GGATTCAAGC ATTTTATCTC CTCTCCCACC
CTAGAAGAGA GACGCTCTTT TCTCCTTAAG CTTCTCCAAG AAAAGTTACC CCAAGCCGTT
AACGGGGGCG GGTTAATTGT GGGCTGTGAC TCCGATTTAA CCGATGTAGA AATCAACTAC
ATTGACGAAA TTTGCCCCGT CGGAAAGACC TTTATTGTTA AAAACGAGTA TCAACCCAAA
AAAGGATTAG TCATCTTTAA TACAGGTAAA TATGATGAAA CCCTGGATGA GATTCTCCAC
CGCTATGAAA ATGGCGAGAA CTTATTTATC TTCTGTGACA CCAAAGCGAA CAGTCAAGCC
ATTCATGACC GCTTAAAACA ACTAGATCCT CACGCCACCC ATTGGTTATT AAATGGCGAT
ACAACCTCAG AGGCAGAAAA CAAAGCCATC ATCGAGAATA ATATTAATGA CTCCCTCAAA
CAACAAAAGC CAAGAAGTTT AGTCTTTACC ACTTCCATGA GTACAGGCAT CAGTATCGAT
GGCTGGATAA ACCATCAATT TCATGCTGAG GTCTTTGACC ACTTTACCTA TGGCTTTGTC
ATTGCCCAAG GGGGAATCTT AGAACCCGTC GAAATCACTC AGAGTATGGC ACGGGTCAGA
AAGAATCTAG ATTTTACCGT TTATTCAGGC ACAGGGAAAA AGCCAGATGA CCTCTTAAAC
TCCTGTCATC CTGATGTCAT TAAACGACAA ATTTATAAGC GAAATCATCG AGCCTTTGAC
CTGCAACTCA TCACCGCCGA AATCTTAGAA GAAAAACTCG GAAGGGAACC CACCCACTAT
GAAATTCTGG CAGAAATGAT GAGCAAATGT GACCCCGAAA CCGGGATGGT CATTGACCCC
CATCTCGACT TATATTGTCG AGCTAAAGCT CGGTTAAACT ATGCTTCACA AAACTTTGAC
CTGATGCTCT ATCAACAATT AATTGATGAG GGGTATCAAT TAGCTCGATA CGATTGCCTA
GAAACCACTT CGACGGGGAA TCAAATCAGA GAAATCAAAG AAGCGCACAA ATGGGAGGAA
GCTGAGGCCA CGGCAGACTC ATTAGATATT TGTATAGAAG AGGCGATAGA GATTAGTTAT
TCAAATGCGC CGGTCGAACA ACGTCGTCAG GCCGCCAAGG CCTTCTTAAA ACAGGAACTC
CCCGGAGTGG AACTCACCCC CGACTTTATT TATAAGGCCG TTATTAAAGA CAAACGACGC
TGGTTGAGGG GACATAAACT CTTTTGGTAT TGTCAACATC CTGACATAAC CAGGGAGATA
GATCTGGGGC ATTATCTTAA CAAAATTAAA CAATTTGCCA ATGGGGTCAT CTTTTTGCCA
GATCTTCGTA ACGTGAGTGT TATGGTAGAT GAAATTAATG AATTGGGACT ATGGGAGTTA
ATTGACCTAG AGAACCCCAG AGAATTATCC AAAGATGACC CCAGGGTGCT CTCGTTTATG
GAGAGGGCTT ATTTTAGACG CTATAAACTC TATAACGCCC TGGGATTAAC CGTGACTGAA
AAAACCGACT CTATTAAATT CATTGAACGG TTGCTGCAAC GGTTAGGGTT AGGATTAGTC
TTGACCCGAA CTGAAAAACA GGGTGAGAAG AAAATCCGAT ATTATTCTCT GAATACAGAA
GCATTGAATG ACCCCGATAG GGTAGCGGTA TTGGAGGCAC TTACCCGACG ATTTCTCGGA
GCAGTGCCAA CTAACTATCA ACTAGCTGAA AAGCTCCCCC CATCTGTCAT AGAGTCGGGG
ACAGGACTCT CTCAAGAGTA TATACAAAAA AGGAGTCCTG TCCCCCCAGA CTTAGTACAC
AAGGAGTCAC TAGGGACAGG ACACGCTAAA GAGTGTATAC AAAAGAGAAG TTCTGTCCCC
CCAGACTTGG TACACAAGGA GTCGCTAGGG ACAAGAGACG CTAAAGAGTG TATACAAAAG
AGGAGTCCTG TCCCCTGTGC GAAGGGTCAA ACAAAGGGCT CTCTGAGCCA AAGGCCAGGG
GCACAAACGT TTAAGCTACC CATGAATCGA GGGTTTGCTC ACTGGTGTTC CTTTAAGTAC
GGGGAGGTCA TTTATGATTC TCGTCAAATA GTCCCTAGTC CCTTGTGGGT AATTGGTTCC
TCGGAGAGTG AGGGGTGTGG GCTGATCGTA GCGTCAGAAG ATGGCATAGA GATCCTTCCC
TATCGTTATG GGGTGCGGTG GCCATCTCAA CAAGTAAGTT GA
 
Protein sequence
MTNPTSIRTE FITESKIAPN LFEKAVTFLP DLEINHVTHE VEGTPLYDAL GWPYTRFGHQ 
AKPNLLGAAF IQETGEPWQC KIYGELNKPR QDKETRGQGG QGDKEDKKDF CGEQSPVSPV
SPVSPPSEKR TGQYYAPKGI GDVPYLPPVP REFIEKLALE YDLPLPPENT SFWQWFQENK
VIPLLLTEGG KKSLSALSSD IVAIALYGCL CGVDSETGAV KESLRPYIEG RRVYIAFDQD
KKPKTRQTVA KATQKLALAI TKASGEPYQV LWNTEDGKGV DDVIKNQGGD YFKHRIKLAQ
LIDPIQQQLN TLTRDIDLTV NQANLEGILK HLPRTGKLHI SSPKCTRKSS AIIEPLVKEW
KQKKQLVISI VPRILLGKEQ AVRWDINWID EYGKAHFESY DTIGLCFDSL GKLSSKNWSG
ALILFDEIRQ GFKHFISSPT LEERRSFLLK LLQEKLPQAV NGGGLIVGCD SDLTDVEINY
IDEICPVGKT FIVKNEYQPK KGLVIFNTGK YDETLDEILH RYENGENLFI FCDTKANSQA
IHDRLKQLDP HATHWLLNGD TTSEAENKAI IENNINDSLK QQKPRSLVFT TSMSTGISID
GWINHQFHAE VFDHFTYGFV IAQGGILEPV EITQSMARVR KNLDFTVYSG TGKKPDDLLN
SCHPDVIKRQ IYKRNHRAFD LQLITAEILE EKLGREPTHY EILAEMMSKC DPETGMVIDP
HLDLYCRAKA RLNYASQNFD LMLYQQLIDE GYQLARYDCL ETTSTGNQIR EIKEAHKWEE
AEATADSLDI CIEEAIEISY SNAPVEQRRQ AAKAFLKQEL PGVELTPDFI YKAVIKDKRR
WLRGHKLFWY CQHPDITREI DLGHYLNKIK QFANGVIFLP DLRNVSVMVD EINELGLWEL
IDLENPRELS KDDPRVLSFM ERAYFRRYKL YNALGLTVTE KTDSIKFIER LLQRLGLGLV
LTRTEKQGEK KIRYYSLNTE ALNDPDRVAV LEALTRRFLG AVPTNYQLAE KLPPSVIESG
TGLSQEYIQK RSPVPPDLVH KESLGTGHAK ECIQKRSSVP PDLVHKESLG TRDAKECIQK
RSPVPCAKGQ TKGSLSQRPG AQTFKLPMNR GFAHWCSFKY GEVIYDSRQI VPSPLWVIGS
SESEGCGLIV ASEDGIEILP YRYGVRWPSQ QVS
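
The sequences above are displayed in space-separated blocks of 10 characters. The following is a minimal sketch in Python of collapsing that layout into one contiguous string and computing the GC fraction of a nucleotide sequence. The helper names are hypothetical, and the example input is only the first displayed line of the gene sequence, not the whole gene.

```python
def normalize_sequence(block_text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(block_text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not dna:
        return 0.0
    gc_count = sum(1 for base in dna if base in "GC")
    return gc_count / len(dna)


# First displayed line of the gene sequence, used only as sample input.
first_line = (
    "GTGACCAATC CCACCTCAAT CCGTACTGAG TTCATCACCG AGTCGAAGAT CGCCCCTAAC"
)
sequence = normalize_sequence(first_line)
print(len(sequence), round(gc_fraction(sequence), 2))
```

Applied to the full gene sequence, the same two helpers give a value that can be compared with the GC content field of the record.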