Gene PCC8801_2137 details

Gene Information

Locus tag: PCC8801_2137
Symbol:
ID: 7103400
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 8801
Kingdom: Bacteria
Replicon accession: NC_011726
Strand:
Start bp: 2210839
End bp: 2213096
Gene Length: 2258 bp
Protein Length: 752 aa
Translation table: 11
GC content: 47%
IMG OID: 643475194
Product: glycoside hydrolase starch-binding
Protein accession: YP_002372325
Protein GI: 218246954
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2755] Lysophospholipase L1 and related esterases
TIGRFAM ID:
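
The Gene Length field can be cross-checked against the Start bp and End bp values in the record. The sketch below assumes the coordinates are 1-based and inclusive (the record does not state this explicitly); the function name `span_length` is hypothetical and used only for illustration. It also prints the remainder after division by 3, since the listed length is not a whole number of codons; the record does not explain why, though it may relate to the "Is gene spliced" flag.

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
start_bp = 2210839
end_bp = 2213096


def span_length(start: int, end: int) -> int:
    """Return the length of a 1-based, inclusive coordinate interval."""
    return end - start + 1


gene_length = span_length(start_bp, end_bp)
print(gene_length)      # 2258, matching the Gene Length field
print(gene_length % 3)  # 2, so the span is not a whole number of codons
```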


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid hitchhiking: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTATCGAT TTCAGATAAC GGCACACACG CAGGTAGGAG AATCCATCGC TATTGTGGGG 
AACATCCCTG AGTTTGGCGA ATGGGATGTC ACCAAATGCT TAGAACTACG CACCAGTGGC
GATCGCTATC CTCTGTGGTG GGTAGAGACG GATATCGACC TCAGTCCATT TTTAGATCCC
GCCAACGACC AAAGAATCGA GTATAAATAT GTGCGATTAT ACCCCGATGA AGGCGTGGAA
TGGGAAACCC AGGGGTCAAA CCGTTGGCTA CCCCTTGACC CTCAGCCGGG GTCTTTTACC
ATCACCGTAG AAGATGGCCA ATTTGGTCGC GTACAACCGT GGCCCTTTGG TTACTGGGAT
GCACCGAGGA CACCTCTACC CAAAGCTAAA GACGGACTAA AAATTGTGGT CATCGGCAGT
TCCGTCGCTG AAGGATACAA CGCTTGGCTC TTTAAAGGGT GGGTTTGGCG GTTAGAACAA
GCCTTAAATG CAAAATACGG ACATCAGGTG GTCAATGTTT CCCAGTTAGG AACGAACATT
ACGACCACTA TGGAACGGTT TTCTCGCGTT GTTCCTCCCG AAAAGCCCGA TATTGTCATT
ATTTCCCTCT CTCTGGGCAA TGAAGGACTG GCCTATTGTC CTCCCCACGA ACGACCAGCC
GTTCAGCGAC GCTTTGAAAC CGGATTACAG GAACTGGTCA AAATGACCCA AGACTTGGGA
GCCATGCCCA TGTTAGGAGC AGTCTATCCC CACGGAGACT ATACCCCTGA CCATAACTGG
TTCCTACAGG ATACTCACCA GCGAATGCGA AGCTGGGGGA TTCCCCTTCT GAATTGGTTA
GCCGCCTTAA ATAACGGCCA AGGTCGCTGG AAACCGGGAA TCTCCTTTGA ACCCCCTCAC
CCGAATACGG AAGGACACCG TCTGATGTAT GAAGCGATCG ATCTGAGTCT CTTCAATGTT
ACTCAGGCCG AATTAGCGCA AAAAAAAAAG GACTCCAGCC AGCAAAAGAC GAAATAATCC
TTTATTCCGA CGAAAAGGGT TTTCAAATCG TTAGTGAGAG ACATCAAAGA AGTTTACGAG
TCATCAATAC CTCAGAACAT CCCTACACCC TTACTCCTTC GTGGACGGAA CTGCAACAGC
CCCTACAAAC AACAGGAGTC TTAAAACCAG GGATCTATCT CTCTAAAACC GTCGCTCAAT
CAATCCCACA GTCTTTTTGG GTTCGAGACG ACGGAAGCAT TGAAACAACC CTTAATATCT
TGCCTTCTGT CGATCTGGAA TATTCCCCTG CTTTCGAGTT CTTTTCGCCT AAAATTTCCG
AAATTTTATT TTACGACGGG CATTTAGGGA TTTTAAAACA AGGCGATTTT CTCGTCCGAG
TCATCAACGA ATCTGACCAC GAATACAGCA TCCAACCCAT GTGGAAAGAG GTGTGTCATG
CCTTTAAACA GATGCCGAGT GGGGTCTACG TCGATGTTGT TGAACCCGAT ACCCCTTTTC
GTACCATGAT GATCGGTCAA GATGGACTAG AAAGTCGCGT TAAAGTCCCT CCCAAGTCGG
CGGTATGCTT TGAATATCAA TGCAAGTTAT CGGATATCAG CCGTGTGGCG ATTCTGCCAT
TAGGCGATCG CTGTGCTATT CGCATGGTGT TGCACAAAAT GGAATACGAT GGACCCGCCT
ATCCCTTTGA CCTAACCCGG ACGACGAATC TCAGCGATGT AGCTGATATT ATTGAAAGTG
GGTTTTGGGA TATGTGGAAC CCCGCTTTTC TCGACTACAA CGATGAAGCT GGCCGAATTT
ACCATACTAA ATGGACGGGT TTATCTTTTG CCCACGAAGT CGAAGAGACA GACGACCCAA
TTAACGATAT GTCCCCAGTC TATGAACGTA TGCGGACTCG TTATGAGGCG CGTTCGGCTC
GTTTTTGGTA CACCATTAAT CATTGCGATG AAGTCCTGTT TATTCGGACG GGTTTTGCAA
CGCGCAGCCA GGTCATCGAT TTAGCCGATA AACTTGCAGA AAAATGTCAG GGAAAACCCT
TCCGCATTAT GATTATTTCG GCTCAGTCTA GCGACGAGTT TGCCGGACTT CCTAATGTTT
TGCATTACAG TATGTATTTT AATCCCGATC AAATGTACGA AGATTTAGGC TACTGGATGC
ACTGTACTAA TGTCATGCGC TCTATCCTTG ACTCGGTGGG AATATCGAGT AAAAATCTCT
TTTGGTGTCC CCCTAAAATC CCCAAAAGTT CTATTTAG
 
Protein sequence
MYRFQITAHT QVGESIAIVG NIPEFGEWDV TKCLELRTSG DRYPLWWVET DIDLSPFLDP 
ANDQRIEYKY VRLYPDEGVE WETQGSNRWL PLDPQPGSFT ITVEDGQFGR VQPWPFGYWD
APRTPLPKAK DGLKIVVIGS SVAEGYNAWL FKGWVWRLEQ ALNAKYGHQV VNVSQLGTNI
TTTMERFSRV VPPEKPDIVI ISLSLGNEGL AYCPPHERPA VQRRFETGLQ ELVKMTQDLG
AMPMLGAVYP HGDYTPDHNW FLQDTHQRMR SWGIPLLNWL AALNNGQGRW KPGISFEPPH
PNTEGHRLMY EAIDLSLFNV TQAELAQKKK GLQPAKDEII LYSDEKGFQI VSERHQRSLR
VINTSEHPYT LTPSWTELQQ PLQTTGVLKP GIYLSKTVAQ SIPQSFWVRD DGSIETTLNI
LPSVDLEYSP AFEFFSPKIS EILFYDGHLG ILKQGDFLVR VINESDHEYS IQPMWKEVCH
AFKQMPSGVY VDVVEPDTPF RTMMIGQDGL ESRVKVPPKS AVCFEYQCKL SDISRVAILP
LGDRCAIRMV LHKMEYDGPA YPFDLTRTTN LSDVADIIES GFWDMWNPAF LDYNDEAGRI
YHTKWTGLSF AHEVEETDDP INDMSPVYER MRTRYEARSA RFWYTINHCD EVLFIRTGFA
TRSQVIDLAD KLAEKCQGKP FRIMIISAQS SDEFAGLPNV LHYSMYFNPD QMYEDLGYWM
HCTNVMRSIL DSVGISSKNL FWCPPKIPKS SI