Gene Cyan8802_2531 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCyan8802_2531 
Symbol 
ID8391856 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8802 
KingdomBacteria 
Replicon accessionNC_013161 
Strand
Start bp2559768 
End bp2561486 
Gene Length1719 bp 
Protein Length572 aa 
Translation table11 
GC content42% 
IMG OID644980494 
Productprotein of unknown function DUF1400 
Protein accessionYP_003138231 
Protein GI257060343 
COG category[R] General function prediction only 
COG ID[COG4188] Predicted dienelactone hydrolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.732033 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.144228 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATCCCT ATCGACTCTT TTCTCAAGGC TTAATGCGTC AATTGATACA AGGATTAACC 
TTAGCAACCC TTTCAGCCTC TTTAACTACG CTTCCGGTTG CCGCTTACCA AAGATTACAT
TTTATCTATC CGCCTATCAA TCGATCGCTA GGGATTGATT CCTTGACTTT GTTTGCTGAA
GAAGGAATTG TTAATCGAGA ATTAGAAGAT TATTTGAATT TAGCCGGGGT TAATGACCAG
CAAAAAGCAG AATTTCGAGA AGCTTTACGC AAAAAAGCCC CTGTTGATCC CATCCAACTG
TCCCGCTTTC TTAACTCTTC CCTTGGAGAA TCTATCTTAG AACGCTTAGG AGTCCTCATT
TCCATTCGCG GGGGACGCAA TGGGAAATAT GCCATCAGAG GGGCGATGGT TAAAGCTGCT
TTAGATCCCC AAGAAGGACT GACGGTACTC AACGTCTTCC GAAACCTAGC AGTGGATATG
CAGTTTAATT TAGATGATAT CTTTATCACT GCTGACTACA TCGATCTTTT AGGACGGGGA
ACCGATGGGG TAGTGGAGGA AATGAAACGC CTAGCAGCAA TGGAAGCCAA TAACGGAACC
CCTGTTAATT TTTCCACATT GCCAGATCTG CGCCAACCGG GAAGCTACGG GGTTGCTCCT
GAGAGAATCT GGCAACTTAA AGATGAAAGC CGCGATCGCA ATTTTGATGC CCTCGTCATT
CAGCCGCAAC GCTGGCGTGA GGGAAAAAAT CCCGTAGTTA TTATATCCCA CGGGTTAGCC
TCTCGGCCGG AAGATTTTGC TGATCGTGCT AAACAACTCG CTTCCTACGG TTATTTAGTC
GTCTTACCCC GACATATTGG CAGCGATACC CGACAACTGC AAGCTATGTT AGATGGCTTT
TCGCGGGAAG TTTATAAAGT GAGTGAATTT ATTGATCGTC CCCTCGATAT TAGCTACGTT
ATTGATGAAT TAGAAAGACG CAATAATCCA GAGTTTCAAG GACGTTTAGA TCTGCAAAAT
GTGGGCGTTA TGGGGCATTC TTTTGGTGGT TACACCGCTT TAGCTGTAGC GGGTGCGTCC
TTAGATTTTG CCACTCTAGA AAACCAATGC AGTCGTAGAA TTTGGGGTCC GAATCTTTCT
TTATTGCTTC AATGTCAAGC TTTAGAATTG CCTCGAAAAG AGTATAATTT TCGGGATGAA
CGAGTGACTT CTATCTTGAT TATTAATCCG GTTACGAGTG CTATTTTCGG ACAAAAAGGG
CTTAATCAAG TGAAAATTCC GGTCATGATT GGGGCAGGAA GTAGTGATCC AGCAACGCCA
GCAGCCGTGG AACAACTTAA AGCTTTTGTT TGGATTAATA CTGATGATAA ATATTTACTG
TTAGTAGAAG GACAAGCTCA TGTTAACTTT TCTAAGTTAG ATGCGAGTAC CAAAGCGTTA
ATTGATTCCT TACCGAATTT ACAAGTTCCT AAACAAGAGA TTATTGATAG TTATGGCAAT
GCTTTGTTAC CAGCTTTTTC TGAGGTTTAT GTTGCTAAAA ACGAAGCTTT TCGTCCTTTT
TTGACCTCGG CTTACGGAAA ATATATTAGT GAACAACCTA ATGCTTTGCA TCTTGTCCAA
GCTGAAGCGG ATGTTCCTTT AAGTGAGTTA TTTAATCGCT TAAAACCTGA ACATTTTCCT
GCTATTTATT CCCCTAGAAT CAGTAACAGT AATCCGTAA
 
Protein sequence
MNPYRLFSQG LMRQLIQGLT LATLSASLTT LPVAAYQRLH FIYPPINRSL GIDSLTLFAE 
EGIVNRELED YLNLAGVNDQ QKAEFREALR KKAPVDPIQL SRFLNSSLGE SILERLGVLI
SIRGGRNGKY AIRGAMVKAA LDPQEGLTVL NVFRNLAVDM QFNLDDIFIT ADYIDLLGRG
TDGVVEEMKR LAAMEANNGT PVNFSTLPDL RQPGSYGVAP ERIWQLKDES RDRNFDALVI
QPQRWREGKN PVVIISHGLA SRPEDFADRA KQLASYGYLV VLPRHIGSDT RQLQAMLDGF
SREVYKVSEF IDRPLDISYV IDELERRNNP EFQGRLDLQN VGVMGHSFGG YTALAVAGAS
LDFATLENQC SRRIWGPNLS LLLQCQALEL PRKEYNFRDE RVTSILIINP VTSAIFGQKG
LNQVKIPVMI GAGSSDPATP AAVEQLKAFV WINTDDKYLL LVEGQAHVNF SKLDASTKAL
IDSLPNLQVP KQEIIDSYGN ALLPAFSEVY VAKNEAFRPF LTSAYGKYIS EQPNALHLVQ
AEADVPLSEL FNRLKPEHFP AIYSPRISNS NP