Gene Cyan8802_2323 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCyan8802_2323 
Symbol 
ID8391643 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8802 
KingdomBacteria 
Replicon accessionNC_013161 
Strand
Start bp2339300 
End bp2340436 
Gene Length1137 bp 
Protein Length378 aa 
Translation table11 
GC content44% 
IMG OID644980292 
ProductGUN4 domain protein 
Protein accessionYP_003138034 
Protein GI257060146 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.963126 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAATC AAACATCAAC TCCGTTGCTT TTTATTTCTT ACCGTCGAGA CGATAGTGCT 
GATGTAACGG GGAGAATTTA TGATCGTTTA ATTCAATATT TTGGGAAAGA CACGATTTTT
AAAGATGTGG ACTCGATTCC CATCGGCGTT GATTTCCGTC AGTATATCGA TCAAGAAGTG
GGGCGATGTC AAATCTTATT AGCGATTATT GGTCAACAAT GGCTCAATAT TACTGATACC
ACGGGAAAAC GTCGCCTAGA CGATCCCCAA GATTTTGTTA GACTCGAAAT TGAATCCGCC
CTGAAGCGCA ATATTCCCGT GGTTCCCGTT CTGGTTAGGG GAGCAAAGGT TCCTACTGAA
CAAGAATTAC CCCCCAGTTT AAGGGAATTG GCTTACCGGA ATGGGAGTTT AGTGCGATCT
GATCCCGATT TTCACGGAGA TCTCGATCGC TTAATTCTGG GGATTGAGCG CCATCTTGAA
GAACATCAAG CCAAATCGCC TCAACCCTCC TTAAAGACTT CCTTTCCCTT CAAATTCAAG
TCCTGGTGGT TGCTAGGAGG ATTAGGGGGG GCGATCGCTC TTATCCTGGG TATTGGCTCG
CTTTTGTCCC AAGTTTCGAT CTTTGTTGAC ATTCAACCCC TTCAATACAA ACAACTGGAA
AAATTTTTAA ACGAGCAAAA TTGGCAAGCG GCTGATCGAG AAACGGCAAA AATCATGTTA
GCAGCAACGG GAAGAGAACA AGAAAAATGG ATCGATAAAA AGGGGATCAA TCAGATGTCT
TGCCAAGAGA TTCGCAAGAT CGACGATCTT TGGCTCAAAG CGAGTCAAGG AAAGTTTGGG
TTTAGTACAC AGCGAGAAAT CTGGAGAAAA GTCGCTAATA ACGATAAATT TGGCGATCTA
ATAGGCTGGC GACAGAATAA TCAATGGCTA ACGACCGATC AATTACAGTT TAATTTAAGT
GCACCGAAGG GGCATTTACC GTCGAGTTCC CGTGAAGGCA AATTATCAGG GGGATGGTTA
GTCTGGTATT TATTACCGAT GACGACGACG GGCAATCAAT CAGATTCTAA GGCGAGTCAG
TGTTGGCCAG AGGAAAAAGC AGTTAGTTTC TCCGATTCTG CTTCTGCATT TTCTTGA
 
Protein sequence
MKNQTSTPLL FISYRRDDSA DVTGRIYDRL IQYFGKDTIF KDVDSIPIGV DFRQYIDQEV 
GRCQILLAII GQQWLNITDT TGKRRLDDPQ DFVRLEIESA LKRNIPVVPV LVRGAKVPTE
QELPPSLREL AYRNGSLVRS DPDFHGDLDR LILGIERHLE EHQAKSPQPS LKTSFPFKFK
SWWLLGGLGG AIALILGIGS LLSQVSIFVD IQPLQYKQLE KFLNEQNWQA ADRETAKIML
AATGREQEKW IDKKGINQMS CQEIRKIDDL WLKASQGKFG FSTQREIWRK VANNDKFGDL
IGWRQNNQWL TTDQLQFNLS APKGHLPSSS REGKLSGGWL VWYLLPMTTT GNQSDSKASQ
CWPEEKAVSF SDSASAFS