Gene Cyan8802_2429 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCyan8802_2429 
Symbol 
ID8391750 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8802 
KingdomBacteria 
Replicon accessionNC_013161 
Strand
Start bp2444689 
End bp2447739 
Gene Length3051 bp 
Protein Length1016 aa 
Translation table11 
GC content38% 
IMG OID644980393 
ProductWD-40 repeat protein 
Protein accessionYP_003138134 
Protein GI257060246 
COG category[R] General function prediction only 
COG ID[COG2319] FOG: WD40 repeat 
TIGRFAM ID[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGAATA ACTCTTCTAA AAGCAGTGTT TTTATTTCCT ATTCTCGTCG AGATAAAACC 
TTTGTTCAAA AACTCCATCA CGCTTTAAAT GAAGGAAAAC GCGATATCTG GGTAGACTGG
GAAGATATTG CCCCGACGGA AGATTGGCGA GAAGCGATCG CTCAAGGGAT TCAATCTGCT
GACAATTTTT TATTTATTAT TAGTCCTGAT TCGGTTAAGT CTATAGAATG TAATAAGGAA
ATTGATCATG CTATTAAATG CAATAAGCGA TTAGTCCCGA TTTTATATCG AGAGGTTGAG
GATAATAGTG TTCGACCAGA ACTAGCCAAA CATAACTATA TTTTTTTTCA AAATGAAGAG
AGTTTTTTTG AAAATTTAGA GAAACTAGAG CAAGCTTTAG ATACTGATTT AGCTTATATA
AAAGAACATA CTTATCTGTT GAGTCGAGCT ATTTTATGGC AACAAAAACA GCGAGATCCT
AGTTATTTAT TGCAAGGAAC CGCGCTCAAA GAAGCTCAAG AATGGATTAC CCATAGTCTA
AATCAAACGC CTAGACCGAC CCAACTACAC AATGATTATA TTATTACCAG TATTCAAAAA
AGTAAACAGT TTTTACGCAG AATTGCTATT GTCGTTGGGG CATTAGGATT AATTGCTTTT
GCTTCTTTTT TAGTGGCTTT ATCAGAACGA AATCAGGCTA AAGAAGCAGA ATTAAAAGCT
AAGAGTGAAG AAGTTAAAGC CTTAACAGGA TGGGCTCAAG CAAGGCTTTT AAGACATGAA
CAATTAGATG CGTTAATTAA TATTATTAGA GCGGTTGATA AACTGAAGGA TTTACCGCAA
TCTTCTTCTG AAGAAACCTT TTCTGATTTT GCTCAAATAC CAATTCATGA TCAGGTTGAA
CAAACCCTAA GACAAGTGGT TTATACCTTG CAAGAATTAA ATCATTTACA AGCTGATCAA
AAAACCATTT ATGATGTACA ATTTTCGCCA GATCATCGAT GGATTGCTTC TGCTAGTGCA
GATACTAAAG TCAATCTCTG GAAAAATAAT CAACGACAAA CTAGCTTGCT TCATCAAGGC
GTTGTGTGGC GGATTGGGTT TTCTCCTGAT AGTCAAATGA TGGTTTCTGC TAGTGAAGAT
AAAACCGTTA AATTGTGGCA ACTTAATCCT CAAGGCAATT GGACTCTTAA GCAAACTTTA
ATCCATCCTG TTCCTGTGAG ATCTGTTACC TTTACTTTTA CGGATCAATG TTCACAAACT
GGTCAAAAGA TTGCTTCTGC GGGAACTGAT GGTCTGATCC GAATATGGAA TTTAGAAGGA
AAATTACAAA GAACCTTTCA AGCACATTCA GGAACAATTA ATGATCTCAA AATTTCTCCA
AATTGTCAAA CCCTTGCTAG TGCTAGTGAA GATAGAACGG CTAAATTATG GACGTTGGAT
GGACAAAAAA AAGCGACTCT TCTTGGACAT GAAAATCAAG TTTGGACTAT CAATTTTTCT
CCCGATGGTC AACGAATTGT GACGGGGAGT TTTGATACGA CGATTAAGCT GTGGGATCAA
ACGGGACAAC TACTAAAAAC CCTCGAAGGA CACGCTAATT GGGTCATGAG TGTTATTTTC
TCCCGTAATA GTCAAGAAAT TGTTTCCGGT GGTGAGGATG CTATGCTGAA ATTCTGGAGT
CGAGAGGGAG ACTTATTTGC CTCTTTATTA AGTCCCCATG GGGATATCGG AAGTATTAAT
ATTTCGGCGG ACAATCAATA TTTAGTCTTT ACTGGAGATA GTGGCAAAAT GAGTCTATGG
CAGCAGGGAG GAAGTGTCAT TGAAATTCTA CGCGGCCATA CTTCTGGTGT CACGGGGGTT
CATTTTTCGC CTGATGGACA ATTAATGGCT TCAGTGAGTA ATGATCAAAC GGTAAAATTG
TGGCAATTTG ATCCCCAAGC AAAGCGCATG GAATTGCAGC AAACGCTGGA ATATCGCAAA
GGAGAACCCG AAGGAGGACT GAAAAATGTT AATTTCACTC CCGATGGTCA ATATTTGATT
ACGACAAGTT ATGACAACAC TTTACAATCT TGGAATGTTA AAAAAGCTTT AACCCATTCT
TCTATTCAAG GAGAAATTAT TGCCAAAAAT AATACGGTTG TTAATCGTTT TAGGATTTCA
TCTGATGGTA AACGATTGGC GTTGGCAAGT GCAGATGGGA CGATTAAACT GTGGGATCTC
AAGTCTCAAA AATTGTTAAA AATTTTGACA ACTAATCAAT CTCCATCCTT GACGAATAAT
GGGATCAATC AATGTCAGAA AATTCAGCAA GGATATCCCC CTCAGTCTAC AGATGTGGCT
TTTTCAAAGA ATAATCAATA TTTGGTTGCT TCTTATTCTG ATGGTTGCCT CAAACTTTGG
AATCTTGAGG GTCAATTGAT TCAAGAATTT CGGGGTCATC CACAATGGAT TAATGCGTTA
AGATTTAGTC CTGATGGCCA GTTATTAGCG ACCACGAGTC GAGATAATAC GATTAAACTT
TGGCAGTGGA GAAAAACCCA ATTTAAGATC GATCAACCGA CTAAAATTTT GAAAGGTCAT
CAAGACTGGG TTTGGAATGT GGCTTTTACG TCTGATGGAA AGAAATTAGC GTCAGGGGGA
AAAGATAACA CGGTTAAACT TTGGAATATT ACTACTCAAT CACAATCGGA TCAATCGGAT
CTTATTGTTA CACTTCAAAG TCACATTGAT TGGGTAACAT CCGTTGATTT TAGTCCCTGT
AATCAGGATA ATAAAGATTA TCCTAATTGT CATCAAAGGC TTCAATTAGC CTCAGCAAGT
GCCGATCAAA CAATTATTTT TTGGAAGATG GAAGAGGTAT TACGGATTGA AACAAAAGAT
AATCATGAAA CAGCCTTACA ATCGTTGTTT AAAAAAGGGT GTCAATGGCT TTCTGTGTAC
CTGGAAACCA ATCCAGATAC ACCAGAAGCG AGTGATATTC GTTCCGCTTG TGGAGAGACT
AAACCTCCGT CGGATCAACC GGGAAATAAA ATTCTATCCC CTGATCAATG A
 
Protein sequence
MTNNSSKSSV FISYSRRDKT FVQKLHHALN EGKRDIWVDW EDIAPTEDWR EAIAQGIQSA 
DNFLFIISPD SVKSIECNKE IDHAIKCNKR LVPILYREVE DNSVRPELAK HNYIFFQNEE
SFFENLEKLE QALDTDLAYI KEHTYLLSRA ILWQQKQRDP SYLLQGTALK EAQEWITHSL
NQTPRPTQLH NDYIITSIQK SKQFLRRIAI VVGALGLIAF ASFLVALSER NQAKEAELKA
KSEEVKALTG WAQARLLRHE QLDALINIIR AVDKLKDLPQ SSSEETFSDF AQIPIHDQVE
QTLRQVVYTL QELNHLQADQ KTIYDVQFSP DHRWIASASA DTKVNLWKNN QRQTSLLHQG
VVWRIGFSPD SQMMVSASED KTVKLWQLNP QGNWTLKQTL IHPVPVRSVT FTFTDQCSQT
GQKIASAGTD GLIRIWNLEG KLQRTFQAHS GTINDLKISP NCQTLASASE DRTAKLWTLD
GQKKATLLGH ENQVWTINFS PDGQRIVTGS FDTTIKLWDQ TGQLLKTLEG HANWVMSVIF
SRNSQEIVSG GEDAMLKFWS REGDLFASLL SPHGDIGSIN ISADNQYLVF TGDSGKMSLW
QQGGSVIEIL RGHTSGVTGV HFSPDGQLMA SVSNDQTVKL WQFDPQAKRM ELQQTLEYRK
GEPEGGLKNV NFTPDGQYLI TTSYDNTLQS WNVKKALTHS SIQGEIIAKN NTVVNRFRIS
SDGKRLALAS ADGTIKLWDL KSQKLLKILT TNQSPSLTNN GINQCQKIQQ GYPPQSTDVA
FSKNNQYLVA SYSDGCLKLW NLEGQLIQEF RGHPQWINAL RFSPDGQLLA TTSRDNTIKL
WQWRKTQFKI DQPTKILKGH QDWVWNVAFT SDGKKLASGG KDNTVKLWNI TTQSQSDQSD
LIVTLQSHID WVTSVDFSPC NQDNKDYPNC HQRLQLASAS ADQTIIFWKM EEVLRIETKD
NHETALQSLF KKGCQWLSVY LETNPDTPEA SDIRSACGET KPPSDQPGNK ILSPDQ