Gene Cyan8802_4288 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCyan8802_4288 
Symbol 
ID8393640 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8802 
KingdomBacteria 
Replicon accessionNC_013161 
Strand
Start bp4423189 
End bp4426482 
Gene Length3294 bp 
Protein Length1097 aa 
Translation table11 
GC content34% 
IMG OID644982198 
Producthypothetical protein 
Protein accessionYP_003139909 
Protein GI257062021 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.400054 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTACAAT TAGGCAATTC ATCTCAACAG TTTCGCGGAA TTAACCGAAC TATTTGTATT 
GGATTAGGAG GAACTGGATT AGAAATTTTA ATGAGAATCC GGCGATCAAT TGTTAATAAG
TATGGAGACT TAAACAACCT TCCTATCGTT AGTTTTGTTT ATATTGATAC CGATAAAGCT
GGTTCACAAT CATCTATTTT ACGCACAGGA AATATTCATC ATGGGGTCGA TCTAAATTTA
CGGGAATCTG AGAAAGTAAA CGCTACTATG ACAGCCATTG AAGTTGAGAA CTTTAAGAAA
GGATTAGAAC GACGTTCTAG CTATGATAGA CAGGGTCCTT ATGACCATAT TAATCTCTGG
TTTCCTCGTC AATTATCCAA CAATATTAAA GCTATTGAAG ATGGTGCAAA AGCAATTAGA
CCAGTCGGAA GATTAGCCTT TTTTCACAAC TATCGAAAGA TTAAAACCTC TATAGATGCA
GCAGAAAGAA GAACTAGAGA TCATAGTTCA AAACTATTAA AATCAGGTTT AAGAGTCGAT
GATCAATTGA ACATTTTTGT CGTTGGGTCT TTGTCTGGTG GTACGGGAAG CGGAATGTTT
TTAGATATAG CTTATAATCT GAGACGTGAT TATAGCAAAC AAGGAGTTCA AATCGTTGGT
TATTTAGTTG TTGCTCCCAA TCTTTATACT TCTCCTGCCA ATCCTAATCC GACGATTTTT
GCGAATACCT ATGCTGCTCT TAAAGAGCTT AATTATTATA GCACACCTGG AACTCAATTT
AAAGCTTGCT ATGATATACA AAACATGGCA ATTGTAGACG AAGATCGTCC TCCTTTTGAC
TATACTTATT TAGTTTCTAA TGGCAATAAT ACGGGATATC AAATATTAGA TAAACGTAAG
CTTTGTAACG TTATTGCTCA TAAAATTGCC TTAGACTTTT CTGGAGAATT AGCTCCCGTA
GTTAAAGGAC AACGGGATAA TTTTGCACAG CATTTACTCC AATTAGATGA TTATCCCCGT
CCTAATGTTC AAAGATATCT ATCTTTTGGA CTAGCTGCCA TTTATTTTCC CCGCGATCTC
ATCGTTCGTA TTGCTTTAAC TAGGATTAGT TTAAATCTAG TTAATTTTTG GTTGTATGGG
TTTGGTCAAA GTCCCGATCC TAGGCAATTA CTCGAAGAAT TTTTATTACA AGAACGCTGG
CATGATGATA TAGTAGAAAG GGATGGATTT ACCAAACGAC TAGAATCAAC TCCCCAAGAA
GTTAATAAAA CTTTTAGTCA AACTCTTAGC AATTGGCGAA ATAAATTAGA AACCATAATT
GATGATTGCA AAAGTAAGGA TGATCGGATA TCTTTGCGTC AGCAATTACC GAGAGAATTT
CGAGAACAAT TTCGTAAAAC TCAACCAGGA GAAAACGAAA CAACAAGGGG TATTTGGTTG
ACTAAAATAC AACAATCTCG ACCAGGAATT ACTAAAGAAC TAAAAGAAAC AATTGATCGG
TTTATTGAAG CTTTATTGAC TCCGAACAAT AGTAATTTTT CTATTAAGAA TAGTCGAGAT
TGGTTAGATG CACTTATTAG CGAACTACAT GATTATCAAC ACGATCTCGA AGAAAGGATT
CAAAACTTTG GAGGGATGGA AAATATAGAA AGTATCGACA AAAAATGGCG AGATACAGAA
CAAATTATTG AAGATATTGA ACAGAAATTT CAGTTATTTG GTCAAAAAAA TTCTGATATT
CAAAATGAAG CTAAACGAAG TGTTCGACAA ATTAGCAATT TGGTTAAACA TAATTTTGAT
TTAGTTGTGA ACCAAGAAGC GTTACAAATT GTCAAAGATT TACAGCAATA TGTTCAAGAT
TGGTCGACCC AATTAGCGAG TTTTTATCGG TTAGTGGATA ACCTAAAAAG TGACTATCAA
CGACAAGAAA CTGAATGGAA ACAACTGGAT ATTGATGAAA TGAGTGGGGA AGCAATTTTT
GATGATCAAG ATATTGACAG TTGCTACAAT GAATTATTGC CAGAAAATGA CTATAAAGAT
CAGTTAATTG TTTTAAGTAG AGATATAACT ACAGCATCAG TGCGTTCAAC ATCTCTGATT
AATTTTGTTG AACGGACAAC TTTTGGTGCA GCTTATATTG AGAGCAAAAC CCAAAATAAA
ATTCAACAAG ATATCAATCT AACTGTCAAT AGTTCTTTTG GTTTTCGTAG TCTTAAAATT
GTCAAATCTG TTATTAAGCG ATTTCTCGAA CATTATACCT CTCTAGAACG ATCTATCCGT
TTAGAGAGTA TCCTTAAAGA AGCAGAACCT CTGATTAATC TTAACTTAAA CGATCCGTAT
TTTCGAGATA ATCCTGCTAA AAGAACTCAA ATCATTGGGT TTAAGGATAC TGATGAACAA
GAAGTTAAAG ACTTCAAAGA AATTTTGCTA CGCGACTTAA AAAATATCAC CGAGAATGTC
ACTAAACCAA CTCAAGCAGA AGATGAGATT TTAATGGTAA CAGAATATGC TTCTTTTCCT
CTAAGATTGA TTGAAGGATT AACAGAAATG CAAAACTATT ACCTACGGGA AAAAAATATT
AGTAATGTTT GTTTACATAA TGATCCCCCT GAACAGTTTC CCGATATTAT TCCTCCTGAT
GTGAAAACAT TGGAAAGATT AGAAGAAATC TTTTATCCCT GCCTTGCGTT TGAATTATTG
AAGGAAAACC CATCAACCCA AGAATTAGAA TTAGAACATT ATGATCAGAT TCGAGATACT
TATTATACCG TCTCTCTTAG TCCTATCTGG AATCTAGCGT TAGAAACTCT TCATCAACAT
CTTGACATAA TTAATGCGCT AGAAGAACTT CTAAAACAAG CAGAAAATGA GATAGAACGA
GAACCTGAAC GTTGGCAAAA TTACTATCTC CCTAAATTAC GAGAATTTGT AAAGAAAGTT
GATCAACTTT CGCCTGAACA TCCTAATTAT CCTTATAAGT CAACAGTTGT GGGAACTCAA
GGAAATCTTG AAACCTTAGA CAAAGAAGGA GTTATTATTC GCTTTCAACG TCGCATGAAA
GACAAGGTTA ATACTTTACA ATCTGATCAA AAAATACTCA ATACTCAAGA AAATGTTCAA
AAGGTATTAT CGAGTGATTC TGATATTATT GATATTGAAA CAGATCCTCC AAAAACTCAA
CAATTTTCGG ATGATTTTAT GGTAAAATTA CGAGAATTAG GGCAAATGCG TATAGATGGA
TTGCTTACCG AAGAAGAATT TCAAATAGCT AAGAAAAAAC TGTTAGGTAG TTAA
 
Protein sequence
MVQLGNSSQQ FRGINRTICI GLGGTGLEIL MRIRRSIVNK YGDLNNLPIV SFVYIDTDKA 
GSQSSILRTG NIHHGVDLNL RESEKVNATM TAIEVENFKK GLERRSSYDR QGPYDHINLW
FPRQLSNNIK AIEDGAKAIR PVGRLAFFHN YRKIKTSIDA AERRTRDHSS KLLKSGLRVD
DQLNIFVVGS LSGGTGSGMF LDIAYNLRRD YSKQGVQIVG YLVVAPNLYT SPANPNPTIF
ANTYAALKEL NYYSTPGTQF KACYDIQNMA IVDEDRPPFD YTYLVSNGNN TGYQILDKRK
LCNVIAHKIA LDFSGELAPV VKGQRDNFAQ HLLQLDDYPR PNVQRYLSFG LAAIYFPRDL
IVRIALTRIS LNLVNFWLYG FGQSPDPRQL LEEFLLQERW HDDIVERDGF TKRLESTPQE
VNKTFSQTLS NWRNKLETII DDCKSKDDRI SLRQQLPREF REQFRKTQPG ENETTRGIWL
TKIQQSRPGI TKELKETIDR FIEALLTPNN SNFSIKNSRD WLDALISELH DYQHDLEERI
QNFGGMENIE SIDKKWRDTE QIIEDIEQKF QLFGQKNSDI QNEAKRSVRQ ISNLVKHNFD
LVVNQEALQI VKDLQQYVQD WSTQLASFYR LVDNLKSDYQ RQETEWKQLD IDEMSGEAIF
DDQDIDSCYN ELLPENDYKD QLIVLSRDIT TASVRSTSLI NFVERTTFGA AYIESKTQNK
IQQDINLTVN SSFGFRSLKI VKSVIKRFLE HYTSLERSIR LESILKEAEP LINLNLNDPY
FRDNPAKRTQ IIGFKDTDEQ EVKDFKEILL RDLKNITENV TKPTQAEDEI LMVTEYASFP
LRLIEGLTEM QNYYLREKNI SNVCLHNDPP EQFPDIIPPD VKTLERLEEI FYPCLAFELL
KENPSTQELE LEHYDQIRDT YYTVSLSPIW NLALETLHQH LDIINALEEL LKQAENEIER
EPERWQNYYL PKLREFVKKV DQLSPEHPNY PYKSTVVGTQ GNLETLDKEG VIIRFQRRMK
DKVNTLQSDQ KILNTQENVQ KVLSSDSDII DIETDPPKTQ QFSDDFMVKL RELGQMRIDG
LLTEEEFQIA KKKLLGS