Gene Cyan8802_2004 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCyan8802_2004 
Symbol 
ID8391320 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8802 
KingdomBacteria 
Replicon accessionNC_013161 
Strand
Start bp2022106 
End bp2023839 
Gene Length1734 bp 
Protein Length577 aa 
Translation table11 
GC content48% 
IMG OID644979985 
Productprotein of unknown function DUF1555 
Protein accessionYP_003137730 
Protein GI257059842 
COG category[R] General function prediction only 
COG ID[COG3211] Predicted phosphatase 
TIGRFAM ID[TIGR02595] PEP-CTERM putative exosortase interaction domain 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.160905 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATACTTC CTAATAATAA CATTTCCAAG GGCTTCACAG CTACCCTGAT TGCGCTGGGC 
ATGAGTGGGG CATTAACCTT AATTCCCAAA CCTGTTAGCG CACAACACAT AACCACTGCT
GGAGCTCAAC TAACTGGACT CAATGGATTC AACTACGAAC CCATTTTCAC CGTTGGCGAG
ACGATTAATG GCTATCAACC CCCTGGTATC CTCGATGGAA TTGGGGCAGT TGAAGGCAGT
AGTATTGGTC TAGGGAACAA TATTGTTCGA GTCTTGGTTA ACCATGAAAT AGCCCATACT
CCTGGGTCGG ATGGAACCCC CCAAGGATCT TCCTATACCC TAGAAAATGG AACTGTCATT
CAAGGGGGAG CGCGGATCAG CTATTTTGAT ATTGACAAAA ATAGCCGTCA AATTGTCGAT
TCAGGGCTTG CTTTTGACAC CATTTTTAAC CGAGCGTTAG AAGTGGTCGA TGATCCCTTG
ACCGACTTTG ACCTAGGACG AACTGCATTA AGTCGTTTTT GCTCCAGTGC AGCCTTTGTG
GCGGATGCCT ATGGTGCTGG TCTAGGCTTT GAAGATACTA TCTATATTAC GGGAGAAGAA
ACCACCGACG GAAGCATATA CGCGCTAGAT ATCGCTAACG GGGATCTGTA TGCGGTTCCA
TCCCTAGGAC GGGGTGGTTG GGAAAATGTG ACCCAAATCA ATACCGGACT AACTGATACG
GTTGCCCTAT TATTGGCCGA TGATACATCG GATTCTCCGA TGTATCTCTA TGTAGGGACT
AAGGATGCGG GGGGTAATTT CCTCGAACGC AATGGATTAG CCGATGGAAC CATCTATGCT
TGGGTTCCTG ATTCGGGTAA TAATACTCCT GAAACCTTTA ATGGTACAGG CAGTTCTGAA
ACAGGGACTT GGGTTGCTCT GACTAATGAA GGAACCGGAG CCGGATTTAG TGGAGGGTAC
GCCTTGGCAC AAACTCTTCG GGATGAAGCC TTTGCCGAAG GAGCGTTCCA CTTCTCTCGT
CCTGAAGATG TGGCGACCAA TCCCGACGAT GATCGAGAAG TTGTCTTAGC CTCAACGGGT
GACGGAGATC TGTTCAATGG GTCTGATAAC TGGGGAATGA CCTATATTTT CGACTTGAAT
GGCTTACAGT TTGATGCAAG TGGATTAGAT TTGGCTAATT CTACTACTAC CCTCAGCATT
CTCTATGACG GCAACGATAC GTCTGCTTGT AGTGCCCAAT TTCCAGGGGG GTCGGATTTC
GGTTTACGGA GTCCTGATAA CTTGGATTGG GGTCAAGACG GTTACATTTA TGTTCAAGAA
GACCGCGCAA CGACTCCAGG TAGTCTTTTT GGGGGAACTT CCGGCGAAGA AGCCTCGATT
TGGCAACTCA ACCCCAATGA AAATTGTGAC TTAACCCGTG TTGCTCAGGT TGATCGCGGA
GCTACTCTAT TGCCGGGACA GAGTGACATC TCTCCCACCG ACCTTGGAAA CTGGGAAACC
TCTGGTATCT TAGATGTGAC CGCTTTTTTC CCCACTAAAC CCGGAGAAAA GCTCTTTATC
TTGGATGTAC AGGCTCACAG TGTCCGAGGA GGGGATATCA ACTCTAATAA CTTGGTTCAA
GGGGGACAAC TCGGCTTCCT GTCTCAGCAA GTCCCTGAGC CCAGTTCTCT TTTGGGGTTA
GGGTTATTCG GTTTGTCGGC TTTCGGGTTA AAGCGTAAGC GCGACCAGCA ATAA
 
Protein sequence
MILPNNNISK GFTATLIALG MSGALTLIPK PVSAQHITTA GAQLTGLNGF NYEPIFTVGE 
TINGYQPPGI LDGIGAVEGS SIGLGNNIVR VLVNHEIAHT PGSDGTPQGS SYTLENGTVI
QGGARISYFD IDKNSRQIVD SGLAFDTIFN RALEVVDDPL TDFDLGRTAL SRFCSSAAFV
ADAYGAGLGF EDTIYITGEE TTDGSIYALD IANGDLYAVP SLGRGGWENV TQINTGLTDT
VALLLADDTS DSPMYLYVGT KDAGGNFLER NGLADGTIYA WVPDSGNNTP ETFNGTGSSE
TGTWVALTNE GTGAGFSGGY ALAQTLRDEA FAEGAFHFSR PEDVATNPDD DREVVLASTG
DGDLFNGSDN WGMTYIFDLN GLQFDASGLD LANSTTTLSI LYDGNDTSAC SAQFPGGSDF
GLRSPDNLDW GQDGYIYVQE DRATTPGSLF GGTSGEEASI WQLNPNENCD LTRVAQVDRG
ATLLPGQSDI SPTDLGNWET SGILDVTAFF PTKPGEKLFI LDVQAHSVRG GDINSNNLVQ
GGQLGFLSQQ VPEPSSLLGL GLFGLSAFGL KRKRDQQ