Gene Cyan8802_4149 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCyan8802_4149 
Symbol 
ID8393500 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8802 
KingdomBacteria 
Replicon accessionNC_013161 
Strand
Start bp4277116 
End bp4279116 
Gene Length2001 bp 
Protein Length666 aa 
Translation table11 
GC content49% 
IMG OID644982064 
Productglycine oxidase ThiO 
Protein accessionYP_003139776 
Protein GI257061888 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG2022] Uncharacterized enzyme of thiazole biosynthesis 
TIGRFAM ID[TIGR02352] glycine oxidase ThiO 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.572473 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGCAA GTAACGACAT TCTCATCATC GGCGGCGGAA TTATCGGACT AGCCATTGCC 
GTTGACCTTA AATTACGCGG TGCATCTGTT ACTGTCCTTG ACCGCAACTT TCCCCATAGG
GCAAGTCAAG CAGCAGCCGG AATGTTAGCC CCCTTCGCAG AAAATCTTCC CCCTGGTCCA
ATGCTGGATC TTTGCTTGAA GTCCCGATGG CTATACCCGG AATGGGTTCG TAAACTGCAA
GACCTCACAG GACTCGATTT AGGCTACAAT CCCTGTGGTA TCCTCGCCCC CGTTTATGAG
TTACCCTCGG AACAATTTTG TCATAATACC GCTTCTCAAT GGCTAGATAA AACGGCTATT
CGTCTGTATC AACCCGGGTT AGGGGATGAT GTGGTCGGAG GATGGTGGCA TCCCGAAGAT
GGCCAAGTAG ACAACCGCCA AGTAATGGCA GCCTTACAGC AAGCAGCCCA ACAATTAGGT
ATTCAGGTAA AGAACGGTGT CACAGTTCAG ACGATCCAAC AGCGTCAGGG AAAAATAGCC
AGTATTTTAA CATCTGAAGG CGAATTTGAA GCGAAAACCT ATGTTTTAGC GAGTGGATCT
TGGGCAAGTC AGATTTTACC CTTACCCGTC CGTCCGATCA AAGGGCAAAT GTTAGCCGTC
ACTATGCCAC AGCAACCCGG AGAACCTTTC CCTCTGCAAC GGGTGTTATT TGGTCCGAGT
ACCTATCTAG TCCCCCGACG CAATGGACGC TTAATTATTG GGGCAACCTC CGAAGACGTG
GGATGGACTC CTCATAATAC TCCCCAAGGG ATCGCTACGT TAATCCAACA GGCAACTCGA
CTCTATCGGG CGATCGCAGA CTGGCCGATT GAAGAAATTT GGTGGGGTTA TCGTCCAGGG
ACACCGGATG AATTACCGAT TTTAGGGCAA AGTTCCTGTG AAAATTTGAT TTTAGCCACG
GGACACTACC GTAACGGGAT TTTACTCGCT CCTGTGACCG CTAGTTTAAT CGCCGATTTA
ATTATTAATC AAACATCCGA TCCGCTTTTA GATGCTTTTC GAGGCGATCG CTTTTATACC
CAACCTAGTC CCACAACCGT AATTATGACC GCTTTTAATA GTATTCCGAC AAAATCCCAG
AACGGAACCA ACGGATCACC CCCCTATCGA GAACTTACTC CGACTAACGC TGATGAATTA
ATCATTGCAG GTCGTCGCTT TCGATCGCGC TTGATGACGG GAACTGGGAA ATATCCTACC
ATTGCCAGTA TGCAGCAAAG TGTAGCCGTC AGTGGGTGTC AAATTGTGAC CGTAGCCGTT
CGACGAGTTC AAACGAAAGC CCCCGGCCAT GAAGGGTTAG CCGAAGCCCT CGACTGGAGT
AAAATTTGGA TGTTGCCCAA TACCGCCGGA TGTCAAACGG CCGAGGAAGC CATACGAGTC
GCTAGATTAG GGCGGGAAAT GGCTAAATTA TTGGGTCAAG AGGACAATAA TTTCGTAAAA
TTAGAAGTTA TCCCCGATTC TAAATATTTG TTACCTGACC CGATTGGCAC GCTACAAGCT
GCGGAACAAT TGGTTAAGGA AGGGTTTGCC GTTTTGCCCT ACATCAACGC TGATCCTCTG
TTGGCTAAGC GTTTGGAAGA GGTGGGGTGT GCGACGGTGA TGCCCTTGGG ATCTCCCATC
GGATCGGGTC AAGGTATCCG AAATACCGCT AATATTGCCA TTATCATCGA AGAAGCGACG
GTTCCGGTGG TGGTGGATGC GGGGATAGGA ACCCCCAGTG AAGCTGCCCA GGCGATGGAA
TTGGGGGCGG ATGCAGTGTT AATTAATAGT GCGATCGCTT TGGCTAAAGA TCCTGTAATC
ATGGCTAAGG CCATGGGAAT GGCAACAGAA GCGGGACGGT TAGCCTATCT CGCGGGACGG
ATACCCGTTA AAGAATATGC TAGTGCCAGT TCTCCCTTAA CGGGCAATAT TAACAGTAAT
CAGTTAGCCG CGATCGGTTA A
 
Protein sequence
MNASNDILII GGGIIGLAIA VDLKLRGASV TVLDRNFPHR ASQAAAGMLA PFAENLPPGP 
MLDLCLKSRW LYPEWVRKLQ DLTGLDLGYN PCGILAPVYE LPSEQFCHNT ASQWLDKTAI
RLYQPGLGDD VVGGWWHPED GQVDNRQVMA ALQQAAQQLG IQVKNGVTVQ TIQQRQGKIA
SILTSEGEFE AKTYVLASGS WASQILPLPV RPIKGQMLAV TMPQQPGEPF PLQRVLFGPS
TYLVPRRNGR LIIGATSEDV GWTPHNTPQG IATLIQQATR LYRAIADWPI EEIWWGYRPG
TPDELPILGQ SSCENLILAT GHYRNGILLA PVTASLIADL IINQTSDPLL DAFRGDRFYT
QPSPTTVIMT AFNSIPTKSQ NGTNGSPPYR ELTPTNADEL IIAGRRFRSR LMTGTGKYPT
IASMQQSVAV SGCQIVTVAV RRVQTKAPGH EGLAEALDWS KIWMLPNTAG CQTAEEAIRV
ARLGREMAKL LGQEDNNFVK LEVIPDSKYL LPDPIGTLQA AEQLVKEGFA VLPYINADPL
LAKRLEEVGC ATVMPLGSPI GSGQGIRNTA NIAIIIEEAT VPVVVDAGIG TPSEAAQAME
LGADAVLINS AIALAKDPVI MAKAMGMATE AGRLAYLAGR IPVKEYASAS SPLTGNINSN
QLAAIG