Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cyan8802_4149 |
Symbol | |
ID | 8393500 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8802 |
Kingdom | Bacteria |
Replicon accession | NC_013161 |
Strand | + |
Start bp | 4277116 |
End bp | 4279116 |
Gene Length | 2001 bp |
Protein Length | 666 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 644982064 |
Product | glycine oxidase ThiO |
Protein accession | YP_003139776 |
Protein GI | 257061888 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG2022] Uncharacterized enzyme of thiazole biosynthesis |
TIGRFAM ID | [TIGR02352] glycine oxidase ThiO |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.572473 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGCAA GTAACGACAT TCTCATCATC GGCGGCGGAA TTATCGGACT AGCCATTGCC GTTGACCTTA AATTACGCGG TGCATCTGTT ACTGTCCTTG ACCGCAACTT TCCCCATAGG GCAAGTCAAG CAGCAGCCGG AATGTTAGCC CCCTTCGCAG AAAATCTTCC CCCTGGTCCA ATGCTGGATC TTTGCTTGAA GTCCCGATGG CTATACCCGG AATGGGTTCG TAAACTGCAA GACCTCACAG GACTCGATTT AGGCTACAAT CCCTGTGGTA TCCTCGCCCC CGTTTATGAG TTACCCTCGG AACAATTTTG TCATAATACC GCTTCTCAAT GGCTAGATAA AACGGCTATT CGTCTGTATC AACCCGGGTT AGGGGATGAT GTGGTCGGAG GATGGTGGCA TCCCGAAGAT GGCCAAGTAG ACAACCGCCA AGTAATGGCA GCCTTACAGC AAGCAGCCCA ACAATTAGGT ATTCAGGTAA AGAACGGTGT CACAGTTCAG ACGATCCAAC AGCGTCAGGG AAAAATAGCC AGTATTTTAA CATCTGAAGG CGAATTTGAA GCGAAAACCT ATGTTTTAGC GAGTGGATCT TGGGCAAGTC AGATTTTACC CTTACCCGTC CGTCCGATCA AAGGGCAAAT GTTAGCCGTC ACTATGCCAC AGCAACCCGG AGAACCTTTC CCTCTGCAAC GGGTGTTATT TGGTCCGAGT ACCTATCTAG TCCCCCGACG CAATGGACGC TTAATTATTG GGGCAACCTC CGAAGACGTG GGATGGACTC CTCATAATAC TCCCCAAGGG ATCGCTACGT TAATCCAACA GGCAACTCGA CTCTATCGGG CGATCGCAGA CTGGCCGATT GAAGAAATTT GGTGGGGTTA TCGTCCAGGG ACACCGGATG AATTACCGAT TTTAGGGCAA AGTTCCTGTG AAAATTTGAT TTTAGCCACG GGACACTACC GTAACGGGAT TTTACTCGCT CCTGTGACCG CTAGTTTAAT CGCCGATTTA ATTATTAATC AAACATCCGA TCCGCTTTTA GATGCTTTTC GAGGCGATCG CTTTTATACC CAACCTAGTC CCACAACCGT AATTATGACC GCTTTTAATA GTATTCCGAC AAAATCCCAG AACGGAACCA ACGGATCACC CCCCTATCGA GAACTTACTC CGACTAACGC TGATGAATTA ATCATTGCAG GTCGTCGCTT TCGATCGCGC TTGATGACGG GAACTGGGAA ATATCCTACC ATTGCCAGTA TGCAGCAAAG TGTAGCCGTC AGTGGGTGTC AAATTGTGAC CGTAGCCGTT CGACGAGTTC AAACGAAAGC CCCCGGCCAT GAAGGGTTAG CCGAAGCCCT CGACTGGAGT AAAATTTGGA TGTTGCCCAA TACCGCCGGA TGTCAAACGG CCGAGGAAGC CATACGAGTC GCTAGATTAG GGCGGGAAAT GGCTAAATTA TTGGGTCAAG AGGACAATAA TTTCGTAAAA TTAGAAGTTA TCCCCGATTC TAAATATTTG TTACCTGACC CGATTGGCAC GCTACAAGCT GCGGAACAAT TGGTTAAGGA AGGGTTTGCC GTTTTGCCCT ACATCAACGC TGATCCTCTG TTGGCTAAGC GTTTGGAAGA GGTGGGGTGT GCGACGGTGA TGCCCTTGGG ATCTCCCATC GGATCGGGTC AAGGTATCCG AAATACCGCT AATATTGCCA TTATCATCGA AGAAGCGACG GTTCCGGTGG TGGTGGATGC GGGGATAGGA ACCCCCAGTG AAGCTGCCCA GGCGATGGAA TTGGGGGCGG ATGCAGTGTT AATTAATAGT GCGATCGCTT TGGCTAAAGA TCCTGTAATC ATGGCTAAGG CCATGGGAAT GGCAACAGAA GCGGGACGGT TAGCCTATCT CGCGGGACGG ATACCCGTTA AAGAATATGC TAGTGCCAGT TCTCCCTTAA CGGGCAATAT TAACAGTAAT CAGTTAGCCG CGATCGGTTA A
|
Protein sequence | MNASNDILII GGGIIGLAIA VDLKLRGASV TVLDRNFPHR ASQAAAGMLA PFAENLPPGP MLDLCLKSRW LYPEWVRKLQ DLTGLDLGYN PCGILAPVYE LPSEQFCHNT ASQWLDKTAI RLYQPGLGDD VVGGWWHPED GQVDNRQVMA ALQQAAQQLG IQVKNGVTVQ TIQQRQGKIA SILTSEGEFE AKTYVLASGS WASQILPLPV RPIKGQMLAV TMPQQPGEPF PLQRVLFGPS TYLVPRRNGR LIIGATSEDV GWTPHNTPQG IATLIQQATR LYRAIADWPI EEIWWGYRPG TPDELPILGQ SSCENLILAT GHYRNGILLA PVTASLIADL IINQTSDPLL DAFRGDRFYT QPSPTTVIMT AFNSIPTKSQ NGTNGSPPYR ELTPTNADEL IIAGRRFRSR LMTGTGKYPT IASMQQSVAV SGCQIVTVAV RRVQTKAPGH EGLAEALDWS KIWMLPNTAG CQTAEEAIRV ARLGREMAKL LGQEDNNFVK LEVIPDSKYL LPDPIGTLQA AEQLVKEGFA VLPYINADPL LAKRLEEVGC ATVMPLGSPI GSGQGIRNTA NIAIIIEEAT VPVVVDAGIG TPSEAAQAME LGADAVLINS AIALAKDPVI MAKAMGMATE AGRLAYLAGR IPVKEYASAS SPLTGNINSN QLAAIG
|
| |