Gene PCC8801_4109 details

Gene Information

Locus tag: PCC8801_4109
Symbol:
ID: 7101899
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 8801
Kingdom: Bacteria
Replicon accession: NC_011726
Strand:
Start bp: 4306670
End bp: 4308670
Gene Length: 2001 bp
Protein Length: 666 aa
Translation table: 11
GC content: 49%
IMG OID: 643477098
Product: glycine oxidase ThiO
Protein accession: YP_002374197
Protein GI: 218248826
COG category: [E] Amino acid transport and metabolism
              [H] Coenzyme transport and metabolism
COG ID: [COG0665] Glycine/D-amino acid oxidases (deaminating)
        [COG2022] Uncharacterized enzyme of thiazole biosynthesis
TIGRFAM ID: [TIGR02352] glycine oxidase ThiO
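
The Start bp, End bp, Gene Length and Protein Length values listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names gene_length and protein_length are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
length = gene_length(4306670, 4308670)
print(length, protein_length(length))  # 2001 666
```
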


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGAACGCAA GTAACGACAT TCTCATCATC GGCGGCGGAA TTATCGGACT AGCCATTGCC 
GTTGACCTTA AATTACGAGG TGCATCTGTC ACTGTCCTTG ACCGCAATTT TCCCCATAGG
GCAAGTCAAG CAGCAGCCGG AATGTTAGCC CCCTTCGCAG AAAATCTTCC CCCTGGTCCG
ATGCTAGATC TCTGCTTGAA GTCCCGATGG CTATACCCGG AATGGGTTCG TAAACTGCAA
GACCTCACAG GACTCGATTT AGGCTACAAT CCCTGTGGTA TCCTCGCCCC CGTTTATGAG
TTACCCTCGG AACAATTTTG TCATAATACC GCTTCTCAAT GGCTAGATAA AACGGCTATT
CGTCTGTATC AACCCGGGTT AGGGGATGAT GTGGTCGGAG GATGGTGGCA TCCCGAAGAT
GGCCAAGTAG ACAACCGCCA AGTAATGGCA GCCTTACAGC AAGCAGCCCA ACAATTAGGT
ATTCAGGTAA AGAACGGTGT CACAGTTCAG ACGATCCAAC AGCGTCAGGG AAAAATAGCC
AGTATTTTAA CATCTGAAGG CGAATTTGAA GCGAAAACCT ATGTTTTAGC GAGTGGATCT
TGGGCAAGTC AGATTTTACC CTTACCCGTC CGTCCGATCA AAGGGCAAAT GTTAGCCGTC
ACTATGCCAC AGCAACCCGG AGAACCTTTC CCTCTGCAAC GGGTGTTATT TGGTCCGAGT
ACCTATCTGG TCCCCCGACG CAATGGACGC TTAATTATTG GGGCAACCTC CGAAGACGTG
GGATGGACTC CTCATAATAC TCCCCAAGGG ATCGTTACGT TAATCCAACA GGCAACTCGA
CTCTATCCGG CGATCGCAGA CTGGCCGATT GAAGAAATTT GGTGGGGTTA TCGTCCAGGG
ACACCAGATG AATTACCAAT TTTAGGGCAA AGTTCCTGTG AAAATTTGAT TTTAGCCACG
GGACACTACC GTAACGGGAT TTTACTCGCT CCTGTGACCG CTAGTTTAAT CGCCGATTTA
ATTATTAATC AAACATCCGA TCCGCTTTTA GATGCTTTTC GAGGCGATCG CTTCTATACC
CAACCTAGTC CCACAACCGT AATTATGACC GCTTTTAATA GTATTCCGAC AAAATCCCAG
AACGGAACCA ACGGATCACC CCCCTATCGA GAACTTACTC CGACTAACGC TGATGAATTA
ATCATTGCAG GTCGTCGCTT TCGATCGCGC TTGATGACGG GAACTGGGAA ATATCCTACC
ATTGCCAGTA TGCAGCAAAG TGTAGCCGTC AGTGGGTGTC AAATTGTGAC CGTAGCCGTT
CGACGAGTTC AAACGAAAGC CCCCGGCCAT GAAGGGTTAG CCGAAGCCCT CGACTGGAGT
AAAATTTGGA TGTTGCCCAA TACCGCCGGA TGTCAAACGG CCGAGGAAGC CATACGAGTC
GCTAGATTAG GGCGGGAAAT GGCTAAATTA TTGGGTCAAG AGGACAATAA TTTCGTAAAA
TTAGAAGTTA TCCCCGATTC TAAATATTTG TTACCTGACC CGATTGGCAC GCTACAAGCT
GCGGAACAAT TGGTTAAGGA AGGGTTTGCC GTTTTGCCCT ACATCAACGC TGATCCTCTG
TTGGCTAAGC GTTTGGAAGA GGTGGGGTGT GCGACGGTGA TGCCCTTGGG ATCTCCCATC
GGATCGGGTC AAGGTATCCG AAATACCGCT AATATTGCCA TTATCATCGA AGAAGCGACG
GTTCCGGTGG TGGTGGATGC GGGGATAGGA ACCCCCAGTG AAGCTGCCCA GGCGATGGAA
TTGGGGGCGG ATGCGGTGTT AATTAATAGT GCGATCGCTT TGGCTAAAGA TCCTGTAATC
ATGGCTAAGG CCATGGGAAT GGCAACAGAA GCGGGACGGT TAGCCTATCT CGCGGGACGG
ATACCCGTTA AAGAATATGC TAGTGCCAGT TCTCCCTTAA CGGGCAATAT TAACAGTAAT
CAGTTAGCCG CGATCGGTTA A
 
Protein sequence
MNASNDILII GGGIIGLAIA VDLKLRGASV TVLDRNFPHR ASQAAAGMLA PFAENLPPGP 
MLDLCLKSRW LYPEWVRKLQ DLTGLDLGYN PCGILAPVYE LPSEQFCHNT ASQWLDKTAI
RLYQPGLGDD VVGGWWHPED GQVDNRQVMA ALQQAAQQLG IQVKNGVTVQ TIQQRQGKIA
SILTSEGEFE AKTYVLASGS WASQILPLPV RPIKGQMLAV TMPQQPGEPF PLQRVLFGPS
TYLVPRRNGR LIIGATSEDV GWTPHNTPQG IVTLIQQATR LYPAIADWPI EEIWWGYRPG
TPDELPILGQ SSCENLILAT GHYRNGILLA PVTASLIADL IINQTSDPLL DAFRGDRFYT
QPSPTTVIMT AFNSIPTKSQ NGTNGSPPYR ELTPTNADEL IIAGRRFRSR LMTGTGKYPT
IASMQQSVAV SGCQIVTVAV RRVQTKAPGH EGLAEALDWS KIWMLPNTAG CQTAEEAIRV
ARLGREMAKL LGQEDNNFVK LEVIPDSKYL LPDPIGTLQA AEQLVKEGFA VLPYINADPL
LAKRLEEVGC ATVMPLGSPI GSGQGIRNTA NIAIIIEEAT VPVVVDAGIG TPSEAAQAME
LGADAVLINS AIALAKDPVI MAKAMGMATE AGRLAYLAGR IPVKEYASAS SPLTGNINSN
QLAAIG
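
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below is a minimal illustration, not the tool that produced this record. It assumes that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which codons may act as starts), and it builds the codon table from the usual compact TCAG-ordered string. The names CODON_TABLE and translate are illustrative. Only the first and last blocks of the gene sequence are used as input; the last block is in frame because the 1980 bases before it are a multiple of three.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


first_block = "ATGAACGCAA GTAACGACAT TCTCATCATC"  # start of the gene sequence
last_block = "CAGTTAGCCG CGATCGGTTA A"            # final line of the gene sequence
print(translate(first_block))  # MNASNDILII
print(translate(last_block))   # QLAAIG
```

Both results match the corresponding ends of the protein sequence listed above.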