Gene PCC8801_2202 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_2202 
Symbol 
ID7102450 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp2277057 
End bp2278343 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content45% 
IMG OID643475256 
Productdihydroorotase 
Protein accessionYP_002372386 
Protein GI218247015 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0044] Dihydroorotase and related cyclic amidohydrolases 
TIGRFAM ID[TIGR00857] dihydroorotase, multifunctional complex type 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAATACTT CTACTGTATT TCGTCAAGTC CGCGTATTAG ACCCTTTCGC TAATACAGAC 
ATGATCGCCG ATGTTTGGCT CAATGATGGT AAAATTCAAG CCATTGACCC GCATCTTAAC
GTTATTTCCC CAGAAACAAC TATTATTGAG GCACAAGGAT TAATTCTCGG AACAGGGTTA
GTTGACCTCT ACAGTTATAG TGGAGAACCG GGGTTTGAAG ACAGGGAAAC CTTAACTTCT
CTTGCTGCTG CGGCTATTGC TGGAGGGTTT ACCCGTGTCG CTATCTTACC TCAGACTAAA
CCCGGGGTTG ATAATGCCGC GACTTTCTCT TTTTTACAAC AAAAAGCCCA AACTTTGCCT
AATTCCCCCC ATCTTCATTT TTGGGGAAAC CTCACCCTAG GGGGACAAGG GAAACAAATG
ACAGAATTAG CCGAATTAGC GATGGCTGGC GTGGTGGGGT TTACGGACGG TCAAAGTATC
GAGAATTTGG GCTTATTAAG ACGAATTTTA GAATATTTAA AACCATTAGA AAAACCCGTG
GCCTTAGTTC CTGAGTCTTC CTCTCTCAAA GGGAATGGAG TCATGCGAGA GGGACTCCTT
TCCATTCACT ATGGACTCCC TGGAAACCCT GCGATCGCTG AATCTTCGGC GATCGCTACT
ATTTTAGAGA TAGTAGCCGA AATTAATACC CCCGTCCATC TCATGGGTAT TTCTACCCGT
CGCGGGGTGG AATTAATGGC CTCAGCAAAA GCGAGAGGTC TACCCATCAC AGCAAGTACC
TCTTGGATGC ACTTACTCCT AGATACTCAC GATATTTCCA ATTATGACCC TAGTTTACGC
TTAGAACCGC CTTTAGGCAA CCCCGAAGAT CGTCAAGCGT TAATTGAGGG AGTCAGAGAG
GGAATTATTG ATGCGATCGC CGTTAACCAT CGTTCCTTGA CCTATGAAGA GAAAACGGTC
GCTTTTGCTG AAGCTCCAAC GGGGGCGATC GGGTTAGAAT TAGCCTTACC CTTATTATGG
GATCAATTAG TCGTTCAGGG GGAATGGTCG CCCCTACAAT TATGGAAGGC TTTAAGTTGT
TACCCTTGTC AATGTTTAGG GTTAGAAGTC GCGGGTTTAC AAGTAGGACA ACCGGCCGAA
TTAATTTTAT TTGATCCCCA AAAAACTTGG CAAGTAGACG GGACAACTCT TCAGTGTTTA
GGAAGGAATA CCCCTTGGTA TCAACAGGAA ATTAAGGGAC GAGTGATCAC CTCATTCGTT
GGCAAAGAAA ATAATACCCT GACCTGA
 
Protein sequence
MNTSTVFRQV RVLDPFANTD MIADVWLNDG KIQAIDPHLN VISPETTIIE AQGLILGTGL 
VDLYSYSGEP GFEDRETLTS LAAAAIAGGF TRVAILPQTK PGVDNAATFS FLQQKAQTLP
NSPHLHFWGN LTLGGQGKQM TELAELAMAG VVGFTDGQSI ENLGLLRRIL EYLKPLEKPV
ALVPESSSLK GNGVMREGLL SIHYGLPGNP AIAESSAIAT ILEIVAEINT PVHLMGISTR
RGVELMASAK ARGLPITAST SWMHLLLDTH DISNYDPSLR LEPPLGNPED RQALIEGVRE
GIIDAIAVNH RSLTYEEKTV AFAEAPTGAI GLELALPLLW DQLVVQGEWS PLQLWKALSC
YPCQCLGLEV AGLQVGQPAE LILFDPQKTW QVDGTTLQCL GRNTPWYQQE IKGRVITSFV
GKENNTLT