Gene PCC8801_4419 details


Gene Information

Locus tag: PCC8801_4419
Symbol:
ID: 7104864
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 8801
Kingdom: Bacteria
Replicon accession: NC_011726
Strand:
Start bp: 4647155
End bp: 4648468
Gene Length: 1314 bp
Protein Length: 437 aa
Translation table: 11
GC content: 43%
IMG OID: 643477398
Product: dihydroorotase
Protein accession: YP_002374497
Protein GI: 218249126
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0044] Dihydroorotase and related cyclic amidohydrolases
TIGRFAM ID: [TIGR00857] dihydroorotase, multifunctional complex type
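
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes that the start and end coordinates are 1-based and inclusive, and that the gene length includes a single terminal stop codon; the constant and function names are illustrative and not part of the record.

```python
# Values copied from the record above.
START_BP = 4647155
END_BP = 4648468
GENE_LENGTH_BP = 1314
PROTEIN_LENGTH_AA = 437


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both checks hold for the values listed in this record.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

If either assumption were wrong (for example, 0-based coordinates), the first check would be off by one.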


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACACAAC TCCTCATTCG TCACGGTCAG ATTCTTTTGC CAGATGGCCA GTTGCTTCTA 
GGAGATGTTC TATGTGAAAA TGGAACTATC CGAGAAATCG CTCCAGAAAT TTCTGTAAAA
GATCTTAACA CTATCATAGA CGCTAGGGGA TTAACTTTGT TGCCTGGAGT CATTGATCCC
CAGGTACATT TCCGCGAACC GGGATTAGAA CACAAGGAAG ACTTATTTAC CGCTACCCGC
GCTTGTGCCA GAGGGGGGGT AACATCCTTC TTGGAAATGC CCAATACTAA CCCATTAACG
ATTACCCAAG CTACGTTAGA AGATAAATTA CAACGGGCTG CCCAAAAGTG TCTCGTTAAT
TATGGCTTTT TTATTGGGGC AACTCCCGAC AATTTACCGG ATTTATTGAC TGCTAACCCT
ACCTGTGGCA TTAAAATCTT TATGGGGTCG TCCCATGGGG CTTTATTGGT GAGTCGGGAA
GGGGAGTTAG AACCCATTTT TGCCAAAGGA AGTCGTTTAA TTGCAGTTCA TGCCGAAGAT
CAAGCGAGAA TACTGGAACG TCGTCGGGAA TTTGCCGGAA TTAGCGATCC AGCAGTGCAT
TCCCAGATTC AGGATGAAGA AGCTGCCCTC AACGCGACGA AATTAGCCTT AAAACTGTCG
AATAAGTATC AAAGGCGGTT ACACATTCTA CACCTTTCGA CGGGGATAGA AGCGGAATTT
TTGCGAGAAA ATAAGCCCAG TTGGGTAACA GCAGAAGTCA CGCCTCAACA TTTGTTATTA
AATACCGATG CTTATGAGAA AATTGGCACG TTAGCCCAGA TGAATCCTCC CTTGCGATCG
CCTGAAAATA ATGATATTCT TTGGCAAGCT TTGCTTGATG GGGTGATTGA TTTTATTGCG
ACAGATCACG CGCCCCATAC TTTGGAAGAA AAGGCAAAAC CCTATCCTAA TTCGCCTTCG
GGAATGCCAG GGGTAGAGAC TTCTTTACCC TTAATGTTAA CCCAAGCAAT CAAGGGAAAA
TGTAGTGTTG CCCAAGTGGT TAATTGGATG TCTACCGCAG TGGCTAAAGC CTATAAAATC
CCGAATAAGG GATTAATTGA ACCTGGATAT GATGCTGATT TAGTCTTAGT TGATTTAGAT
AATTATTATC CCGTTAAACG AGAAGACTTA CAAACTAAAT GCGGTTGGAG TCCTTTCGAG
GGTTGGGAAT TAACAGGATG GCCGATAGTA ACTATTGTCG GTGGAAAAGT CGTTTATGAT
CGGGGTCAAT TCAATACAGA TATTAGGGGC AAAGCATTAA CTTTTAGTAG TTAA
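
A GC content figure like the one listed in the gene information can be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted as a string in the 10-base grouped layout shown above; `gc_percent` is a hypothetical helper name, and the record does not state how its own value was rounded.

```python
def gc_percent(dna: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Example with the first two 10-base groups of the gene sequence.
print(round(gc_percent("ATGACACAAC TCCTCATTCG"), 1))
```

Passing the full gene sequence to `gc_percent` would give a value to compare against the GC content field.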
 
Protein sequence
MTQLLIRHGQ ILLPDGQLLL GDVLCENGTI REIAPEISVK DLNTIIDARG LTLLPGVIDP 
QVHFREPGLE HKEDLFTATR ACARGGVTSF LEMPNTNPLT ITQATLEDKL QRAAQKCLVN
YGFFIGATPD NLPDLLTANP TCGIKIFMGS SHGALLVSRE GELEPIFAKG SRLIAVHAED
QARILERRRE FAGISDPAVH SQIQDEEAAL NATKLALKLS NKYQRRLHIL HLSTGIEAEF
LRENKPSWVT AEVTPQHLLL NTDAYEKIGT LAQMNPPLRS PENNDILWQA LLDGVIDFIA
TDHAPHTLEE KAKPYPNSPS GMPGVETSLP LMLTQAIKGK CSVAQVVNWM STAVAKAYKI
PNKGLIEPGY DADLVLVDLD NYYPVKREDL QTKCGWSPFE GWELTGWPIV TIVGGKVVYD
RGQFNTDIRG KALTFSS
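
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this with a hand-built codon table; it assumes that the amino-acid assignments of translation table 11 match the standard genetic code (the tables differ in which codons may act as start codons, which does not matter here because the gene begins with ATG). The names `CODON_TABLE` and `translate` are illustrative.

```python
# Codon table built in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop spaces and line breaks
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues.
print(translate("ATGACACAAC TCCTCATTCG TCACGGTCAG"))  # MTQLLIRHGQ
```

Translating the last 24 bases of the gene sequence in the same way yields `KALTFSS`, followed by the TAA stop codon, which matches the end of the protein sequence.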