Gene PCC8801_4344 details

Gene Information

Locus tag: PCC8801_4344
Symbol:
ID: 7105141
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 8801
Kingdom: Bacteria
Replicon accession: NC_011726
Strand:
Start bp: 4557814
End bp: 4558866
Gene Length: 1053 bp
Protein Length: 350 aa
Translation table: 11
GC content: 37%
IMG OID: 643477323
Product: Mandelate racemase/muconate lactonizing protein
Protein accession: YP_002374422
Protein GI: 218249051
COG category: [M] Cell wall/membrane/envelope biogenesis
              [R] General function prediction only
COG ID: [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily
TIGRFAM ID:
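
The coordinates and lengths in the table above agree with each other if the start and end positions are treated as inclusive and the protein length excludes the terminal stop codon. The following is a minimal sketch of that arithmetic. The function names are hypothetical, and the inclusive-coordinate convention is an assumption, supported only by the fact that it reproduces the listed values.

```python
# Values copied from the Gene Information table.
start_bp = 4557814
end_bp = 4558866


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming inclusive start/end coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acids encoded by a complete CDS, not counting the stop codon."""
    return gene_len_bp // 3 - 1


gene_len = gene_length(start_bp, end_bp)
prot_len = protein_length(gene_len)
print(gene_len, prot_len)  # 1053 350
```

Both results match the Gene Length (1053 bp) and Protein Length (350 aa) fields.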


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGCGTAA CTTTTGACCT TTTTACTGTT CATAAACGCT TTGCTTTAAC AATTAGTCGA 
GGAACCTATA CAGAAAATAC CAACATTGGG TTAAAAATTG AATCCGAAGG GGTTGAAGGA
TTGGGAGAAG CCACTCCTTT TTCTGTCGTT AATGGCAAAG AAAAAAACAG CGAACAATTA
CAACAAGAAT TAGAGAAACT CATCCCAATT ATAGAACCTT TTCATCCGCT CGAAAATCAA
GAAATTGAAG GAATTTTAAA ACAAAATAAT GTTAGCTCAT CTCTACGCGC TGCCATTGAT
ACTGCGCTGT ATGATTGGTT AGGAAAAAAA GTCAATTTAC CGCTATGGAA AATTTGGGGA
TTAAACCGCA ATCGCATTCC TCCGACTTCC GTTACCATTG GTATTAGTTC CCCTGAAAAA
GCCGTCGAAA GAGTCAGAAA TTGGCTCAAT TTTATGGACG CTAAGGTCTT AAAAATTAAG
TTAGGTTCTC CCGATGGGAT AGAAGCAGAT AAAGCCATGT TATTAGCTAT TCGTCAAGAA
TGTCCTAATC TCCCTTTAAC CGTTGATGCG AATGGGGGAT GGAATTTAAA AGAGGCGATT
CTAATGAGTG AGTGGTTAGC AACGCAAAAT GTTAAATATA TTGAACAACC TTTTGCCGTT
GGAGAAGAGA GCAATTTACC CCAATTGTAT CAGCGATCGC CGCTTCCTAT TTTTATTGAT
GAAAGTTGTT TTAATAGTGA AGATATTATT ACCTATTCTC CCTCTATTCA TGGCATTAAT
ATTAAACTAA TGAAAGCCGG AGGGTTAAGC GAAGTCATGA GGATGATTGC GATAGCTAAA
GCGTGTAAAC TACAGATTAT GTATGGGTGT TATTCTGATA GTAGTTTAGC CAATACTGCG
CTGTGTCATC TTGCTCCTTA TGCGGATTAT TTAGACTTAG ATAGTCATTT AAACTTAATT
GATGATCCCT TTCAAGGGGT TAGTTTAGAA AGAGGAAGAT TAGTCCTTAA TAATTTACCA
GGATTAGGGG TAAAATATAG AAATGAAACC TAA
 
Protein sequence
MRVTFDLFTV HKRFALTISR GTYTENTNIG LKIESEGVEG LGEATPFSVV NGKEKNSEQL 
QQELEKLIPI IEPFHPLENQ EIEGILKQNN VSSSLRAAID TALYDWLGKK VNLPLWKIWG
LNRNRIPPTS VTIGISSPEK AVERVRNWLN FMDAKVLKIK LGSPDGIEAD KAMLLAIRQE
CPNLPLTVDA NGGWNLKEAI LMSEWLATQN VKYIEQPFAV GEESNLPQLY QRSPLPIFID
ESCFNSEDII TYSPSIHGIN IKLMKAGGLS EVMRMIAIAK ACKLQIMYGC YSDSSLANTA
LCHLAPYADY LDLDSHLNLI DDPFQGVSLE RGRLVLNNLP GLGVKYRNET
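
The gene and protein sequences above are printed in space-separated blocks of ten, so they need to be joined before use. The sketch below shows one way to strip the spacing, compute GC content, and translate the CDS. It uses only the Python standard library, and the helper names (`clean`, `gc_percent`, `translate`) are hypothetical. Translation table 11 assigns the same amino acids to codons as the standard genetic code and differs only in the start codons it permits, so a standard codon table is sufficient for this gene, which begins with ATG.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G; third position
# varies fastest). Table 11 uses these same codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence, as printed in the record.
first_blocks = "ATGCGCGTAA CTTTTGACCT TTTTACTGTT"
print(translate(first_blocks))  # MRVTFDLFTV
```

The translated fragment matches the first ten residues of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein and the GC content field.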