Gene Cyan8802_0203 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCyan8802_0203 
Symbol 
ID8389507 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8802 
KingdomBacteria 
Replicon accessionNC_013161 
Strand
Start bp200373 
End bp201629 
Gene Length1257 bp 
Protein Length418 aa 
Translation table11 
GC content37% 
IMG OID644978249 
ProductFolC bifunctional protein 
Protein accessionYP_003136007 
Protein GI257058119 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0285] Folylpolyglutamate synthase 
TIGRFAM ID[TIGR01499] folylpolyglutamate synthase/dihydrofolate synthase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0053743 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGTCTACCA ATTCCCTCCT AGAACCGTTT GAACGTTTTG GTGTTAACCT TGGACTCGAT 
AGAATTAAAC ACCTTTTAGA AAACTTTGAT AACCCCCATG ATCAAGTTCC TATTATTCAC
GTTGGGGGTA CTAATGGTAA GGGTTCAGTT TGTGCTTATT TGTCTTCAAT TTTAACCGAA
GCAGGGTATC AAGTAGGCCG TTATACTTCT CCCTATTTAA TCGATCAAAC TGAATCAATT
TGTATTAATA ATAAGCCTAT TTCTGAAGCT GATTTTACAA CTATTTTCAA TCAAATTAAA
ACGATTATTG AGTGTCAGAA AATAAGTCTC ACTAAATTTG AAGTATTAAC CGCGATCGCC
TGGATTTATT TTGCTCAACA AAAGGTTGAT ATTGCTATTC TCGAAGTTGG GTTAGGAGGA
AGATTAGACG CGACAAATAT TTGTGATCAT CCCTTGGTAA CTGTTATTAC TTCTATTAGT
CGAGATCACT GTCAAGAATT GGGATCAAAA TTAACTGATA TTGCCTACGA AAAAGCAGGT
ATTTTTAAAC TTGGCTCACC CGCTATTATC GCTCAAATTC CTCTAGAAGC TCAACAGGTT
ATGGAGGCTC GCTTACAAGC ATTAAACTGT CCCATAACCT GGGTAAAATC CGCGATAAAA
AGTCAAGAAA ATTGGGCAAT ATATGATCAT ATTCATTATC CTTTACCACT CTTGGGTGAG
ATCCAATTAA GCAATTCTGC TTTAGCCATT GAAACCATTA AACTTTTACA ACAAAAAGGC
TGGAATATTC CCTTAACTGC TATTCAAAAA GGAATGGAAA AAACCCAATG GTTAGGCAGA
CTACAATGGA TACAATGGCA AGATAAAACT ATTTTAATTG ATGGAGCCCA TAATCAAGCC
TCCGCGCAAG CTTTACGTCA GTATATAGAG AGTTTAAATC AACCCGTCAC TTGGATCATT
GGACTCCTAT CAACGAAAGA ACATGAGGAA ATTTTCCAAG CTTTATTCCG TCCTGATGAT
ACCGTATATT TAGTTCCCGT TCCCGATGAA AAAACCGCTA ACCCAGAAAA GTTATCTGAG
TTAGCCATTC AACTATGTCC TGAGTTGAAA AATAGCCAAG CTTTTTCCAG TTTATGGATA
GCCTTAGAAA CAGCAGTTCA ACACACTGAT CACTTAATTG TTTTATCCGG TTCTCTCTAT
TTAGTCGGAT ACTTTTTAAG GAAAAATAAC GCTATTTTTT CTGAGGATTG TCCTTGA
 
Protein sequence
MSTNSLLEPF ERFGVNLGLD RIKHLLENFD NPHDQVPIIH VGGTNGKGSV CAYLSSILTE 
AGYQVGRYTS PYLIDQTESI CINNKPISEA DFTTIFNQIK TIIECQKISL TKFEVLTAIA
WIYFAQQKVD IAILEVGLGG RLDATNICDH PLVTVITSIS RDHCQELGSK LTDIAYEKAG
IFKLGSPAII AQIPLEAQQV MEARLQALNC PITWVKSAIK SQENWAIYDH IHYPLPLLGE
IQLSNSALAI ETIKLLQQKG WNIPLTAIQK GMEKTQWLGR LQWIQWQDKT ILIDGAHNQA
SAQALRQYIE SLNQPVTWII GLLSTKEHEE IFQALFRPDD TVYLVPVPDE KTANPEKLSE
LAIQLCPELK NSQAFSSLWI ALETAVQHTD HLIVLSGSLY LVGYFLRKNN AIFSEDCP