Gene Phep_1166 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_1166 
Symbol 
ID8252264 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp1380374 
End bp1381660 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content42% 
IMG OID644934821 
ProductFolC bifunctional protein 
Protein accessionYP_003091446 
Protein GI255531074 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0285] Folylpolyglutamate synthase 
TIGRFAM ID[TIGR01499] folylpolyglutamate synthase/dihydrofolate synthase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACTATA AGCAAACCAT AGACTATTTA TACAGCAGGC TCCCCATGTT TACACGTGTG 
GGAGCTGTTG CTTTCAAAAA AGACCTGCAC AACACCATTT TGATGTGCGC ACAACTCGAT
AATCCGCAAA ACAAGTTTAA AACCATACAC GTTGGGGGCA CAAACGGAAA GGGATCCACA
TCGCATACAC TGGCTGCTAT TTTTCAGCAG GCCGGCTATA AAACAGGTTT ATACACCTCG
CCTCACCTGA AGGATTTCAG GGAAAGGATC AGGATCAATG GTACAATGGT ACCGGAAACC
TTTGTAACCG ATTTTGTGAA CCGGCAAAAA GAAACTATTG AAGCCATCAG CCCCTCTTTT
TTTGAAGTAA CCGTAGCTAT GGCCTTTGCT TACTTTGCCG AAGAAAAAGT AGACATAGCC
ATCATTGAAG TTGGCCTGGG TGGCAGGCTC GACTCTACCA ATATCATTAC ACCAGAATTG
TCTGTAATTA CCAATATCAG CTTAGACCAT ACCAATATGC TGGGCAATAC CCTGACAGAA
ATTGCTACAG AAAAAGCCGG AATAATCAAA CCCGGTATAC CAGTGGTCAT CGGCGAAAAA
CAAACTGAAA GTGAAGCGGT ATTCCTTACA AAAGCAAGCA CAACCAATAG TAAAATTGTT
TTTGCCGATC AGGAATTGAG TACCACAAAC ACCTTCCGTG AAAAAGAATA CCTGGTTACT
TCCATTCTGA AAAATAAAAT ACTTATTTAT AAGGACCTGC AATTAGACCT TAACGGCATT
TATCAGTTAA AAAATATCCT TACCGTATTG CAGGCTGTCG ACATTCTAAA AAATAAAGGT
TATACCTTAA ATGACAAGGC CCTTTACACT GCGTTAAAAA ATGTAAAAAC CTTAACAGGA
CTGCAGGGCA GATGGCAAAA ACTCAGTGAA CACCCGCTCG TTATATGTGA TACCGGACAC
AATATGGCCG GTATCAAAGA AGTGGTACAA AACCTGAAAG AAACCCCTTT CGAAAAACTG
CACATCGTTA TTGGCATGGT AAAAGATAAA GACATCAGCG GCGTGCTTAC CCTGCTTCCT
ACCGAAGCCA TTTATTATTT CTGCCAGCCA CAACTGGAAA GGGCACTTCC CGCCCAGGAG
CTTGCCCTGC AGGCGAAAGT GCAGCAGCTA CATGGAAATG TTTTTGATAC CGTTACAGCT
GCACTCAATG CAGCAAAAGA AAATGCAGGT AAAGACGATC TGATCTTTAT TGGGGGCAGT
AACTTTGTTG TAGCCGAAGT GCTATAA
 
Protein sequence
MNYKQTIDYL YSRLPMFTRV GAVAFKKDLH NTILMCAQLD NPQNKFKTIH VGGTNGKGST 
SHTLAAIFQQ AGYKTGLYTS PHLKDFRERI RINGTMVPET FVTDFVNRQK ETIEAISPSF
FEVTVAMAFA YFAEEKVDIA IIEVGLGGRL DSTNIITPEL SVITNISLDH TNMLGNTLTE
IATEKAGIIK PGIPVVIGEK QTESEAVFLT KASTTNSKIV FADQELSTTN TFREKEYLVT
SILKNKILIY KDLQLDLNGI YQLKNILTVL QAVDILKNKG YTLNDKALYT ALKNVKTLTG
LQGRWQKLSE HPLVICDTGH NMAGIKEVVQ NLKETPFEKL HIVIGMVKDK DISGVLTLLP
TEAIYYFCQP QLERALPAQE LALQAKVQQL HGNVFDTVTA ALNAAKENAG KDDLIFIGGS
NFVVAEVL