Gene CPF_2785 details

Gene Information

Locus tag: CPF_2785
Symbol: fhs
ID: 4203898
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens ATCC 13124
Kingdom: Bacteria
Replicon accession: NC_008261
Strand:
Start bp: 3047750
End bp: 3049420
Gene Length: 1671 bp
Protein Length: 556 aa
Translation table: 11
GC content: 31%
IMG OID: 638083653
Product: formate--tetrahydrofolate ligase
Protein accession: YP_697157
Protein GI: 110800131
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG2759] Formyltetrahydrofolate synthetase
TIGRFAM ID:
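
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 3047750
END_BP = 3049420
GENE_LENGTH_BP = 1671
PROTEIN_LENGTH_AA = 556


def span_length(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp):
    """Number of codons minus one terminal stop codon (unspliced CDS assumed)."""
    codons, remainder = divmod(gene_length_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both comparisons hold for this record: the span covers 1671 bp, which is 557 codons, and dropping the stop codon leaves 556 residues.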


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAATG ATATTGAAAT AGCACAAAGT GCTAAAATGG AGCCAATAAT AAATATTGCT 
AAAAAAATAG GTCTTGGAGA AGATGATATT GAGCTTTATG GAAAATATAA ATGTAAGATA
TCTTTAGATG CAATAAAAAA ACTAGAAAAT AATAAGGATG GTAAGCTTGT TTTAGTTACA
GCCATAAATC CAACGCCAGC AGGAGAGGGA AAATCTACTG TTACAGTAGG ATTAGGACAA
GCGTTAAATA AAATAGGAAA AAATACTGTT ATAGCCTTAA GAGAGCCATC ATTAGGACCT
GTATTTGGAA TAAAAGGTGG AGCAGCTGGT GGTGGATATG CTCAAGTAGT TCCTATGGAA
GATATAAATC TTCATTTTAC AGGGGATATG CATGCAATAA CTTCAGCTAA TAATCTTTTA
TGTGCAGCTA TAGATAACCA TATTCATCAA GGTAATTTAT TAAGAATAGA TTCAAGAAGA
ATTGTATTTA AAAGAGTTAT GGATATGAAT GATAGAGCCT TAAGAAATAT AGTTGTTGGA
ATGGGTGGTA AAATAAACGG ATTCTTAAGA GAAGATGGAT TTATGATCAC TGTTGCATCA
GAAATAATGG CTATATTATG TATGGCTAGT GATTTAGAAG ACTTAAAAGA GAGAATGGGA
AATATCTTAA TTGCATATAA CTTAGATGGA GAGCCTGTTT ATGCTAAGGA ATTAGAAGTT
CAAGGAGCTA TGGCTTTACT TATGAAAGAT GCTATAAAAC CTAACCTTGT TCAAACTTTA
GAAAATACTC CAGCAATAAT TCATGGTGGA CCATTTGCCA ATATAGCTCA TGGATGTAAT
TCAATAATAG CAACTAAGAC AGCTTTAAAA ATGAGTGATA TAACTATAAC AGAGGCTGGC
TTTGGAGCGG ATTTAGGTGC AGAAAAATTC TTAGATATAA AATGTAGATA CGGAAATTTA
AATCCTGATT GTGTAGTTTT AGTAGCTACT ATAAGAGCTT TAAAACATCA CGGTGGCGTA
AAGAAAGATG AATTAAACAT ATCAAATGTA GATGCTTTAA ATAAAGGAAT GAAAAACTTA
GAAAAGCAAA TAGAAAATAT AAAAGCTTAT GGAGTACCAG TTGTTGTTGC TATAAATAAG
TTTATAACTG ATTCAGATGA AGAGGTTAAA GCTATAGAAG ACTTCTGCAA AAATATAGGA
GTAGAGGTTA GCTTAACAGA AGTATGGGAA AAAGGTGGAG AAGGCGGAAT TGATTTAGCT
AATAAAGTTA TAAAAACAAT GGAAACTGAG CCTTCAAACT TTAAGATGAT TTACGATTCA
GAAGAATCAA TAAATGATAA AATATTAAAA ATAGTTCAAA CTATATATGG TGGAAAAGGA
GTAAACTATA CTCCTCAAGC TCTTAAACAA ATAGCTGAAA TTGAGAAATT TAATTTAGAT
AAGCTTCCAA TATGTATGGC TAAGACACAG TATTCATTAT CAGATAATCC AAGTCTTTTA
GGAAGACCTG AAAACTTTGA TATAACTGTT AGAGAGGTTA GAGTTTCAAA TGGAGCTGGA
TTCATAGTTG TTTTAACAGG GGATGTTATG ACAATGCCAG GTTTACCTAA AGTTCCTGCT
GCTAATAGAA TGGATATAAA AGACAATGGA GAAATAGTAG GATTATTCTA A
 
Protein sequence
MKNDIEIAQS AKMEPIINIA KKIGLGEDDI ELYGKYKCKI SLDAIKKLEN NKDGKLVLVT 
AINPTPAGEG KSTVTVGLGQ ALNKIGKNTV IALREPSLGP VFGIKGGAAG GGYAQVVPME
DINLHFTGDM HAITSANNLL CAAIDNHIHQ GNLLRIDSRR IVFKRVMDMN DRALRNIVVG
MGGKINGFLR EDGFMITVAS EIMAILCMAS DLEDLKERMG NILIAYNLDG EPVYAKELEV
QGAMALLMKD AIKPNLVQTL ENTPAIIHGG PFANIAHGCN SIIATKTALK MSDITITEAG
FGADLGAEKF LDIKCRYGNL NPDCVVLVAT IRALKHHGGV KKDELNISNV DALNKGMKNL
EKQIENIKAY GVPVVVAINK FITDSDEEVK AIEDFCKNIG VEVSLTEVWE KGGEGGIDLA
NKVIKTMETE PSNFKMIYDS EESINDKILK IVQTIYGGKG VNYTPQALKQ IAEIEKFNLD
KLPICMAKTQ YSLSDNPSLL GRPENFDITV REVRVSNGAG FIVVLTGDVM TMPGLPKVPA
ANRMDIKDNG EIVGLF
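
The relationship between the gene sequence and the protein sequence can be reproduced with a short translation routine. This is a minimal sketch, not the tool that generated this record. It assumes the codon-to-amino-acid assignments of translation table 11 match the standard genetic code for internal codons, strips the spaces that separate the 10-base groups, and stops at the first stop codon. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; assumed valid for table 11
# as far as internal codon-to-amino-acid assignments are concerned.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string (whitespace allowed) up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three 10-base groups of the gene sequence above.
    print(translate("ATGAAAAATG ATATTGAAAT AGCACAAAGT"))
```

Applied to the first three groups of the gene sequence, the routine yields the first ten residues of the protein sequence (MKNDIEIAQS). Passing the full gene sequence should reproduce the full 556-residue protein under the same assumptions.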