Gene CPR_2471 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_2471 
Symbolfhs 
ID4205408 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp2689505 
End bp2691175 
Gene Length1671 bp 
Protein Length556 aa 
Translation table11 
GC content31% 
IMG OID642567021 
Productformate--tetrahydrofolate ligase 
Protein accessionYP_699725 
Protein GI110802143 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG2759] Formyltetrahydrofolate synthetase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAATG ATATTGAAAT AGCACAAAGT GCTAAAATGG AGCCAATAAT AAATATTGCT 
AAAAAAATAG GTCTTGAAGA AGATGATATT GAGCTTTATG GAAAATATAA ATGTAAGATA
TCTTTAGATG TAATAAAAAG ACTAGAAAAT AATAAGGATG GTAAGCTTGT TTTAGTTACA
GCTATAAATC CAACACCGGC AGGAGAGGGA AAATCTACTG TTACAGTAGG ATTAGGACAA
GCGTTAAATA AAATAGGAAA AAATACTGTT ATAGCCTTAA GAGAGCCATC ATTAGGACCT
GTATTTGGAA TAAAAGGTGG AGCAGCTGGT GGTGGATATG CTCAAGTAGT TCCTATGGAA
GATATAAATC TTCACTTTAC AGGGGATATG CATGCAATAA CTTCAGCTAA TAATCTTTTA
TGTGCAGCTA TAGATAACCA TATTCATCAA GGTAATTTAT TAAGAATAGA TTCAAGAAGA
ATTGTATTTA AAAGAGTTAT GGATATGAAT GATAGAGCTT TAAGAAATAT AGTTGTTGGA
ATGGGCGGTA AAATAAACGG ATTCTTAAGA GAAGATGGAT TTATGATCAC TGTTGCATCA
GAAATAATGG CTATATTATG TATGGCTAGT GATTTAGAAG ACTTAAAAGA GAGAATGGGA
AATATTTTAA TTGCATATAA CTTAGATGGA GAGCCTGTGT ATGCTAAGGA ATTAGAAATT
GAAGGAGCTA TGGCTTTACT TATGAAAGAT GCCATAAAAC CTAACCTTGT TCAAACTTTA
GAAAATACTC CAGCAATAAT TCATGGAGGC CCATTTGCAA ATATAGCTCA TGGATGTAAT
TCAATAATAG CAACTAAGAC AGCTTTAAAA ATGAGTGATA TAACTATAAC AGAGGCTGGC
TTTGGAGCAG ATTTAGGTGC AGAAAAATTC TTAGATATAA AATGTAGATA CGGAAATTTA
AATCCTGATT GTGTAGTTTT AGTAGCTACT ATAAGAGCTT TAAAACATCA CGGTGGAGTG
AAGAAAGATG AATTAAATAT ATCAAATGTA GATGCTTTAA ATAAAGGAAT GAAAAACTTA
GAAAAGCAAA TAGAAAATAT AAAAGCTTAT GGAGTACCAG TTGTTGTTGC TATAAATAAG
TTTATAACTG ATTCAGATGA AGAGGTTAAA GCTATAGAAG ACTTCTGCAA AAATATAGGA
GTAGAGGTTA GCTTAACAGA AGTTTGGGAA AAAGGTGGAG AAGGTGGAAT TGATTTAGCT
AATAAGGTTA TAAAAACAAT GGAAAATGAG CCTTCAAATT TTAAAATGAT TTACGATTCA
GAAGAATCAA TAAAGGATAA AATACTAAAA ATAGTTCAAA CTATATATGG GGGAAAAGGA
GTAAACTATA CTCCTCAAGC TCTTAAACAA ATAGCTGAAA TTGAGAAATT TAATTTAGAT
AAACTTCCAA TATGTATGGC TAAGACACAG TATTCATTAT CAGACAATCC AAGTCTTTTA
GGAAGACCTG AAAACTTTGA TATAACTATT AAAGAGGTTA GAGTTTCAAA TGGAGCTGGA
TTCATAGTTG TTTTAACAGG GGATGTTATG ACAATGCCAG GTTTACCTAA AGTTCCTGCT
GCTAATAGAA TGGATATAAA AGACAATGGA GAAATAGTAG GATTATTCTA A
 
Protein sequence
MKNDIEIAQS AKMEPIINIA KKIGLEEDDI ELYGKYKCKI SLDVIKRLEN NKDGKLVLVT 
AINPTPAGEG KSTVTVGLGQ ALNKIGKNTV IALREPSLGP VFGIKGGAAG GGYAQVVPME
DINLHFTGDM HAITSANNLL CAAIDNHIHQ GNLLRIDSRR IVFKRVMDMN DRALRNIVVG
MGGKINGFLR EDGFMITVAS EIMAILCMAS DLEDLKERMG NILIAYNLDG EPVYAKELEI
EGAMALLMKD AIKPNLVQTL ENTPAIIHGG PFANIAHGCN SIIATKTALK MSDITITEAG
FGADLGAEKF LDIKCRYGNL NPDCVVLVAT IRALKHHGGV KKDELNISNV DALNKGMKNL
EKQIENIKAY GVPVVVAINK FITDSDEEVK AIEDFCKNIG VEVSLTEVWE KGGEGGIDLA
NKVIKTMENE PSNFKMIYDS EESIKDKILK IVQTIYGGKG VNYTPQALKQ IAEIEKFNLD
KLPICMAKTQ YSLSDNPSLL GRPENFDITI KEVRVSNGAG FIVVLTGDVM TMPGLPKVPA
ANRMDIKDNG EIVGLF