Gene Phep_2053 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_2053 
Symbol 
ID8253157 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp2369493 
End bp2370779 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content40% 
IMG OID644935701 
Productcitrate synthase I 
Protein accessionYP_003092320 
Protein GI255531948 
COG category[C] Energy production and conversion 
COG ID[COG0372] Citrate synthase 
TIGRFAM ID[TIGR01798] citrate synthase I (hexameric type) 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.655097 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.0360596 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAGATA TTGCAGAAAT TAAGATTGAT GGAAAAGTGT ATGAATTCCC CGTTATCACT 
GGAACCGAAG GGGAGAAAGC TATAGATATC TCTAAACTTA GAGATTTAAC AGGTCATATC
ACTTTAGACT TTGGATACAA AAATACGGGC TCTACCAAAA GTGCAATAAC CTTTTTGGAT
GGTGAACAAG GTATATTAAA ATACCGTGGC TATCCGATTG AAGAACTGGC AAAAAAATCT
ACTTTCTTAG AAGTAGCCTA TTTATTGATA TATGGCGACC TGCCCACACA GGTGCAATTG
GATGATTTTC AAAAACAGAT CAGCAGACAT ACACTGATCC ATGAGGATAT GAAGAAATTT
CTGGATGGTT ATCCGTCGAA ATCACATCCT ATGGCCCAGC TCTCTTCACT GGTATGTTCT
TTATCTACTT TCTACCCGGA GTCTTTAAAT GCAAATTCAT CGCCTGAGAC GATGGACCTG
ACCATGATCA AACTGCTGGC CAAGTTTCCG ACCATTGTTT CTTTCATATA TAAAAAATCT
TTAGGCCACC CGCTGATCTA TCCTAAAAAT AAATACGATT ACATCAGCAA TTTCCTGAAC
ATGATCTTTG GTCAGCGTAC AGAGGAAGTT GAGATTGACC CGGTTGTGGT AAATGCCATG
AACACCTTAT TGATCTTACA TGCAGACCAT GAACAGAATT GTTCTACCTC TACAGTAAGG
ATTGTTGGTT CTTCAGATTG TAACTTGTAT GCATCGGTTT CTGCAGGTAT AGACGCCTTA
TGGGGGCCAC TTCATGGCGG CGCGAACCAG GCAGTAATAG AGATGCTGGA ACTAATTAAA
CAAGATGGCG GGGATACAGA AAAATGGATC AATAAAGCCA AAGATAAAAA TGATCCTTTC
CGTATGATGG GTTTTGGGCA CAGGGTATAT AAAAACTTTG ATCCAAGGGC TAAGATCATT
AAAAAGGCTT GTGATGATAT TTTAGAAAAA CTGGGCATCA ACGATCCGGT ACTGGAAATT
GCCAAGAAAC TGGAAGAAGC AGCTTTAAGC GATCCTTATT TTGTACAACG TAAACTATAT
CCTAATGTCG ACTTCTACTC GGGGATCATT TACAGGGCTT TAGGTTTCCC TACGGATATG
TTTACTGTAT TGTTTGCTTT GGGCCGTTTA CCGGGATGGA TTGCACAATG GAAAGAAATG
CATGAAAACA AAGAGCCGAT AGGACGCCCG CGCCAGATTT ACGTTGGTCA TACCGACAGA
ACTTTTACTG CAATAAAAGA CAGGTAA
 
Protein sequence
MSDIAEIKID GKVYEFPVIT GTEGEKAIDI SKLRDLTGHI TLDFGYKNTG STKSAITFLD 
GEQGILKYRG YPIEELAKKS TFLEVAYLLI YGDLPTQVQL DDFQKQISRH TLIHEDMKKF
LDGYPSKSHP MAQLSSLVCS LSTFYPESLN ANSSPETMDL TMIKLLAKFP TIVSFIYKKS
LGHPLIYPKN KYDYISNFLN MIFGQRTEEV EIDPVVVNAM NTLLILHADH EQNCSTSTVR
IVGSSDCNLY ASVSAGIDAL WGPLHGGANQ AVIEMLELIK QDGGDTEKWI NKAKDKNDPF
RMMGFGHRVY KNFDPRAKII KKACDDILEK LGINDPVLEI AKKLEEAALS DPYFVQRKLY
PNVDFYSGII YRALGFPTDM FTVLFALGRL PGWIAQWKEM HENKEPIGRP RQIYVGHTDR
TFTAIKDR