Gene Cphy_3603 details


Gene Information

Locus tag: Cphy_3603
Symbol:
ID: 5742627
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium phytofermentans ISDg
Kingdom: Bacteria
Replicon accession: NC_010001
Strand:
Start bp: 4450172
End bp: 4451374
Gene Length: 1203 bp
Protein Length: 400 aa
Translation table: 11
GC content: 37%
IMG OID: 641294713
Product: homoserine dehydrogenase
Protein accession: YP_001560689
Protein GI: 160881721
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0460] Homoserine dehydrogenase
TIGRFAM ID:
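
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 4450172
end_bp = 4451374

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS length includes the stop codon, so one codon
# is subtracted to obtain the protein length in amino acids.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1203 400
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.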


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000021314
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGATAG CAGTGTTAGG ATTTGGAACC GTAGGTTCTG GAGTATATGA AGTAATAAAA 
ACAAATTATG AAACTATTAC AAAACGTGCT GGTGAAGTAG TTGACATTAA ATATGTGCTG
GATTTAAGAG ATTTTCCAGG AAATCCTGTT CAGGATATCA TTACTCATGA TTTTAGTGTA
ATTGCAAATG ATCCAGAAAT TAAGATTGTT GTTGAAGTTA TGGGTGGTGT AGATCCTGCT
TATTCTTTTG TGAAGGAAGC CTTATTAAAA GGGAAATCTG TCTGTACTTC CAACAAGGAA
CTAGTAGCAA AACATGGAGC AGAACTATTG GAGATAGCAA AACAAAAGAA GATTAACTTT
TTGTTTGAAG CTAGTGTTGG CGGTGGTATA CCGATTATTC GTCCTTTGAA TCAATCCTTA
ACAGCGGATG AGATCGATGA AATAACAGGT ATCTTGAATG GTACTACAAA CTATATTTTA
TCAAAGATGA AAACGCAAGG ATCTGAGTTT GCAACCGTTC TTAAGGAGGC TCAAGAGCTT
GGTTATGCAG AACGTAATCC AGAGGCCGAT GTCGAAGGTT TTGATGCTTG CCGTAAAATT
GCGATTCTTA CCTCGCTTGC ATATGGAATG CATGTTGATT TCGAACAGAT TTATACTGAA
GGAATTACGA AGATAACAGC AGAGGATATT AAGTATGCCA ATGCGTTAGA TGCTAGCATT
AAATTATTAG CGACCAGTAA AAACGTTGAC GGTAAGGTGT ATGCGATGGT TGCTCCTAAG
ATGATAAACG ATAAGCATCC ATTATTTTCT GTAAATGATG TATTTAACGG AATACTTGTT
AAAGGAAATT TATTAGGTGA TGTTATGTTC TATGGAAGCG GAGCAGGCAA ACTTCCAACA
GCAAGTGCAG TTGTTTCTGA TGTTGTAGAT GCAACCAAAC ATATGGGAAT CAACATTATG
ACATTATGGA GCAGCAAACA TCTAATTCCA GCGGATATGA GTACCTATGA GAGTAAATTC
TTTGTTCGTG TACCTTTAGG GGAAGAGGAG ACTGCAAAAG AATTATTTAA GATTGCAAAG
GTTGTTTCAG TACCTGATAT AGACGGAGAG TATGCATTTA TCACGGAGAA GATGACGGAA
GGAGCTTTTG AAGAGGCAGC GAAGAAGCTA TCGATAATTA ACCGTATTCG TGTGGAATTT
TAG
 
Protein sequence
MKIAVLGFGT VGSGVYEVIK TNYETITKRA GEVVDIKYVL DLRDFPGNPV QDIITHDFSV 
IANDPEIKIV VEVMGGVDPA YSFVKEALLK GKSVCTSNKE LVAKHGAELL EIAKQKKINF
LFEASVGGGI PIIRPLNQSL TADEIDEITG ILNGTTNYIL SKMKTQGSEF ATVLKEAQEL
GYAERNPEAD VEGFDACRKI AILTSLAYGM HVDFEQIYTE GITKITAEDI KYANALDASI
KLLATSKNVD GKVYAMVAPK MINDKHPLFS VNDVFNGILV KGNLLGDVMF YGSGAGKLPT
ASAVVSDVVD ATKHMGINIM TLWSSKHLIP ADMSTYESKF FVRVPLGEEE TAKELFKIAK
VVSVPDIDGE YAFITEKMTE GAFEEAAKKL SIINRIRVEF
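
The gene and protein sequences above are printed in blocks of ten characters, so they need to be joined before any comparison. The following is a minimal sketch of how one might strip the grouping, translate the gene, and compute GC content. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two tables differ in their permitted start codons), and the helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
# Codon table built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def gc_percent(dna):
    """Percentage of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate complete codons from position 0 up to the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
head = "ATGAAGATAG CAGTGTTAGG ATTTGGAACC"
print(translate(head))  # MKIAVLGFGT
```

Passing the full gene sequence to `translate` and comparing the result with `clean` applied to the protein sequence would confirm whether the two listings agree; `gc_percent` on the full gene sequence can be compared against the GC content field.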