Gene Cphy_0431 details


Gene Information

Locus tag: Cphy_0431
Symbol:
ID: 5745191
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium phytofermentans ISDg
Kingdom: Bacteria
Replicon accession: NC_010001
Strand:
Start bp: 550144
End bp: 551712
Gene Length: 1569 bp
Protein Length: 522 aa
Translation table: 11
GC content: 37%
IMG OID: 641291543
Product: extracellular solute-binding protein
Protein accession: YP_001557557
Protein GI: 160878589
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
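
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 550144
end_bp = 551712

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1569 522
```

Both results agree with the Gene Length (1569 bp) and Protein Length (522 aa) listed in the record.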


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAGAA GAAAGCAGTT ATTCTTTTTA GGTCTTCTTC TGAGTTGTAT GATTGCACTC 
TGCGCATGCA GTGGCAGTAC AGAAAACAAG GGAAATGATC AAAAAGAAGC AGAAAACGGC
GAGGCTTCAA AAAATGGTGG CTCTGTTATT GTAGGTATAA CAAACGATTT GGATAGTCTT
GACCCACACA AAGCAGTTGC GGCAGGAACT AAGGAAGTTT TGTTTAATAT TTTCGAAGGC
TTAGTAAAGC CTGATAAAGA TGGAAATTTA GTTCCTGCTG TAGCAAGCGA CTATAAGATC
TCGGAGGATG GAATGACCTA TACGTTTTAT TTACGTGAAG GGGTAAAATT CCACAATGGT
GCGCTTGTAA CGGTAGATGA TGTAATATAT TCATTAAAAC GTGCTACCGG CCTGCTTGAA
ACTTCCGATC CAGAAGTGAG AGTTGAGTCG GTATTTTCTT GTGTAGAATC GATAAATGCG
ATTACTACTG AGGATGGAAA ACAGGCCGTA GAAGTCAAAT TAAATCAACC TAATATTGAA
CTTCTATCTT ATTTTACCAT TAGTATAATA CCTAAAGACT ATACCGAGCA GGCGACTAAG
CCGGTTGGTA CAGGTCCATT CCGTTTTGTA TCCTATTCGC CGCTGGTTAG CATTGTGATG
GAGAAGAATC CTGACTATTA TGTTAAAGGG GTTCCTTATC TTGATGAAGT AACGTTTAAG
ATATCTGCAA ATACAGATGC AGCATTTATG GAATTAAGAG CTGGTACTAT CGATATTTTC
CAGCAATTAA CCTATGAACA ATCAAATCAA TTAAAGGATT TGTACAACAT TGAGATTGGG
CATATGAACC TGGTACAGGC TTTGTTTTTA AATCACAAAG CTGCTCCATT TGATAACTTA
AAAGTTCGCC AAGCGCTCAG CTATGCAATT GACAGACAGA TGATTCTTGA TATTGTTGCA
GGTGGAAATG GTACGATTAT TGGAAGTAAT ATGTTTCCTG GATTCGGTAA ATACTACGAT
GAATCCTTAG CAAGTTACTA CACTTATGAT GTAGAAAAAG CAAAACAACT TCTTAATGAG
GCAGGTTATC CGGAAGGATT TCATTTTACA ATCACTGTCC CATCTAATTA TCAAGCACAT
GTTGATACAG CACAGGTGAT TGTGGAACAG TTAAAGAAGG TTGGAATTAC TGCGGAAATC
AAGCTAGTAG AATGGGCAAG TTGGATTTCT GATGTTTACC AAGGAAGAAA CTATGAATCA
ACGATTATAG GACTTGATTC CAATCTTTCT CCTAGTGATA TTGTAAAGCG ATATGAATCT
ACATCAAAAA ACAACTTTTT AAATTATTCA AATTCACAAT TTGACAGCCT TTATCCGAAG
GCTTATGCAT CGGTTAAGGA AGAAGAAAAA GTAGACCTTT ATCATCAGTT GCAAAGAATA
TTAACAGAGG ATGTAGCTTC TGTTTATCTT CAAGATCCAG CTAATTTAGT AGCAGTAAAC
AAAAAATTAG CAGGATATCA ATTCTATCCT GTTTATGTAC AGGATATGTC AACAGTTTAT
TATAAGTAA
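
The gene sequence is displayed in space-separated blocks of ten bases, so the spacing has to be removed before the sequence is measured or analyzed. The sketch below shows one way to do that and to compute a GC fraction; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C (0.0 for empty input)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First display line of the gene sequence above (six blocks of ten bases).
first_line = "ATGAAAAGAA GAAAGCAGTT ATTCTTTTTA GGTCTTCTTC TGAGTTGTAT GATTGCACTC"
seq = clean_sequence(first_line)
print(len(seq), gc_fraction(seq))
```

Applying the same two functions to the full sequence text should allow a comparison against the Gene Length and GC content fields of the record.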
 
Protein sequence
MKRRKQLFFL GLLLSCMIAL CACSGSTENK GNDQKEAENG EASKNGGSVI VGITNDLDSL 
DPHKAVAAGT KEVLFNIFEG LVKPDKDGNL VPAVASDYKI SEDGMTYTFY LREGVKFHNG
ALVTVDDVIY SLKRATGLLE TSDPEVRVES VFSCVESINA ITTEDGKQAV EVKLNQPNIE
LLSYFTISII PKDYTEQATK PVGTGPFRFV SYSPLVSIVM EKNPDYYVKG VPYLDEVTFK
ISANTDAAFM ELRAGTIDIF QQLTYEQSNQ LKDLYNIEIG HMNLVQALFL NHKAAPFDNL
KVRQALSYAI DRQMILDIVA GGNGTIIGSN MFPGFGKYYD ESLASYYTYD VEKAKQLLNE
AGYPEGFHFT ITVPSNYQAH VDTAQVIVEQ LKKVGITAEI KLVEWASWIS DVYQGRNYES
TIIGLDSNLS PSDIVKRYES TSKNNFLNYS NSQFDSLYPK AYASVKEEEK VDLYHQLQRI
LTEDVASVYL QDPANLVAVN KKLAGYQFYP VYVQDMSTVY YK
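
The protein sequence corresponds to the gene sequence translated with translation table 11. As a hedged illustration, the sketch below builds a codon table from the standard amino acid assignments, which table 11 shares with the standard genetic code; the two differ in their permitted start codons, which this sketch ignores because the gene begins with ATG. The function name `translate` is illustrative, and the code assumes the input contains only the letters A, C, G and T.

```python
from itertools import product

# Codons enumerated in T, C, A, G order, paired with the standard
# amino acid assignments ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA (spaces allowed) codon by codon, stopping at a stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons and last three codons of the gene sequence above.
print(translate("ATGAAAAGAA GAAAGCAGTT A"))  # MKRRKQL
print(translate("TATAAGTAA"))                # YK
```

The two outputs match the first seven and last two residues of the protein sequence shown above.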