Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cphy_0805 |
Symbol | |
ID | 5745285 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium phytofermentans ISDg |
Kingdom | Bacteria |
Replicon accession | NC_010001 |
Strand | + |
Start bp | 1029458 |
End bp | 1031935 |
Gene Length | 2478 bp |
Protein Length | 825 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 641291919 |
Product | extracellular solute-binding protein |
Protein accession | YP_001557931 |
Protein GI | 160878963 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGAAAA CAAAGAAATT GGTAGCGTTA CTTCTTTGCA TGACTATGAT TCTTTCTTTA GCAGCTTGTA GTAAGAAGGA AAAAGATGTA AATAAGGATA TTACACCAAC TGTAGCTCCT ACTTCAGCTG CTGGCGGTGA AAAAGAGGAA GAAAAACCAG TGATACCGGA TGTTCCAGCT GGTAGAAGAG ATGCATCCAC ACCTCGTTCT GCTGCAAACG AGAAAAATCC GTTGGTTATT AGTACTTTAA CACTGGATGG AAAATTTACA CCATTCTTCG GTACGAGTGA ACCAGATCGT CTTATCTATG AAAAAACTCA AATTACGTTA ATTGCAAACA ACGAGACAGC TGAGCCAGTT GCAGGTGTTG ATGTACCAAC TGCTGCATGG GATTTTAAGA TGACGACAAA CGATGATCAA TCAAAATCTA CCTATAAATT CTGGATTAAG AATGGTGTTC AATTTAGTGA TGGCCATGTT TTAACTGTAG ATGATGTATT ATTTAACTTA TATGTATTAT TAGATCCTAA ATATGATGGT TCCTCAACTT TATATTCAAT GCATATTGAG GGATTAAAAG CTTATCAAAC TCAGATTTTA GATGAGAGTT CAGCAGATGC AAAACTTGCG GAGTTCGCTG AGGCAGCAAA GAAGAAAGTT GACGCCGCAC TTGCTGGTAA TGGTGAGGCA GCAGTTACTG ATAAATTATG GGAACTTGTA AAAGAAAGTG TGACAGCTGA TAGTAATGTA TTAATGAGTA AGCAATATGT TCCAGAAGAT TTTGGTTTAG TTGGACCAGA AGATTTCTTA ACATCAGCAC CTCAATCCAT CATTCTTTAT TATACAGCAA GCTGTATTGG TAGAGATTTA ATTACATACA AAGATGGTGC TTTTGTTATT GATGCGGCTA CAGGTTTAAC AGTAGATAAA ATGAGTACTT ATACAGAGCA AGATTATATT GATGCTTCCA TGAAGTGTAT CAAAGAAACG ATAAATGCTG CTGAATTTGA TGAAGCATTT GATTATACAA CAGTGGATGA TGCTAAGGCA TTCTTCGCAG GAGAAGAAAA ATCTGCATAT CTTGAAGCAA ACAAAGGTAC TGTTAAGAGT ATTAGCGGTA TCACAAAGGG TAAGGAAGTT TGCTCTGATG GAGTAGAACG TGAAACTTTA ACAGTAGTAT TAAATGGTGT TGACCCTAAG GCAATCTGGA ACTTTACTTT TGAAGTTGCA CCTATGCATT ACTATGCAGG GCAGGCAAGA CATGATAAAG CAAATGGCGT TGATTATTTT GGTGTTGATT TTTCCAGCGC AGCATTTATG TTAGAATTAA AAGAATTTAA TGGTTTACCG ATGGGTGCTG GTGCTTATAA GATGACAGAT GCTAATAACT CAGAAAATCC AACTGCCGAT AAGTTTTATG ATAATGGTAT TTGTTATTTT GTAGCAAATG ATAACTTTTT GTTAGGTGCT CCAAAGGTTA AATATTTAAG ATACAAAACA ATTAATGCTG GCTCTGAACT AGACTCTGTA TTGACTGGCG ATGTTCACTA TTCAGATCCA AAGGCTAGTG CAACTACGAT TAATAAGATC ACTTCAGATT CTTCTTATTC ACATATGAAC TATGTGTTAG TTGATAACTT AGGTTATGGT TATATCGGTA TTAATGCACA GCTTGTACCT GACCTTAATG TAAGACGCGC ACTTATGTCT GCTATGGATA CTGCACTTAC CTTAGGAGCT TACCCAGGAG GCTTAGCGCA AGTTATTCAT AGACCAATGA GTCAGGTGTC ATGGGCATAT CCTGAAGGAT GTACGGCAAT GTACCCATTC GATGAAACAG GTGCAGCTTC TAAGGCGTTC TTCTTAAAAG CAGGTTATAA AGAGACAGCA GATGGTAAAT TATTAGCACC AGACGGAAGT AAGCCATCCT TTAAATTTAC ACTTCCTTCA AGTGCAGATG ATCACCCAGC TGGTCAGGTA TTCTTAAAGA CACAAGAAGT ACTTGAGAAG ATCGGTGTTG AAGTAATTAT TGATATTGAT CAAAATCTTT TAAGTAAATT AAACGAGGGT ATTATTTCTG TTTGGGCAGC AGCTTGGCAA GCAACAATAG ATCCTGATAT GTTCCAGGTA TACTGTTCCG ATCCTTCTAA AAATCAAGCT ACTTCTCCAA AGTCTTCAGG ACTTTATTAT AAATTTGAAA ATGGTTCTGA TGAAGAAAAA GCAATTCTTG TACGTTTAAA TGAATTAATT GAGCAGGGAC GTTCTACTCT TAACGTAGAT GAAAGAAAAC CAATTTATTC AGAGGCACTT GATAAGGCTA TGGAAATGGC AGTTGAATTA CCAACCTATC AGAGAAAGAA TATGTATGTT TATAACAAGA GTGTAATTGA TCCAAGTAGT TTAACTCCAG CAGATAAGAC AACACCTTAC AGATCACCAA TTTACGCTAT CTGGGATGTT AGCTTACTTG ACAACTAG
|
Protein sequence | MKKTKKLVAL LLCMTMILSL AACSKKEKDV NKDITPTVAP TSAAGGEKEE EKPVIPDVPA GRRDASTPRS AANEKNPLVI STLTLDGKFT PFFGTSEPDR LIYEKTQITL IANNETAEPV AGVDVPTAAW DFKMTTNDDQ SKSTYKFWIK NGVQFSDGHV LTVDDVLFNL YVLLDPKYDG SSTLYSMHIE GLKAYQTQIL DESSADAKLA EFAEAAKKKV DAALAGNGEA AVTDKLWELV KESVTADSNV LMSKQYVPED FGLVGPEDFL TSAPQSIILY YTASCIGRDL ITYKDGAFVI DAATGLTVDK MSTYTEQDYI DASMKCIKET INAAEFDEAF DYTTVDDAKA FFAGEEKSAY LEANKGTVKS ISGITKGKEV CSDGVERETL TVVLNGVDPK AIWNFTFEVA PMHYYAGQAR HDKANGVDYF GVDFSSAAFM LELKEFNGLP MGAGAYKMTD ANNSENPTAD KFYDNGICYF VANDNFLLGA PKVKYLRYKT INAGSELDSV LTGDVHYSDP KASATTINKI TSDSSYSHMN YVLVDNLGYG YIGINAQLVP DLNVRRALMS AMDTALTLGA YPGGLAQVIH RPMSQVSWAY PEGCTAMYPF DETGAASKAF FLKAGYKETA DGKLLAPDGS KPSFKFTLPS SADDHPAGQV FLKTQEVLEK IGVEVIIDID QNLLSKLNEG IISVWAAAWQ ATIDPDMFQV YCSDPSKNQA TSPKSSGLYY KFENGSDEEK AILVRLNELI EQGRSTLNVD ERKPIYSEAL DKAMEMAVEL PTYQRKNMYV YNKSVIDPSS LTPADKTTPY RSPIYAIWDV SLLDN
|
| |