Gene Information |
Locus tag | Cphy_0907 |
Symbol | |
ID | 5741779 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium phytofermentans ISDg |
Kingdom | Bacteria |
Replicon accession | NC_010001 |
Strand | + |
Start bp | 1158767 |
End bp | 1160230 |
Gene Length | 1464 bp |
Protein Length | 487 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 641292019 |
Product | hypothetical protein |
Protein accession | YP_001558031 |
Protein GI | 160879063 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
|
|
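The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and that the reported protein length excludes the terminal stop codon; the function names are illustrative and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Values taken from the record above (Start bp, End bp).
length = gene_length_bp(1158767, 1160230)
print(length, protein_length_aa(length))
```

With the values in this record, the computed lengths agree with the "Gene Length" and "Protein Length" fields.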
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00145155 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATGGAC ATCCCATACG TGGTCTTGTC TTAATACTAG TATTAGTAGC ACTAAATGCG ATAGCATCAG CTGCAGAGGC AGCCATTGAA AATGTAAATG AAGCCCTTGC CGAAAAGCGA GCAGAAGAGG GAGATAAGAA GGCAAAACGC TTAGTAAGAT TGTTAGATAC CCCTCATCGA TACATAAATG TCATCGAGAT CCTTCTAACA CTAGCTAGCT TACTCATTGG TATGACTTAT TCTTTTCAGC TTTATAGAGT GATTGAGAAG CTTGTCGAAA CAAGTACACT TCCAGAAGCT ATGGCAATAA CAACTAGTAT CGCGATGGTA TTGGTTACGA TATTAATCAC TTATCTTATC GTATTATTTG GTATGCTGTT ACCACGAAAG CTGGCACTAA AATATGCGGA TTCCTGTGCA TTTAAAATGG CAGGTATGAT TTTAACCTGT TCTCATTTAT TTGCACCAAT CATTTGGTTA TTGGAGAAGA ATACAAATGG CATACTTCGT TTATTTGGAA TTCGTCCTAG TGACTTAGAA GATAATGTAA CAGAAGAAGA AATTATGTCT ATGGTAAATG AAGGCCATGA ACAGGGAGTT CTTGAAGCAG AAGAAGCTGA GATGATATCT AATATCATCG AATTTAATGA AAAGGCTGCG AAAGACATTA TGACTCATCG CAAAAAGATG ATTGCAATCA ACAGTGCTCT TTGTATCGAA GATGCTCTTC GATTTATGTT AGATGAGAAT TACTCCAGAT TCCCACTTTA TGATGGGGAT ATTGATAATA TTGTAGGTTT GCTACATTTA AAGGATGTTA TGTTATATTT TCTGGATCCT AGACTTAAGG TTGAACCTTT GTCTAAGGTG GCCAGAGAAC CATATTTCAT ACCTGATACA CAAAGTATTG ATGTATTATT CCACGATATG CAAACTAAGA AAATCCATAT GGCAATTGCA ATTGATGAGT ACGGGCAGAC AGCCGGTATT GTTGCTATGG AGGATATCTT AGAGGAAATC GTAGGTGACA TACAGGATGA ATATGATGAC GAAGAGGAGC TTTACACAAG ATTAGAGGAT GACTCTTACT TGTTGTCGGG TGAAGCTTCC CTAGAAGATT TGGAAGACAT ATTATCTCTT CCGTTTGCAG AAGAAGATAT AAAAAATTAC GATACGTTAA ATGGGCTTAT TGTATCATTA CTAGACCATA TTCCAGGAGA CGATGAAAGG GCTACCATTC GATATTGCGG CTATGAATAT GAACTAATGG AAATACAGAA TCGAATGATC ACCTCTGTTC GGGTTCGTAA GATCCCAGAG GAGGAATTAA AAGCTTCCGA TAATGAAGAT AATCAAGTTT CACAAAGACT TGGTGCGGCG ATGACGGATG CGATTGATAC AACAGATGAA AAGATTCTAA GTAATGTTGA GGATATTATC CTTGAAAAGA AAAAGGATAA ATAA
|
Protein sequence | MDGHPIRGLV LILVLVALNA IASAAEAAIE NVNEALAEKR AEEGDKKAKR LVRLLDTPHR YINVIEILLT LASLLIGMTY SFQLYRVIEK LVETSTLPEA MAITTSIAMV LVTILITYLI VLFGMLLPRK LALKYADSCA FKMAGMILTC SHLFAPIIWL LEKNTNGILR LFGIRPSDLE DNVTEEEIMS MVNEGHEQGV LEAEEAEMIS NIIEFNEKAA KDIMTHRKKM IAINSALCIE DALRFMLDEN YSRFPLYDGD IDNIVGLLHL KDVMLYFLDP RLKVEPLSKV AREPYFIPDT QSIDVLFHDM QTKKIHMAIA IDEYGQTAGI VAMEDILEEI VGDIQDEYDD EEELYTRLED DSYLLSGEAS LEDLEDILSL PFAEEDIKNY DTLNGLIVSL LDHIPGDDER ATIRYCGYEY ELMEIQNRMI TSVRVRKIPE EELKASDNED NQVSQRLGAA MTDAIDTTDE KILSNVEDII LEKKKDK
|
| |
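The protein sequence can be rederived from the gene sequence with the codon assignments of translation table 11. This is a minimal sketch, not the database's own pipeline: the helper names are hypothetical, the codon table is built from the standard TCAG-ordered amino acid string (table 11 uses the same codon-to-residue assignments as the standard code and differs in its permitted start codons), and the input is assumed to be a + strand CDS starting in frame.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces that group the displayed sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence shown above.
print(translate("ATGGATGGAC ATCCCATACG TGGTCTTGTC"))
```

Passing the full "Gene sequence" field to `translate` and comparing the result with the "Protein sequence" field (after the same whitespace cleanup) gives a quick check that the two fields match.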