Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cphy_2971 |
Symbol | |
ID | 5744032 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium phytofermentans ISDg |
Kingdom | Bacteria |
Replicon accession | NC_010001 |
Strand | - |
Start bp | 3635219 |
End bp | 3636634 |
Gene Length | 1416 bp |
Protein Length | 471 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 641294071 |
Product | phage uncharacterized protein |
Protein accession | YP_001560067 |
Protein GI | 160881099 |
COG category | [S] Function unknown |
COG ID | [COG5410] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR01630] phage uncharacterized protein (putative large terminase), C-terminal domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.000566007 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGGATA AAGAGTTAAT AAAGCTAGGT GCAAAGATAG AACTTGCAAG ACGTGAGTTC TTTTTTTATT GTCAGTTAAA AGCTCCTGAC TTCTATAAGT CTGATAGAAA GTACTTGGTT GAGTTATGCA ATGAGTTCCA AGAGTTCTTA GATTCTGACG ATGAAGTAAT GATAGTTAAT GAGCCTCCAA GACACGGGAA GTCTAGGACA GTTGGTAATT TGGTTGAGTG GGTGCTTGGT AAGGACCAAA CACAAAAGAT AATGACTGGA TCATACAATG AAACCCTGTC CACCATGTTC TCTAAAAACG TCCGTAACAG CATTATGGAA CAGAAAGCAG ATAAATATAA GCCTGTGTAC TCAGACGTGT TTCCTGGGGT GTCTATAAAG CGTGGTGATG GTGCTATGAA TCTTTGGAGT TTAGAGGGTG GATATAACAA TTACTTAGCT ACTTCTCCAA CTGGTACTGC AACAGGGTTC GGTTGTTCGC TTATGATAAT CGATGACCTT ATCAAGAATG CGGAGGAAGC AAATAATGAG GCTGTAAAAG AAAAACACTG GGAATGGTTC ACCAATACAA TGCTGTCCCG TCTGGAAGAG GGCGGGAAGA TAATCATTAT TATGACTAGA TGGGCCTCTG ATGATCTGGC TGGAAGAGCG TTAGAGCATT ACAAGGAGCA AGGGGCAAAG ATTCGTCATG TAAGCATGAA AGCTCTCGTT GATAAAGAAA AGAAACAAAT GCTTTGTAGT GAAGTGCTTT CCTATAAATC TTATCTTGGC AAGATAAAGG CTATGGGCGA GGATATCGCA AGCGCTAACT ATCAGCAGGA ACCTATTGAT CTGAAAGGCA AATTATATAG TAGCTTTAAA ACATACGAAA AGTTTCCTAT GGATGATAAA GGCAATCTCC TATTCACTGC TATTAAATCA TATTGCGATA CGGCAGACGA GGGAACAGAT TATCTCTGTA ATATTATTTA TGGTGTTTAT AATAAAGAGG CTTATGTCCT AGATATTTAC TATACCAATG CACCGATGGA AGTTACTGAA ACAGAAACCG CTAAAAGAAT CCATGAACAC GGTGTAAACG TAGCTGATAT AGAAAGTAAT AATGGTGGTC GAGGATTTGC CCGTTCGGTA GTAAGAATCC TAAGAGAAAC ATTTAAGAGT AATAAAACCA AGATTCGGTG GTTCCATCAA AGTAAGAATA AGATTGCTAG AATACTTTCT AACAGTACAT GGGTTATGGA CCACATTTAT TTCCCTAAAA ACTGGCGTGA TAGATGGCCC GATTATTATA GTGCTATGTC TAAATATCAG CGTGAAGGCA AGAACAAACA TGATGATGCG CCAGACGCTA CAACAGGAAT TGCTGAGCGC ATTGATAAAG GCAATGGTGT ATCGGTTTTA AAATAA
|
Protein sequence | MMDKELIKLG AKIELARREF FFYCQLKAPD FYKSDRKYLV ELCNEFQEFL DSDDEVMIVN EPPRHGKSRT VGNLVEWVLG KDQTQKIMTG SYNETLSTMF SKNVRNSIME QKADKYKPVY SDVFPGVSIK RGDGAMNLWS LEGGYNNYLA TSPTGTATGF GCSLMIIDDL IKNAEEANNE AVKEKHWEWF TNTMLSRLEE GGKIIIIMTR WASDDLAGRA LEHYKEQGAK IRHVSMKALV DKEKKQMLCS EVLSYKSYLG KIKAMGEDIA SANYQQEPID LKGKLYSSFK TYEKFPMDDK GNLLFTAIKS YCDTADEGTD YLCNIIYGVY NKEAYVLDIY YTNAPMEVTE TETAKRIHEH GVNVADIESN NGGRGFARSV VRILRETFKS NKTKIRWFHQ SKNKIARILS NSTWVMDHIY FPKNWRDRWP DYYSAMSKYQ REGKNKHDDA PDATTGIAER IDKGNGVSVL K
|
| |