Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cphamn1_0103 |
Symbol | |
ID | 6373747 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides BS1 |
Kingdom | Bacteria |
Replicon accession | NC_010831 |
Strand | + |
Start bp | 98083 |
End bp | 99021 |
Gene Length | 939 bp |
Protein Length | 312 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 642682620 |
Product | domain of unknown function DUF1731 |
Protein accession | YP_001958567 |
Protein GI | 189499097 |
COG category | [R] General function prediction only |
COG ID | [COG1090] Predicted nucleoside-diphosphate sugar epimerase |
TIGRFAM ID | [TIGR01777] conserved hypothetical protein TIGR01777 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAGATC ATATCGTTAT TACCGGTGCT ACAGGTGTGA TTGGCTCTGA ACTTGCGCAT CAGCTGATCG CCGAAGGTGA GCAAGTGGTT GTTTTTTCGC GTTCTCCTAA TAGTGCTTCG TCCAAGGTTC CCGGTGCGGC GGCATATGCC GCCTGGAATT ATGACAATAG CGATGGAGAA TGGACCCGGT ATATCAGCGG AGCAAAAGCG GTGATACATC TGGCGGGAAA ACCTCTTCTC GATACGCGCT GGACAGAGGA GCATAAGGTT GAGTGCTATA ATTCGAGAGT CGTGGGAACG AAAAATCTTG TCAAGGCTAT CCAGGGGGCC GACATAAAAC CGAAATCCCT GATTTCCGCA TCTGCCATAG GATATTACGG TTCTTACGAA AACTGTGGTG ACTCGCCTGA CCTTGATGAA GCCGCGGCGG AAGGTGAAGA TTTTCTGGCT AAGATCTGTA TCGATTGGGA GAAGGAGGCT GAGAATGTGC CGGAAGGTGT GCGGCTTGTG CTGTTAAGAA CGGGTATTGT GCTTTCAACC AAAGGAGGAA TGCTTCAGCA GATGCTTCTG CCTTTTAACC TTTTCCTTGG CGGGCCTGTC GGTTCAGGTA AACAGTGTAT TTCCTGGATT CACGTAGATG ATGAAGTCGC GATTATCCGC AAGGCTGTAG AGGAGTCTTC TTATAAGGGG CCGATAAATC TTGTTGCTCC GCATCCGGTT TCCATGAAAG AGTTCGCGGG AGATCTTGGA TCTGTGCTTT CAAGACCGTC TCTGATTCCT GTCCCGAAGT TTGCCTTGCA GATTCTGATG GGGGAAGGGG CTGAATACGC AAGTAAAGGG GGCAAGGTCG TTCCGGGGTT TTTGAAGGAG CAGAACTATC GCTTTACGCA TCCGTCTCTT CGTGAAGCGC TGGCTGATCT TGTTGAACAT AACAAGTAG
|
Protein sequence | MEDHIVITGA TGVIGSELAH QLIAEGEQVV VFSRSPNSAS SKVPGAAAYA AWNYDNSDGE WTRYISGAKA VIHLAGKPLL DTRWTEEHKV ECYNSRVVGT KNLVKAIQGA DIKPKSLISA SAIGYYGSYE NCGDSPDLDE AAAEGEDFLA KICIDWEKEA ENVPEGVRLV LLRTGIVLST KGGMLQQMLL PFNLFLGGPV GSGKQCISWI HVDDEVAIIR KAVEESSYKG PINLVAPHPV SMKEFAGDLG SVLSRPSLIP VPKFALQILM GEGAEYASKG GKVVPGFLKE QNYRFTHPSL REALADLVEH NK
|
| |