Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpha266_0058 |
Symbol | |
ID | 4571250 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | + |
Start bp | 65249 |
End bp | 66577 |
Gene Length | 1329 bp |
Protein Length | 442 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 639764660 |
Product | protein of unknown function DUF900, hydrolase family protein |
Protein accession | YP_910552 |
Protein GI | 119355908 |
COG category | [S] Function unknown |
COG ID | [COG4782] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGGAAC TTGTTCAGCA ACTCGCGCTG CTCGATGCGC TGCTCAGTAA CTGGGACGGC GTGCTTGTAA GGATGGGCGC GCCCGACGTT CTGCGCGACG AAGCTCTTGC CGCGCTTGCC GTAAAACTTT CATCGGCAGA ATCGCCCGAC GATCTCGACC TGCTGCTCGA CGATCTTTTC GACCTCGTCG AAGATACTCC GGCCTACGAT TACGTGCGCG GTCTTATCGC GCGCGCCCGG CTCGACACCG GTACGGTTGC CAAAACCCGT GGGGGATATG CCGGAGAACT CGTGAGCGAC GATGAAAAAA GCCTGCTCTT CGGCGCATCC CGGTTAGCCG GTCGCGCGCT CGGCAATGCG GTTTCGGCGG ACGTGGAGCC GTGCCAGGTC GAGGTTTTTT TTGCCACCAA CCGAAAGAGT TCCGACGCTT CCGACCTGCA GTTCACCGGG GAGCATGATG CCGGCGGCTA TACCTGCGGA GTGGCGCACG TTACCATTCC TGTTGGCGTT CATCGGGCAG GGGTGCTCGA AGGGAAGCGA TGGTGGCACC TGCTGCGTGA GAAGGACGAC TCAAGACGGT ACGTGGTGCT CGGAAGCGTT GAAAAACTTG CGGAGGATCT TTTTACGACG AGGATTGTCG AAGGCGCCCC GGAGTGCCGC GACCTGCTTG TTTTTCTGCA CGGTTTTAAC GTCACCTTCG AAGATGCTGC CCGTCAGGCA GCGCAGTTCG CCTTCGACCT GCAGTTTCAG GGCAGGGTCG TACTGTACAG CTGGCCATCC CTCGGTTCGC TTGCCGGTTA CTGCGCAGAC GAGGAGCGTG CATTCCTTTC GAACGGGAGG TTTGCCGGAT TTCTCGGAAT GCTTGAAGAC GGCCCCTGGG ACAAAGTGCA TATACTTGCG CACAGCATGG GCAACCGGGT AATGCTCTAT GGCTTGTCGG GCAACTCTTG GCCAAACGGC AGGATATCAC AGGTCGTTTT CGCCGCTGCC GACGTCTATG TCGAAACCTT TCGCGAACTC TTTCCGAAAA TCAGGGAGAA GGCCGCGCTC TATACCTCCT ACACTTCGAA GAAAGACCGT GCGCTCCTGA TATCGGGGAT TCTCCATAAG GCGAAAAGGA TCGGCATATC CAGAGGTGAG CCCTTCGTCA TGGATGGGCT TGAAACCATC GATGCGTCGA AAGTCGACAA CGGATTTCCC GGGCACGGCT ATTTTGCCGA AGAAACAAAG CTTATCGAGG ATATTTCCCA ACTGCTCGGC AAAGGGCTTT CTGCGGGTCG CCGCAAACTC TACCAGCCGC CATCGAAAGC GTACTGGGAT TTCAGGTAA
|
Protein sequence | MTELVQQLAL LDALLSNWDG VLVRMGAPDV LRDEALAALA VKLSSAESPD DLDLLLDDLF DLVEDTPAYD YVRGLIARAR LDTGTVAKTR GGYAGELVSD DEKSLLFGAS RLAGRALGNA VSADVEPCQV EVFFATNRKS SDASDLQFTG EHDAGGYTCG VAHVTIPVGV HRAGVLEGKR WWHLLREKDD SRRYVVLGSV EKLAEDLFTT RIVEGAPECR DLLVFLHGFN VTFEDAARQA AQFAFDLQFQ GRVVLYSWPS LGSLAGYCAD EERAFLSNGR FAGFLGMLED GPWDKVHILA HSMGNRVMLY GLSGNSWPNG RISQVVFAAA DVYVETFREL FPKIREKAAL YTSYTSKKDR ALLISGILHK AKRIGISRGE PFVMDGLETI DASKVDNGFP GHGYFAEETK LIEDISQLLG KGLSAGRRKL YQPPSKAYWD FR
|
| |