Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dole_2054 |
Symbol | |
ID | 5694897 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfococcus oleovorans Hxd3 |
Kingdom | Bacteria |
Replicon accession | NC_009943 |
Strand | + |
Start bp | 2504046 |
End bp | 2505314 |
Gene Length | 1269 bp |
Protein Length | 422 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641264655 |
Product | hypothetical protein |
Protein accession | YP_001529935 |
Protein GI | 158522065 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00000000265297 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTTTCGC GGATCATTGT GCTGGTTCTG CTGCTGTTTC TGTCCGGATT TTTTTCATCT TCGGAAACCG CCCTGTTTTC CATCAGCCGT TCCCGCATCC GGTTTCTGGC CAAAAAGAAA AACCGGTTCG ACCTGTTGAT CAAAAAGATG AAGGACAACC CCCACCGGCT GCTGTCCACC ATTCTCATCG GCAACAACCT GGTCAACATC GGGGCATCGG CCCTGGCCAC GGCCCTGGCC ATTGACGTCT TTTCCAACAA TGCCGTGGGC ATCGCCACCG GCGTGATGAC CTTTTTTATT CTGATTTTCG GTGAAATCCT GCCCAAGTCC ATCGCCACCA CCAACAATAT CACCATCGCC CGGATCACCA TCTATCCCCT TTACTGGCTG TCGGTCCTGT TTATGCCGAT CATTCTTTTT TTAAACTTCA TTCCCAGGCT CACCGGCAAA ATGAAACCGA TCCCGGTAAT GACCGAGGAG GAGCTCAAAG CCATTATCGA GGTCACCGAA GAGGAAGGGG AGATCGACAA CGAGGAAAAG GAGTTTATTC ACAACATCTT CAAGCTGGAC GACACCAGCG CGTCGGAAAG CATGACCCCC ACCACCGACA TGTTTGCCGT GGACGTGAAC AAAAAACTGC CCCTGGGCGC CATATTAAAA ACAGGCTATA CCCGCATTCC GGTTTATGAG CACCACATCG GCAACATCAT CGGCATTCTC AATGTAAAAG ATGTTTTTCG GCACTATGTG CAGGCCAAGG GCCCGCCCAA CATCCGCTCC CTGATGAGCA AGCCCTATTT TATTCCGGAG TCCAAAAAAC TCAACAGCCT GCTCAAACAG TTCAAGCTGC GCAAGCACCA CATGGCCATC GTGATCAACG AGCACGGCGA GGTCCTGGGG CTGATCACCC TGGAGGATGT GTTAGAAGAG CTGGTGGGCG ACATTATTGA TGAGACCGAC CGGTATGAGC CCCATATTGT CAAGGTAAAA AAACAGACAT GGCTGGTACT GGGCAAAACC CACATTGAAG AGACCAACGC AAAACTTCCC ATGCATATTC CGGAATCCGG GGAATACGAC ACGTTTTCCG GCTACCTGCT GTATCGAATC GGCAGGATAC CAGAGGAAGG CGAGATCATC ACCATCGACA ATTTTGTTAT CACCATCAAG CGCATGGAGG CCAACCGCAT TCTCTCCCTG CTGGTAGAAG AAAAGGCGTC CCCACCCTCC CTGGATGCAG AAAAACAGAA GGCCGGCAAT GGCTCATGA
|
Protein sequence | MLSRIIVLVL LLFLSGFFSS SETALFSISR SRIRFLAKKK NRFDLLIKKM KDNPHRLLST ILIGNNLVNI GASALATALA IDVFSNNAVG IATGVMTFFI LIFGEILPKS IATTNNITIA RITIYPLYWL SVLFMPIILF LNFIPRLTGK MKPIPVMTEE ELKAIIEVTE EEGEIDNEEK EFIHNIFKLD DTSASESMTP TTDMFAVDVN KKLPLGAILK TGYTRIPVYE HHIGNIIGIL NVKDVFRHYV QAKGPPNIRS LMSKPYFIPE SKKLNSLLKQ FKLRKHHMAI VINEHGEVLG LITLEDVLEE LVGDIIDETD RYEPHIVKVK KQTWLVLGKT HIEETNAKLP MHIPESGEYD TFSGYLLYRI GRIPEEGEII TIDNFVITIK RMEANRILSL LVEEKASPPS LDAEKQKAGN GS
|
| |