Gene Information |
Locus tag | Dret_0461 |
Symbol | |
ID | 8418267 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfohalobium retbaense DSM 5692 |
Kingdom | Bacteria |
Replicon accession | NC_013223 |
Strand | + |
Start bp | 565346 |
End bp | 566524 |
Gene Length | 1179 bp |
Protein Length | 392 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 645037023 |
Product | protein of unknown function DUF399 |
Protein accession | YP_003197336 |
Protein GI | 258404594 |
COG category | [S] Function unknown |
COG ID | [COG3016] Uncharacterized iron-regulated protein |
TIGRFAM ID | |
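The coordinate and length fields above are related by simple arithmetic. The following is a minimal Python sketch of those relationships. It assumes 1-based inclusive coordinates (consistent with the values listed) and a CDS whose final codon is a stop codon. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


def gc_percent(seq: str) -> float:
    """Percent G+C in a nucleotide string; whitespace is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return 100.0 * sum(b in "GC" for b in bases) / len(bases)


length = gene_length(565346, 566524)
print(length, protein_length(length))  # 1179 392
```

Passing the gene sequence from the Sequence section to `gc_percent` gives a value that can be compared with the GC content field, which the record reports rounded to a whole percent.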
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.978142 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGGATTGA TGCCGATCAT CTCTGTTGTG GTTGGCGCTT TGTTGCTGGC GGCCTGTGGC GGGCCGCCAG CCCTGCAAAG CGATCCGGCC GCGGGTTTGC GCAAAGGGAC ATTGGTCACC CACGCCGGGG AGCCAATGTC CCTTTTTGCT TTTGCGCAGC AGGCCCGTCA CGTCGACTAC CTCCTCATGG GCGAGGCCCA CACCAACGCC TGCGATCATG CGGTGCAGGC GGACGTGCTG CAGGTCTTGG CCTCCCAGGA CCTGCAGCCT GTGCTTGGCC TGGAAATGGT GCCGGCAGCG AAACAGCCTG TTTTGGACCG TTTCAACCAG GGCCGGCTCT CCGTGGAGGA GTTGCCTGAG GCCCTTGATT GGCAGACAAG CTGGGGCCAC CCGTATTCGC TGTACAAGCC GGTTTTCCAA GTCGCTTCCG ACGCCGGGGT CGATTTGTAC GCGCTGAATA TCGAGCAGGC CGTCCTGGAC GAGGTGCGGG AAAAGGGACT TGAGGGGATG GCGCCTGAGA AACGGGCGCG GCTCCCCGTA ACGATCCTGG ATCCGCCAGA ACCTCAGCGG CAGGCCCTCG AGGAAGAATT TTCCGGGCAC CAGAAACTGT TCCAGCAAAT GGGCAACGCT ACCAGCGGTT CATTGGAGCG GTTCATGTTG ATCCAGTCCA TCTGGGATAC CCAGATGGCT AGTCGGGCCC GGCGGGTGCA CGCGCAGACC GGGCGACCGG TGGTTATCCT GACCGGGACC GGGCATGTTG AATACGATTG GGGGATTGTC TCCCGACTGC ACCGCTGGGA CCCCCAAGCG AAGATCGTAA GCGTGATAGG ATGGCGTGGC GGGACGCTGC CCGAAGCTGA GGCCGCCGAC TGGTTTTTTT ATTGCCCCTT GCAGCATACC AGCCGCCTGG GGTTTACCAT GGAGATGCGT CCTGAAGGGG CTCGAGTGAT GACTGTGGAG CCCGGCAGTC GTGCCGCTCG GGGCGGACTG CAAAGCGGTG ATCTTCTGGT GAAAGTGGGC GGTGAGCCAT TCACCGGGTT GTGGGACTTG CATCAGGCCG CCATGGACGC AGTCCAGGCC GAAGAACCGA TGCAGATCAC AGTCAAGCGT GAGGGCGCCT CTGTTGAACT GAGCTTGGAT TTGCAACGCT CAGGGACTAC AGACGGTCCA ACAGATTGA
Protein sequence | MGLMPIISVV VGALLLAACG GPPALQSDPA AGLRKGTLVT HAGEPMSLFA FAQQARHVDY LLMGEAHTNA CDHAVQADVL QVLASQDLQP VLGLEMVPAA KQPVLDRFNQ GRLSVEELPE ALDWQTSWGH PYSLYKPVFQ VASDAGVDLY ALNIEQAVLD EVREKGLEGM APEKRARLPV TILDPPEPQR QALEEEFSGH QKLFQQMGNA TSGSLERFML IQSIWDTQMA SRARRVHAQT GRPVVILTGT GHVEYDWGIV SRLHRWDPQA KIVSVIGWRG GTLPEAEAAD WFFYCPLQHT SRLGFTMEMR PEGARVMTVE PGSRAARGGL QSGDLLVKVG GEPFTGLWDL HQAAMDAVQA EEPMQITVKR EGASVELSLD LQRSGTTDGP TD
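The protein sequence follows from the gene sequence by codon-by-codon translation. The following is a hedged Python sketch of that step. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as starts. This sketch does not handle alternative start codons (this gene begins with ATG, so none is needed here) or ambiguous bases such as N. The names `CODON_TABLE` and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
# '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    dna = "".join(cds.split()).upper()  # drop the 10-base grouping spaces
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(dna), 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGGGATTGA TGCCGATCAT CTCTGTTGTG"))  # MGLMPIISVV
```

Applied to the full gene sequence, the result can be compared against the listed protein sequence once the grouping spaces are removed from both.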