Gene Information |
Locus tag | Dret_0980 |
Symbol | |
ID | 8418802 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfohalobium retbaense DSM 5692 |
Kingdom | Bacteria |
Replicon accession | NC_013223 |
Strand | + |
Start bp | 1155390 |
End bp | 1156439 |
Gene Length | 1050 bp |
Protein Length | 349 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 645037549 |
Product | protein of unknown function DUF21 |
Protein accession | YP_003197846 |
Protein GI | 258405104 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
|
|
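The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields in this record.
start_bp = 1155390
end_bp = 1156439
reported_gene_length = 1050      # bp
reported_protein_length = 349    # aa


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


computed_gene_length = gene_length(start_bp, end_bp)
computed_protein_length = protein_length(computed_gene_length)

# Compare the computed values with the values reported in the record.
print(computed_gene_length == reported_gene_length)
print(computed_protein_length == reported_protein_length)
```

Under these assumptions both comparisons hold for this record: 1156439 - 1155390 + 1 gives 1050 bp, and 1050 / 3 - 1 gives 349 aa.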
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.289763 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.000463308 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTTGCAAC TCGTTCTCGC GGTTGGGCTG GCCATCGGCA TCTCCTTCCT CTGTTCGGTC GCCGAGGCTG TGCTCTATTC AGTCCCGTGG AGTCATATCG AAAAGCTCCG GAGGTCGGGG GAGCGCAAAG GGGAACTCCT CTACCGGCTC CGAGTCGATG TAGACGAGCC GATCACCGCC ATCCTGACCC TGAACACCGT GGCCCATACT GCGGGGGCCT CTGTGGCCGG TGCAGCGGCG GCGCAGGTGT TCGGCCAAGA ATCCCTGTTC GCCTTCTCCG TTTTCTTCAC CCTGGCCATC CTCCTGCTAT CGGAGATCAT TCCCAAGACC CTGGGCGTCG TGTATACCCG GGGGCTCTCT TTGTGGGTGG CGCGCCCCTT ACACCTCCTG GTCTTTGTCA TGCGGCCCAT TGTGAACGTC TCCAGCTATC TGGTCCGTTT TTTGGGTAAG CGCAAGCTCG GTCCGGAGGC CTCGGAAGAG GATGTCCGGG CCATGGTCAG CCTCTCGCGC CAGGCTGGAG TGCTCAAGCC CTACGAAGAG ATGTCCATCA AGAATATCCT GACCCTGGAC AGCAAACGGG TCAAAGATAT TATGACCCCA CGGATGGTCA TTTTTTCCCT GCCGGCGCAT TTGACCGTGG CCGAGGCCCG GGAAGCCAAA TTGGTCTGGC CCCACAGTCG GATCCCAGTC TATGAAGGCG ATGATCCCGA AGAAGTCATC GGCATCGTCT ATCGCCGTGA ATTGCTCGAA GCACTGGCCG ACGACCAGGA CACACGGCAT CTCAGCGATC TGATGCGCTC TGCCCACTTT GTCCTGGAAA GCCTGACCCT GGACCGGCTT TTGGTCCAGT TTCTGGAGTC ACGGATGCAC TTGGCGGTGG TCCTTGACGA ATACGGCGGA CTTGCCGGGG TGGTGACCCT GGAAGACGTT CTCGAAGAAA TTCTTGGCAA CGAAATAGTT GACGAAACCG ACCAAGTTGT GGACATGCGG GAACTGGCCC GGCAGCGCCG GGAGAAGTTG GTGCAGGAGC GGCGGAAATC GCATTCCTGA
|
Protein sequence | MLQLVLAVGL AIGISFLCSV AEAVLYSVPW SHIEKLRRSG ERKGELLYRL RVDVDEPITA ILTLNTVAHT AGASVAGAAA AQVFGQESLF AFSVFFTLAI LLLSEIIPKT LGVVYTRGLS LWVARPLHLL VFVMRPIVNV SSYLVRFLGK RKLGPEASEE DVRAMVSLSR QAGVLKPYEE MSIKNILTLD SKRVKDIMTP RMVIFSLPAH LTVAEAREAK LVWPHSRIPV YEGDDPEEVI GIVYRRELLE ALADDQDTRH LSDLMRSAHF VLESLTLDRL LVQFLESRMH LAVVLDEYGG LAGVVTLEDV LEEILGNEIV DETDQVVDMR ELARQRREKL VQERRKSHS