Gene Dret_0980 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_0980 
Symbol 
ID8418802 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp1155390 
End bp1156439 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content59% 
IMG OID645037549 
Productprotein of unknown function DUF21 
Protein accessionYP_003197846 
Protein GI258405104 
COG category[R] General function prediction only 
COG ID[COG1253] Hemolysins and related proteins containing CBS domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.289763 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000463308 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTTGCAAC TCGTTCTCGC GGTTGGGCTG GCCATCGGCA TCTCCTTCCT CTGTTCGGTC 
GCCGAGGCTG TGCTCTATTC AGTCCCGTGG AGTCATATCG AAAAGCTCCG GAGGTCGGGG
GAGCGCAAAG GGGAACTCCT CTACCGGCTC CGAGTCGATG TAGACGAGCC GATCACCGCC
ATCCTGACCC TGAACACCGT GGCCCATACT GCGGGGGCCT CTGTGGCCGG TGCAGCGGCG
GCGCAGGTGT TCGGCCAAGA ATCCCTGTTC GCCTTCTCCG TTTTCTTCAC CCTGGCCATC
CTCCTGCTAT CGGAGATCAT TCCCAAGACC CTGGGCGTCG TGTATACCCG GGGGCTCTCT
TTGTGGGTGG CGCGCCCCTT ACACCTCCTG GTCTTTGTCA TGCGGCCCAT TGTGAACGTC
TCCAGCTATC TGGTCCGTTT TTTGGGTAAG CGCAAGCTCG GTCCGGAGGC CTCGGAAGAG
GATGTCCGGG CCATGGTCAG CCTCTCGCGC CAGGCTGGAG TGCTCAAGCC CTACGAAGAG
ATGTCCATCA AGAATATCCT GACCCTGGAC AGCAAACGGG TCAAAGATAT TATGACCCCA
CGGATGGTCA TTTTTTCCCT GCCGGCGCAT TTGACCGTGG CCGAGGCCCG GGAAGCCAAA
TTGGTCTGGC CCCACAGTCG GATCCCAGTC TATGAAGGCG ATGATCCCGA AGAAGTCATC
GGCATCGTCT ATCGCCGTGA ATTGCTCGAA GCACTGGCCG ACGACCAGGA CACACGGCAT
CTCAGCGATC TGATGCGCTC TGCCCACTTT GTCCTGGAAA GCCTGACCCT GGACCGGCTT
TTGGTCCAGT TTCTGGAGTC ACGGATGCAC TTGGCGGTGG TCCTTGACGA ATACGGCGGA
CTTGCCGGGG TGGTGACCCT GGAAGACGTT CTCGAAGAAA TTCTTGGCAA CGAAATAGTT
GACGAAACCG ACCAAGTTGT GGACATGCGG GAACTGGCCC GGCAGCGCCG GGAGAAGTTG
GTGCAGGAGC GGCGGAAATC GCATTCCTGA
 
Protein sequence
MLQLVLAVGL AIGISFLCSV AEAVLYSVPW SHIEKLRRSG ERKGELLYRL RVDVDEPITA 
ILTLNTVAHT AGASVAGAAA AQVFGQESLF AFSVFFTLAI LLLSEIIPKT LGVVYTRGLS
LWVARPLHLL VFVMRPIVNV SSYLVRFLGK RKLGPEASEE DVRAMVSLSR QAGVLKPYEE
MSIKNILTLD SKRVKDIMTP RMVIFSLPAH LTVAEAREAK LVWPHSRIPV YEGDDPEEVI
GIVYRRELLE ALADDQDTRH LSDLMRSAHF VLESLTLDRL LVQFLESRMH LAVVLDEYGG
LAGVVTLEDV LEEILGNEIV DETDQVVDMR ELARQRREKL VQERRKSHS