Gene Dret_1504 details


Gene Information

Locus tag: Dret_1504
Symbol:
ID: 8419333
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfohalobium retbaense DSM 5692
Kingdom: Bacteria
Replicon accession: NC_013223
Strand:
Start bp: 1744540
End bp: 1745610
Gene Length: 1071 bp
Protein Length: 356 aa
Translation table: 11
GC content: 55%
IMG OID: 645038078
Product: hypothetical protein
Protein accession: YP_003198368
Protein GI: 258405626
COG category:
COG ID:
TIGRFAM ID:
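
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_length // 3 - 1


length = gene_length_bp(1744540, 1745610)
print(length)                     # 1071
print(protein_length_aa(length))  # 356
```

Under these assumptions the result agrees with the listed Gene Length (1071 bp) and Protein Length (356 aa).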


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.00430527
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAAAGGCA AATCGATCTT TATTCTGGCC TGCGGCATTC TCCTCTTGTG TGTCTGGCCG 
GTGCAGTCCG ACCAGCAGCA GGACTGCTTC AGAACCAGCC TGCACCACAC CACCCGCGGC
ATGGCCACCT GGTATGACGC GGACAACGGT TTCAGCGCCA TCACCAATGT CCCCTACAAG
GACCTGGGAT GTAAAAATTG CCATGCCACC TCCTGCAACG ATTGCCATCT TGAAAAATCC
GGTGAGGGCT TTGCGTACTC CACGGCCAAG GCACGGCAAT CCTCGACTTG TCTCAAATGT
CACGCCCGGG AGAAGGCCAC CATCGGGATC GACACCGCCA GAAACTCCCT CGGCGTTCAT
ATCAAGGCCG GCATGCAATG CGCGGATTGC CATTCAGCCA GGGAAGTCCA CGGTGATGGA
ACCTGCTATG AAAGCATGCG CGCGCCAGGG GCAATGGATA CGGCCTGCAC AAATTGCCAC
ACCGAGGACA GCACCACCTA TCCGGCCATC CCCCCGACCG AATCGCATAT GGTCCACAGC
GGCAAACTCG ACTGTACAGC CTGCCACGTG GAAAACTCCA TGACCTGCTA CAATTGCCAT
TTCGGTGTTT TGCAAAAGAC CAAGAGCAAA CCAAAAAGCA TGGTCACCAA AACCAAGGAT
TTCCTGCTGC TTGTCAAATA TAACGGCAAA TTCATGAGTG GAACCATGCA GACGCTGGTT
GGCCCCGATA ATTACCCCTT CGTGGCCTAC GTTCCCTATT TCACCCATTC AGTGACCGAG
CAGGGGCGAA AATGCGAAAG CTGCCACTCC TCCAAGGCCC TCAAAGAGTT GGCTGCGGGC
AAGTCGTTCA ATGCCTCCAC CTACAAGGAC GGGAAACTCA GTTTTTTCGA GGGGGTCATC
CCGGTGGTTC CCGACCAGAT CAACTGGACT TTTCTGGAGA AAGCCGGGGA ACAGTGGACG
CCGTTTGAGC CGCCAGCCAA ACCGCTGGTC CAGATGGCGG TCTACGCTGA GCCCTTTACC
GACGACGAAC TGGAGATGAT GAACATGGAA CAGGTTTACA CCGGACAATG A
 
Protein sequence
MKGKSIFILA CGILLLCVWP VQSDQQQDCF RTSLHHTTRG MATWYDADNG FSAITNVPYK 
DLGCKNCHAT SCNDCHLEKS GEGFAYSTAK ARQSSTCLKC HAREKATIGI DTARNSLGVH
IKAGMQCADC HSAREVHGDG TCYESMRAPG AMDTACTNCH TEDSTTYPAI PPTESHMVHS
GKLDCTACHV ENSMTCYNCH FGVLQKTKSK PKSMVTKTKD FLLLVKYNGK FMSGTMQTLV
GPDNYPFVAY VPYFTHSVTE QGRKCESCHS SKALKELAAG KSFNASTYKD GKLSFFEGVI
PVVPDQINWT FLEKAGEQWT PFEPPAKPLV QMAVYAEPFT DDELEMMNME QVYTGQ