Gene Dret_1738 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_1738 
Symbol 
ID8419578 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp2001627 
End bp2003345 
Gene Length1719 bp 
Protein Length572 aa 
Translation table11 
GC content61% 
IMG OID645038321 
Productprotein of unknown function DUF181 
Protein accessionYP_003198600 
Protein GI258405858 
COG category[S] Function unknown 
COG ID[COG1944] Uncharacterized conserved protein 
TIGRFAM ID[TIGR00702] uncharacterized domain 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.858461 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGCTTA AGAACTGCTT CAAGGGATCC ACCACCGATC TGGACAAGGT CTGCTCGCCG 
CAGGAAACCG TCGCCCGCGT CGGAGAAGTG CTCGAGCGCT ACGGCGGCGT GTTGTCCGAA
AATAAACGAA TCGACACCGG TCGTCTGGGA ATACCGGTCT ACATGAGTTG CTGTGGCCCC
AAGGCCCTTG ATCTGTTGCC GGGGCGCAAG CAAATGGGCA AAGGGGCTTC CCCGGAACAG
GCTCAGGCTT CGGCCCTCAT GGAACTCACG GAACGCTTCA GTTTTTTCGC TTTCTGGCAA
GACGGCCACT CCTGCGAGAC ATGCACCTGG ACCGAGGCCA AGGCCCGCTA CGGCGATGCC
CTGATCCCAA TAGAACACAT TCTGGATTCG GTCGGGGACG ACCTCGATCC GGATCTGGCT
GAAAATCTGC TTGATCTGCT CGCCTGGCGG TTCGCCAAGG TCTGGGACCT GGCCCTGGAA
CGCGAGGTGT ACGCCCCTGT GGATTGGTTC AGGCTGCTTA ATGAGTATAA TGGGTCCTGC
GCGGGCAACA CCAATGAAGA AGCCATCCTC CAGGGCTTGT GCGAAGTCAT CGAACGGCAT
GTCTGCGCCC GGATCGATGA CCGGACGCCC GAGTTGCCGA CGATCGATCC GGACTCGTGT
ACCGACCCCA CCTTGCGGGC CTTGCTTGAT ACCTTTGAGG CCAACGGCAT CCGGGTCTGG
CTCAAGGATT TCTCGTATGG CCTGCCCGCC CCGACAGTGG GAGCCCTGGC CTACGACCCC
CAGACTTTTC CCGAGTCCAG CGAAATCGTC TTCACCGCTG GGACCGCTTC GAGCCCGGAC
AAAGCAGCCA TCCGGGCCCT GACTGAAGTC GCCCAACTCG CCGGTGACTT CCAGACCAGC
AGCAATTACG AGGCCTCCGG GCTCAGCAAA TACACCAGCC TGGACGAATG CGCCTGGCTG
ACCTGGGGCA GAGCCTGTTC CCTGAACTCG CTACCGGACA TCACCGACGA AAACATCGCT
CATGAAATAC GGACCCTGGC CCGCTCTCTG GACCGACAGG GACTGCGCGC CTACGCTGCG
GATATGACCC ACCCCGACCT TACTATTCCA GCCTTTTACA CCTTTATCAC CGGCTGCGAC
TTCCGTGAAC GGACCCGGCA CCCCAGCCTC GGCCTGTTTG TCGGGCGGCG GCTGGCGGAA
GACACCCCGG TGGAAGAGGC CCGGCCCGGT TTGCAGATTC TGGAAGCCCT GCAGCCCGAA
GCCCCCTATG TTCCCTTTTT CCAGGGCCTG CTCGCCCTGC GCGAAGACGC CCCCGGACAG
GCCATGACCT GCTTCGCCGC TACGGCTGGT CTCCAGCCGG GCCGGGAGGA GCGGGCCCTG
GTCGACTTCT ATCTTGGCTA CAGCGCCTCC CTAATAAACG ACTGGGAAAC GACCGCCACC
GCTCTCGACC GCTCCCTGGC AGCCAGTCGT TCCCATGCCG CCTTCAACCT TCGGGGGGTG
GCCGCATTCA AACAAGCGCA ATACGCCGAG GCGCGGAAGC ATTTTGAACA GGCCCTTGCC
GAGGACAGCG GCTCGGCTAT GGATATAGCC AATGTGGGGA TGTGCGCTTT GAAATTGGGA
GACAGGCAAG AGGCCATTAC ATGGCTTCAA ACCGCCCTCG AGCTCGATCC CGGCATCGAA
TTCGCCCGGG AAACCTTGGC AGGCCTGCTC AATACTTGA
 
Protein sequence
MLLKNCFKGS TTDLDKVCSP QETVARVGEV LERYGGVLSE NKRIDTGRLG IPVYMSCCGP 
KALDLLPGRK QMGKGASPEQ AQASALMELT ERFSFFAFWQ DGHSCETCTW TEAKARYGDA
LIPIEHILDS VGDDLDPDLA ENLLDLLAWR FAKVWDLALE REVYAPVDWF RLLNEYNGSC
AGNTNEEAIL QGLCEVIERH VCARIDDRTP ELPTIDPDSC TDPTLRALLD TFEANGIRVW
LKDFSYGLPA PTVGALAYDP QTFPESSEIV FTAGTASSPD KAAIRALTEV AQLAGDFQTS
SNYEASGLSK YTSLDECAWL TWGRACSLNS LPDITDENIA HEIRTLARSL DRQGLRAYAA
DMTHPDLTIP AFYTFITGCD FRERTRHPSL GLFVGRRLAE DTPVEEARPG LQILEALQPE
APYVPFFQGL LALREDAPGQ AMTCFAATAG LQPGREERAL VDFYLGYSAS LINDWETTAT
ALDRSLAASR SHAAFNLRGV AAFKQAQYAE ARKHFEQALA EDSGSAMDIA NVGMCALKLG
DRQEAITWLQ TALELDPGIE FARETLAGLL NT