Gene Dret_1101 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_1101 
Symbol 
ID8418926 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp1291654 
End bp1293630 
Gene Length1977 bp 
Protein Length658 aa 
Translation table11 
GC content60% 
IMG OID645037673 
ProductABC transporter related 
Protein accessionYP_003197967 
Protein GI258405225 
COG category[R] General function prediction only 
COG ID[COG0488] ATPase components of ABC transporters with duplicated ATPase domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.0871053 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.0818517 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTGACGC TTACCCTCCA GAATCTCAGC AAGGGCTTTG GCGCCAACCA GCTGTTTTCG 
GGGCTGTCCG GGAGTATCAG TTCGGGGATG CGAGTAGCCG TCATCGGCCC AAACGGCTGT
GGCAAATCGA CGTTATTGAA GATTATCAAC AACGAGACCG GGGCGGACGA GGGCGTGGTC
CAGGTCCCGA AAGGGGCGCG GATCGGCCGG GCGGAGCAGG AGTTGGCCCA GCATGATCTG
GACAGCCCGC TGCTGGCTTG GGTATTGGAA AATCTGCCCA ATTGGCAGGC GTTCTGGGCC
CGTTGGGAAC GGGCCCAAGA AAACGGGGAG ACTGACTCCC TGTCCCGCCT TGCGGCGGAG
CAGACCCGTC TGGAGCACCA GTACGGCTAC AACCCGGAGC ACCGGGCCAA GTCGATCCTC
AGCGGCCTTG GCTTCGGCAC AGGGCAGTTT GCCCAACCCT TGGCTTCCCT GAGCGGGGGC
TGGCGTGAAC GGGCCAAGTT GGCCCGGGTG CTGGTCGAAG GCGCGGACAT TTTGCTTCTC
GACGAACCGA CCAACCACCT CGATCTGGAG GCGGTGCAGT GGTTGGAGCA ATTCCTTCAC
GATTTTCCAG GGATTTTGCT TTTCGTGGCC CACGACCGGG TTTTTTTGGA CACCGTGGCC
ACGCACACCC TGTCCCTGGG CAGTGCCCGC CCGGTTTTCA GAGAAGGCAA TTTCTCCGCC
TATCTAGCCT GGGAAGAGGA AAAGCGCCGC TTGGCCGATA AAGAGCGCGC CAAGCTCGAT
CGACAGATCA AGCATAAGCA GGCCTTTGTC GATCGGTTTC GGTACAAGGC GACCAAAGCC
AGGCAGGCTC AGAGTCGGCT GAAACAGATC GACTGGTTGG AAGAGCAGCG GCGACAGCAC
GAAAACGAAG CCAGCGCCAG GAGCTTATCC TTCCAGTGGC CTCCCCCTGT TCGGGCCGGG
CACCATGTCC TGAGCCTGGT CGACGTGACC TACGGCTTTG CGCACGAGCC CCCACTGTGG
TCACCGGTCA CGGCGAATAT CTACCGCGGC CAGCGCATTG CTCTGGTCGG GCCCAACGGG
TGTGGCAAAT CAACCCTGCT CAAACTCATC GTGGGTGAGC ATTCGCCGCG CCAGGGAAGC
ATCAAGCTTG GCACCGGAGT CAAGGTCGGC TATTTCAGTC AACACCAGAC CGATATCCTC
CAGCCGAGAC AGACCGTGCT CGGTGAACTC CGCCGGCTGG CCCATCCCAA GATCAAGGAA
GAGGAACTCC GGTTTGCCCT GGGGCTGTTT TTGCTTGATG AGAGCTATTG GGAACGGCTT
GTCAGTGAGC TCAGTGGCGG GGAGAAGAAC AGGCTTGTAC TTGCCTCGCT GTTTCTGGGG
CAGGCCAATT TTCTTATCCT GGACGAACCG ACGAACCATC TCGATCTGGA GAGCCGCGAA
GCCTTGGTCC AGGCCCTGCG AGGCTACGAA GGTACGGTCC TTGTGGTGGC CCACGACCGC
TACCTGCTCT CAGAGGTGGC GGATGATGTC TGGGTCCTCG GGGGCCAAGG CATCGAAATG
TTGACCCGCG GCTTTGCCGA ATATGAAGAA CGCCTCCAGG AATTGCACAA CACCGGCCAG
GACCAGACTC CAACGGAGCC CTCAGCCAAA CCCTCCCGTG AAACGGTCAA GGCCAGACGG
CGGGAATTGG CAGAAGCGCG CAACGCTTTG TATCGACGCC TGCACCCCAA GCAAAAACGG
TATCAGGAAC TCGAACAGTT GCTGGAAGCC AACCTCGAAA GACAGACTGA GCTCGAACAG
CGCCTGGCGG ACCCGAGTAC CTATGAGCAG GCGGACCAGG CCCTGGCCGC GAATCAGGAA
TACGAAGAAG CCCGCCGCTG GGGCGAGCAA CTCATGGAGG AAATGGCCGA GTTGGAGGAA
GGGATGGCCC ATATCCGCGA GGAACAGGAC AAACTGCGCG ATGCCGGAGG AGACTAA
 
Protein sequence
MLTLTLQNLS KGFGANQLFS GLSGSISSGM RVAVIGPNGC GKSTLLKIIN NETGADEGVV 
QVPKGARIGR AEQELAQHDL DSPLLAWVLE NLPNWQAFWA RWERAQENGE TDSLSRLAAE
QTRLEHQYGY NPEHRAKSIL SGLGFGTGQF AQPLASLSGG WRERAKLARV LVEGADILLL
DEPTNHLDLE AVQWLEQFLH DFPGILLFVA HDRVFLDTVA THTLSLGSAR PVFREGNFSA
YLAWEEEKRR LADKERAKLD RQIKHKQAFV DRFRYKATKA RQAQSRLKQI DWLEEQRRQH
ENEASARSLS FQWPPPVRAG HHVLSLVDVT YGFAHEPPLW SPVTANIYRG QRIALVGPNG
CGKSTLLKLI VGEHSPRQGS IKLGTGVKVG YFSQHQTDIL QPRQTVLGEL RRLAHPKIKE
EELRFALGLF LLDESYWERL VSELSGGEKN RLVLASLFLG QANFLILDEP TNHLDLESRE
ALVQALRGYE GTVLVVAHDR YLLSEVADDV WVLGGQGIEM LTRGFAEYEE RLQELHNTGQ
DQTPTEPSAK PSRETVKARR RELAEARNAL YRRLHPKQKR YQELEQLLEA NLERQTELEQ
RLADPSTYEQ ADQALAANQE YEEARRWGEQ LMEEMAELEE GMAHIREEQD KLRDAGGD