Gene Dret_2095 details

Gene Information

Locus tag: Dret_2095
Symbol:
ID: 8419945
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfohalobium retbaense DSM 5692
Kingdom: Bacteria
Replicon accession: NC_013223
Strand:
Start bp: 2384042
End bp: 2385751
Gene Length: 1710 bp
Protein Length: 569 aa
Translation table: 11
GC content: 57%
IMG OID: 645038688
Product: ABC-type sugar transport system, periplasmic component
Protein accession: YP_003198957
Protein GI: 258406215
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
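
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption that matches the reported gene length) and that the final codon of the CDS is a stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the record above; helper names are hypothetical.
START_BP = 2384042
END_BP = 2385751


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Number of residues in a CDS whose last codon is a stop codon (assumed)."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1710 569
```

Both results agree with the Gene Length and Protein Length fields of the record.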


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.105917
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTATGCA AACGTCGGGT GGGTTTGGCC GTTCAACTGG TCATTGCCCT AGCTTTGTTG 
GTCCCCGCCT CCGCAGGGGC CCAGGATACC ATTGGAAAGT GGGTCGACGT CTTTCAACCC
TCGGTGCTGA ACCAAAAGCA GCAACGCCAG GAATTGGAAT GGTTCAAACA GGCGGCCGAG
CCGATGCAGG GCCTGGAAAT CAAATCCGTG GCCGAAGGCA TCACCACCCA TAAATGGGAG
GCCAAGGTCT TGGCCCAGGC CTTTTATGAA ATCACCGGCA TCAAGGTCAC CCATGACATC
ATTGGCGAAG GTGAGGTTGT CGACCGGGTC CAGCGCCAGA TCCAGACCCA ACGGAAGATT
TACGATATCT ACGTCAACGA TGCTGATTTG ATCGGTACCC ATCTCCGCCT CGACAGTGCC
CTGAACCTCA GCGACTACAT GAAAGGTGAA GGCGCTGAGG TCACTAACCC CATGCTGGAT
CTGGACGATT TCCTGAACCC GGAATTCGGC CAGGATTATG ACGGCAACCA ACTCCAATTG
CCGGACCAGC AATTCGCCAA CCTGTATTGG TTTCGCTACG ACTGGTTCAC CGACCCGAAG
TACAAAAAAG AGTTCCAGGA TGAATACGGC TATGAACTCG GCGTGCCCGT GAACTGGGCT
GCCTATGAGG ACATTGCCGA GTTCTTCACC GGCAAGACCA TCGATGGACA GACCGTGTAT
GGCCATATGG ATTATGGCAA GAAATCCCCG TCTTTGGGCT GGCGGTTTAC CGACGCTTGG
CTGTCTATTG CCGGTGTCGG AGATAAAGGG CTGCCCAACG GGTACCCCGT GGATGAATGG
GGCATCCGGG TCGACGGCAA AACCCCTGTG GGGGCCAGTG TTGAGCGTGG TGGTGCGGCC
AATAGTCCGG CCGCGGTCTA CGCGACGACC AAGTATGTCG AATGGCTCAA GAAGTACGCT
CCCCCCTACG CCGCTTCCAT GACCTGGTCT GAAGCAGGGC CAACCCCTGC CCGCGGCAAT
GTGGCGCAGC GGGTTTTCCA GTACATCACC TGGCTCTCCG ACCCGGCCTT CAATTCACCG
GACAGCCCGG TCACAGACGC TACCGGCAAG CCGGTCTGGC GTGTGGCCCC GACACCGCAC
GGCAAATACT GGGATGAAGG AATGAAGGTT GGCTATCAGG ATGCCGGCAG TTGGACGATC
CTGAAGGACA GTGTGACAGG AAAGTACCGC AAGGCCGCTT GGCTGTGGGC CCAATTCTGC
GTGTCCAAGT CGGTGTGTCT GAAGAAATTC CTCGTGGGTC GCACCCCCAT CCGGAAATCC
ACGGTTTTTT CCGATTACCT GGCCAAGGAA GAGGAAAAAG GGACGTACGG CGGAATCGTG
ACCTTCTACA AGTCCCCGGT GGAGCATATG TGGACCGACT CCGGGCCGAA TGTGCCCCAT
TATCCGCTTC TGGCCGAGCA GTGGTGGAAA AATGTCGCCC TGGCCGTTAC CGGTGAAGCC
ACACCCCAGG AAGCGATGGA CAGTTTGGCC TATAAAATGG ACGACCTGAT GGGGAAAATG
CGGCTCAACC AGTATTCTCC GAAACTCAAT CCCAAAAAGT CCCGGGAATA TTGGCTTTCC
CAGCCCGGCT CACCCAAGCC GGTCCGCTCC GAAGAAGAGC CGGAAACCAT GCCCTATGAC
GAAATGCTGA AGAAATGGAA GAACCAATAG
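
The gene sequence is printed in space-separated blocks of 10 bases, so it needs to be joined before its length or GC content can be compared with the Gene Length and GC content fields. The following is a minimal sketch of that cleanup. The helper names are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence above, used here as sample input.
first_line = "ATGGTATGCA AACGTCGGGT GGGTTTGGCC GTTCAACTGG TCATTGCCCT AGCTTTGTTG"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"GC fraction of this fragment: {gc_fraction(seq):.2f}")
```

Passing the full multi-line sequence to `clean_sequence` works the same way, since `str.split()` with no arguments splits on spaces and newlines alike.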
 
Protein sequence
MVCKRRVGLA VQLVIALALL VPASAGAQDT IGKWVDVFQP SVLNQKQQRQ ELEWFKQAAE 
PMQGLEIKSV AEGITTHKWE AKVLAQAFYE ITGIKVTHDI IGEGEVVDRV QRQIQTQRKI
YDIYVNDADL IGTHLRLDSA LNLSDYMKGE GAEVTNPMLD LDDFLNPEFG QDYDGNQLQL
PDQQFANLYW FRYDWFTDPK YKKEFQDEYG YELGVPVNWA AYEDIAEFFT GKTIDGQTVY
GHMDYGKKSP SLGWRFTDAW LSIAGVGDKG LPNGYPVDEW GIRVDGKTPV GASVERGGAA
NSPAAVYATT KYVEWLKKYA PPYAASMTWS EAGPTPARGN VAQRVFQYIT WLSDPAFNSP
DSPVTDATGK PVWRVAPTPH GKYWDEGMKV GYQDAGSWTI LKDSVTGKYR KAAWLWAQFC
VSKSVCLKKF LVGRTPIRKS TVFSDYLAKE EEKGTYGGIV TFYKSPVEHM WTDSGPNVPH
YPLLAEQWWK NVALAVTGEA TPQEAMDSLA YKMDDLMGKM RLNQYSPKLN PKKSREYWLS
QPGSPKPVRS EEEPETMPYD EMLKKWKNQ
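
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this on the first and last 30 bases of the gene. It builds the codon table from the standard amino-acid string in TCAG base order. Translation table 11 uses the same codon-to-residue assignments as the standard code and differs in its permitted start codons, which this sketch does not model. The `translate` function is illustrative, not a library call.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last 30 bases of the gene sequence above.
print(translate("ATGGTATGCA AACGTCGGGT GGGTTTGGCC"))  # MVCKRRVGLA
print(translate("GAAATGCTGA AGAAATGGAA GAACCAATAG"))  # EMLKKWKNQ
```

The two outputs match the first and last blocks of the protein sequence, and the final codon TAG is the stop codon that is not counted in the protein length.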