Gene Dret_2122 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_2122 
Symbol 
ID8419972 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp2412990 
End bp2414219 
Gene Length1230 bp 
Protein Length409 aa 
Translation table11 
GC content57% 
IMG OID645038715 
Productputative ABC transporter solute-binding protein 
Protein accessionYP_003198984 
Protein GI258406242 
COG category[R] General function prediction only 
COG ID[COG4134] ABC-type uncharacterized transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.708087 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCATTGG TTTTTCGTTT TGGCCTGGTT GCAGGAGTAC TGGGATGCCT GTTGTGCATT 
GCCGGATGCC AGCAAACAGC ATCCGAACCG GATTGGCGCA CAAAGGATTT TGAGCGCATT
ATCGAGGCGG CCCGCGGCAC CACAGTGCGC TGGTACATGT ACGGTGGCTG GCCCCATGTC
AATGAATGGG TCGATACCTA TGTCGCCCCG GCCATGCAGG AGCGTTACGG AATCAGTGTC
AAACGCGTTC CCATGAACGC GCCTGTTTTT GTCAATAAAT TGATCAACGA AAAAAGTGCC
GGCAAAGACC CCGGCACCAT TGATCTGGTC TGGATCAACG GCGAAAATTT TAAGGCCACC
AAGAATGCAG GGGCCTTGTG GGGCCCCTTT GCTGAGCAGC TCCCGAATTG GCAGCGGTAT
GTCGACCCTT CTACCGTGGC CCAGGACTTC GGCTTTCCCA CAAAGGGGTA TGAAGCCCCT
TGGGGCCGGG CCCAATTCGT CTTGATCTAC GATGCCAAAC GCACGCCCAA TCCGCCGCGC
TCAGCAGAAA GTCTGCGCCG ATGGATCCAG GACCACCCTG GCCGTTTCAC CTATCCACAA
CCGCCGGATT TCACCGGATC GGCCTTTGTC CGTCAGCTCT TCTACGCCAC CACAGGTGGG
CACGAACAGT ATATGGACGG CTTCAACGCT ACCTTGTACG CTCGAAACGC CCCCCGCCTC
TGGGAGTATC TCAACGGGAT CGAGCCGTCA TTATGGCAAC AAGGCCGGAC ATACCCCCAA
AGCTCTGCAA CTCTGGACAC CTTGTTCGCC AGAGGCGAAG TCGATTTCAG TATGTCCTAC
CATCCGCCGC ACGCCCAAAA CAAAATCCTG GACGGCACTT TCCCCGCCAG CGTGCGGACG
GTGGCATTGG CCAACAATTC GATTGCCAAC ACCCACTACA CGGCCATTCC CTTCAATGCC
CCCAACAAAC CGGGGGCTAT GGTCCTGGCC AATTTTCTGC TCTCGCCCAC GGCCCAGCTC
TCGAAATACA AGCCTGAAAA CTGGGGGGAT TTTCCGGCCA TTGATCTCGA CCGCCTGGAC
CAGTCCCAAC GCCGACGCTT CGAGGATGTC GACCTCGGTC CGGCCACATT GAGCGCCGAG
ACCCTGGCTG AGCACGCGGT CCCTGAAATT CCCATCGGCT ATCTGGAAGC CATTGAAGCC
GATTGGAAGT CCCGAGTCCT GACCAATTGA
 
Protein sequence
MPLVFRFGLV AGVLGCLLCI AGCQQTASEP DWRTKDFERI IEAARGTTVR WYMYGGWPHV 
NEWVDTYVAP AMQERYGISV KRVPMNAPVF VNKLINEKSA GKDPGTIDLV WINGENFKAT
KNAGALWGPF AEQLPNWQRY VDPSTVAQDF GFPTKGYEAP WGRAQFVLIY DAKRTPNPPR
SAESLRRWIQ DHPGRFTYPQ PPDFTGSAFV RQLFYATTGG HEQYMDGFNA TLYARNAPRL
WEYLNGIEPS LWQQGRTYPQ SSATLDTLFA RGEVDFSMSY HPPHAQNKIL DGTFPASVRT
VALANNSIAN THYTAIPFNA PNKPGAMVLA NFLLSPTAQL SKYKPENWGD FPAIDLDRLD
QSQRRRFEDV DLGPATLSAE TLAEHAVPEI PIGYLEAIEA DWKSRVLTN