Gene Dret_2100 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_2100 
Symbol 
ID8419950 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp2389175 
End bp2390305 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content55% 
IMG OID645038693 
ProductABC transporter related 
Protein accessionYP_003198962 
Protein GI258406220 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3839] ABC-type sugar transport systems, ATPase components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.134118 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGAATAG TGGTTGAGTC GGTATCGAAG TCCTTTAGCG GGACGTTCGC CTTGAAGGAT 
GTCAGTGTCA GCATTGAAGA CGGGCAATTC GTCACCTTTC TGGGGCCGCT GGGCGCTGGC
AAGACGACAT TGCTGCGGAT CATGTGTGGT ATTGACCGCC CGGATTCAGG ACGGATCTAT
TATGACGGTC AGGACGTAAC CGATGTTGCG GTTCAGAAAC GGCCGGTAGC CATGGTCTAC
CAGCAGTTTG TCAATTACCC TTCCATGACC CTGTATGAAA ATATTGCTTC GCCGTTGCGT
GTCAGTCGGC GCAAATACTC GAAAGGGGAG ATCGAAAAAC GTGTCCATGA AAGTGCCGAT
CTGTTGGGGA TCCGTCAGAT CCTGGGCCAC TATCCTGAAG AAGTCAGTGG CGGTCAGAAA
CAACGCGCTG CCATCGCCCG CGCTCTGACC AAGGACGCCA AGTTTATTTT TTTGGACGAA
CCGCTGGCCA ATCTGGACTA CAAGCTCAGA GAGGAGCTAC GCGGGGAATT GAAGGAAATC
CTGCGGCGCA AAGGGGGCGT GGTGGTTTAT GCCACGCCTG AAGCTGTCGA CGCCCTGTCC
ATGGCCTCCC ATGTGGGGTA TATCGAAAAC GGGCAACTCT GGCAGTACGG GGCCCTCAAA
CACGTCTACC GGTATCCGCA ATTCAAAGAG GTCGGGCGGT ATTTCAGTTA TCCGACGATG
AATATTTTGC CGGGGACGGT CGAAAAATAC GCCAAGGGGG CGGCACTTGT CCTCTCAGAT
GATTTGCGGG TGGATGTCTC CCGCATTGCC GATCAGTTGG ACCAGGAGGT CTACCAGGTC
GGTATCCGGG CCTATAATAT CAGCACGAGC AAGGAACATG CGGAGATGGT TCCATTTCAG
GCCGAAGTGG AGCTTTCGGA GGAGTTGGGG TCTGATACAG AACTCCACGT GCGCCACAAC
GGCCAAACCC TGGTCGTCTT GCTTCAGGAA TTCGCCCGCC ACGAGATCGG GCAAAAAGTG
ACGCTCTACC TGGACAGTAC CCGCCTCTTT CTTTTTCATC CGCATACAAA CGAACTGGTG
CTCAAGACAT TTCAGGAAAC CACGGCGACG TCAGCGGCCC AGGAGGCATG A
 
Protein sequence
MGIVVESVSK SFSGTFALKD VSVSIEDGQF VTFLGPLGAG KTTLLRIMCG IDRPDSGRIY 
YDGQDVTDVA VQKRPVAMVY QQFVNYPSMT LYENIASPLR VSRRKYSKGE IEKRVHESAD
LLGIRQILGH YPEEVSGGQK QRAAIARALT KDAKFIFLDE PLANLDYKLR EELRGELKEI
LRRKGGVVVY ATPEAVDALS MASHVGYIEN GQLWQYGALK HVYRYPQFKE VGRYFSYPTM
NILPGTVEKY AKGAALVLSD DLRVDVSRIA DQLDQEVYQV GIRAYNISTS KEHAEMVPFQ
AEVELSEELG SDTELHVRHN GQTLVVLLQE FARHEIGQKV TLYLDSTRLF LFHPHTNELV
LKTFQETTAT SAAQEA