Gene Dret_2099 details


Gene Information

Locus tag: Dret_2099
Symbol:
ID: 8419949
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfohalobium retbaense DSM 5692
Kingdom: Bacteria
Replicon accession: NC_013223
Strand:
Start bp: 2388072
End bp: 2389169
Gene Length: 1098 bp
Protein Length: 365 aa
Translation table: 11
GC content: 58%
IMG OID: 645038692
Product: ABC transporter related
Protein accession: YP_003198961
Protein GI: 258406219
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3839] ABC-type sugar transport systems, ATPase components
TIGRFAM ID:
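
The coordinate and length fields above can be cross-checked with a short sketch. It assumes that the coordinates are 1-based and inclusive and that the protein length excludes the terminal stop codon; both are assumptions about this record's conventions, not something the page states. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 2388072
end_bp = 2389169
gene_length_bp = 1098
protein_length_aa = 365

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted in the protein length.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # span matches the reported gene length
print(span_bp % 3 == 0)                          # span is a whole number of codons
print(expected_protein_aa == protein_length_aa)  # protein length matches codons minus stop
```

Under those assumptions all three checks agree with the reported values.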


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.0897069
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTAGAA TCGTCCTCGA AAACGTTTCC CACACCTACG ATACGAGCGA CCGGCCGGAT 
TCGGACAAGA CCTTTGCCGT TCAGGGGTTG GACATCTGTT GGGACAATGG GACCGCAAAT
GCGTTGCTCG GCCCTTCGGG GTGCGGCAAG ACGACGCTTT TGAATATCAT TTCCGGCCTG
TTGACCCCGT CGCAGGGCCG GGTCCTGATC GACGGCCGGG ATGTGACCAC CCAACAGCCC
CGGGAGCGCA AGATCGCCCA GGTATTTCAG TTTCCGGTCG TGTATGACGC CATGAGCGTC
TACGACAACC TCGCCTTTCC CCTGCGCAAC GCCAAATATC CCCGGCAGGA GATTGATGCC
AAAGTCCGGG AGGTGGCCGA GATCCTGGAC CTGACAGATC TGCTCAAGGC CGCAGCTGCC
AAGCTCAATC CCGCGGATAA ACAAAAAATT TCCCTGGGGC GCGGAATCGT GCGCGAGGAC
ACGGCCGCGA TATTGCTCGA CGAGCCGCTG ACGGTCATCG ACCCGAAACT CAAATGGTAT
TTGCGGCGCA AGCTCAAAGA GGTTCAGGAA GAACTCGGCA GGACGATGAT TTATGTCACC
CATGACCAGC ATGAGGCGCT GACGTTTGCC GATCAGGTGA CCGTCATCCG GGACGGGGTT
CTGGTTCAAA ACGGCACGCC CCAGGAACTT CACGATGAAC CGCAGGATCC CTTCATCGGC
TATTTTATCG GCAGTCCGGG GATGAATTTC TTTGAATGCC ACCTGGAAGG GGAGCGCTTT
GTCTGCCGCG ACCAGCTGAC ATTTCCGGTG CCCCAAGCAT GGCGGGATGT TGTGCGCGAT
CATCAGGGCG AGCAATTTGG TCTTGGTATC CGCCCCGAGT TCGTTCACGT CCATCAGGAT
GCGCAGCAAG GCGCCCCGTG CCAGGTTCAA GTCATCGAGG ATACGGGGGC GTACCGGATT
TTCACGCTCT GGCAGGGAGA TATCCGGATC AAGGCTCGTG TCTCAGAGGC GGTGCGCCGT
CAGGAGGGCG ATGAGGTTGG TGTCACGTTC AGAGAGGATA AAATGAAACT TTTCCAAGGC
GCGAAACGCA TAATATGA
 
Protein sequence
MARIVLENVS HTYDTSDRPD SDKTFAVQGL DICWDNGTAN ALLGPSGCGK TTLLNIISGL 
LTPSQGRVLI DGRDVTTQQP RERKIAQVFQ FPVVYDAMSV YDNLAFPLRN AKYPRQEIDA
KVREVAEILD LTDLLKAAAA KLNPADKQKI SLGRGIVRED TAAILLDEPL TVIDPKLKWY
LRRKLKEVQE ELGRTMIYVT HDQHEALTFA DQVTVIRDGV LVQNGTPQEL HDEPQDPFIG
YFIGSPGMNF FECHLEGERF VCRDQLTFPV PQAWRDVVRD HQGEQFGLGI RPEFVHVHQD
AQQGAPCQVQ VIEDTGAYRI FTLWQGDIRI KARVSEAVRR QEGDEVGVTF REDKMKLFQG
AKRII
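
The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such a listing might be joined into one string and summarized follows. The helper names `join_blocks` and `gc_fraction` are hypothetical, and the sample input is only the first three blocks of the gene sequence, not the full record.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace (spaces, newlines) from a block-formatted sequence."""
    return "".join(text.split())


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a DNA string (0.0 for empty input)."""
    dna = dna.upper()
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence, used here only as a short sample.
sample = "ATGGCTAGAA TCGTCCTCGA AAACGTTTCC"
joined = join_blocks(sample)

print(len(joined))
print(round(gc_fraction(joined), 3))
```

Applying the same helpers to the full gene or protein listing would give totals that can be compared with the Gene Length, Protein Length, and GC content fields.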