Gene Dret_2456 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_2456 
Symbol 
ID8420318 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp2817806 
End bp2818819 
Gene Length1014 bp 
Protein Length337 aa 
Translation table11 
GC content58% 
IMG OID645039059 
ProductExtracellular solute-binding protein 
Protein accessionYP_003199316 
Protein GI258406574 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1638] TRAP-type C4-dicarboxylate transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAACGCC GTATCGCCTT GTCACTGATG CTCCTCATGG TTTTCCTGGG GACCACTGCC 
GTCTCTACCT CAGCCACCAC ATTGACCTAT GCCAATTTTC CTCCCGCCCC CACATTTCCC
TGCGTTCAGA TGGAGCGCTG GGCCGATGAA GTCGAAAAAC GCACCGACGG GGCATTGACC
ATCGAGACTT TCCCCGGTGG CACCCTGCTC GGGGCCAAAC AGATCTGGCG TGGCGTCCAA
TACGGTCAGG CCGACATCGG CTGCATCAGC CTGGCCTACC AGCCGGGTCT TTTTCCTTTG
ATGTCCGTTA TGGAACTGCC ACTCGGGCTT CCTTCAGCTG AAACCGCGAG TACCCTCATG
TGGGATCTCT TTACCAGCTA TTCCCCCGAA GAGTTCGACA AGGTCAAAGT CCTGACCATG
TTCACCTCAG CCCCCTCTAA TATCATGAGC AAAAAACCGC TTCCCGACCT GGCCAGCCTC
CAGGGCGTGG AATTGCGTGG CTCCGGTACC GCGTCCCGCA TCCTCGAGGC CCTGGGGGCC
ACCCCGGTCT CCATGCCCAT GCCGGACACC CCTCAGGCCT TGCAAAAAGG TGTTGTCCAG
GGCCTTTTCT CCTCGCTGGA AGTCCTTAAA GACCTCAATT TCGCCGCCTA TTGCCAGCAC
GTGACCCGTA CCGATCTCCA GGTCTATCCC TTTGCCGTGA TCATGAACAA ACGGGTTTGG
GAGGATTTGC CCGAGTCAAC CAAGAAGATT TTGAACGAGC TTGGACCGGA ACAGGCGGCC
TGGACCGGCC GGTATATGGA CAATCACGTC CAGAAAGCGC TCGCCTGGGC CCAAAAGGAA
CACGGTCTGA CCACCCACGC CTTGTCGGCA ACTGCATTGG AAGCCGTCCA ACCCAAGCTC
GACAAACTCA TTGAGGAATG GGTCCAGGAC GCCTCGGCCA AAGGACTTCC TGCCAAAGCG
GTTTTGCGCG ATATCAGCGC CCGTCTGGAC AAAGCTGAGG CGAAAGGGGA ATAG
 
Protein sequence
MQRRIALSLM LLMVFLGTTA VSTSATTLTY ANFPPAPTFP CVQMERWADE VEKRTDGALT 
IETFPGGTLL GAKQIWRGVQ YGQADIGCIS LAYQPGLFPL MSVMELPLGL PSAETASTLM
WDLFTSYSPE EFDKVKVLTM FTSAPSNIMS KKPLPDLASL QGVELRGSGT ASRILEALGA
TPVSMPMPDT PQALQKGVVQ GLFSSLEVLK DLNFAAYCQH VTRTDLQVYP FAVIMNKRVW
EDLPESTKKI LNELGPEQAA WTGRYMDNHV QKALAWAQKE HGLTTHALSA TALEAVQPKL
DKLIEEWVQD ASAKGLPAKA VLRDISARLD KAEAKGE