Gene Dret_1647 details


Gene Information

Locus tag: Dret_1647
Symbol:
ID: 8419478
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfohalobium retbaense DSM 5692
Kingdom: Bacteria
Replicon accession: NC_013223
Strand:
Start bp: 1897174
End bp: 1898148
Gene Length: 975 bp
Protein Length: 324 aa
Translation table: 11
GC content: 58%
IMG OID: 645038221
Product: putative sulfonate/nitrate transport system substrate-binding protein
Protein accession: YP_003198509
Protein GI: 258405767
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components
TIGRFAM ID:
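
The length fields above can be cross-checked against the start and end coordinates. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive (the record does not state the convention) and that the unspliced CDS ends in a single stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(1897174, 1898148)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints "975 324"
```

Both values agree with the Gene Length and Protein Length fields listed in the record.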


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGCCGATCC GCCGAATTGT TCTGTTTGTT GTCCTGCTCT GTCTTTTCAG CGCCGGTGGC 
GCGATGGCTG AAATGACACC GGTCCGCCTG GCCCACGCCA CCTGGGTGGG GTATGGACCG
TTGTATATCG CCCAGGAAAA CGGGTATTTC GAAGACGAAA ATATCGATAT GGACCTCTTT
ATCATCGAGG ACGAGGCCCA GTACGCTGCC GCGTTGGCCT CGGGCAATAT CGACGGCCTG
GGCAACGTCA TCGACCGTGA AGTCATCCAC TTCGCCAAAG GGACTTCGGA AGTGGTTGTC
TTTGCCATGG ATGAATCCGC CGGCGGGGAC GGGATCATCG CCACTGAGGA GATCCAGAGT
GTTGCGGATC TGGCCGGCAA GGACATCGGC CTCGACAAAT CCTCGACCTC CTATTTCTTT
TTCTTGAGTA TCCTGGATAA ATACGGTGTC GACGAGCAGT CCATGACCTT CCACGAGATG
GGCTCCTCCA ACGCTGGCGC GGCTTTTGTG GCCGGCAAGC TCGATGCCGC AGTGACCTGG
GAGCCTTGGC TCTCCAAGAG CGATCAGCGC GAGGGCGGCC ACGTGCTCAT TTCCAGTGCG
GAGATGCCCA AGACTATTGT CGATGTCGTG GTTCTCAACA GCGACTTCGT GGCCGAGCAC
CCTCAGGTCC CCGCCGGTCT GACCCGGTCC TGGTTCCGGG CCATTGACTG GTATCGAGCC
CATCCTGACA AGGGCAATGC CATTATGGCC GAGGCGATGG GGCTCAGTAC CGAAGAGATG
GCCAGCATGG CCGAAGGGGT CCGCTTTATC GGCGAAAAGG GGAACAAAAC GTTTTTTGAC
CCCTCGACCT CCGGCAATAT TTACGAGGTG GCAGACCGGG CCCTGGATTT CTGGCGCTCG
AAGGGCATTA TCCAATCGCC GGTCAAGGCC GAGGAATTGG TGACCTCCGA ATACGTCAAC
CAGGTTGCTG ACTAG
 
Protein sequence
MPIRRIVLFV VLLCLFSAGG AMAEMTPVRL AHATWVGYGP LYIAQENGYF EDENIDMDLF 
IIEDEAQYAA ALASGNIDGL GNVIDREVIH FAKGTSEVVV FAMDESAGGD GIIATEEIQS
VADLAGKDIG LDKSSTSYFF FLSILDKYGV DEQSMTFHEM GSSNAGAAFV AGKLDAAVTW
EPWLSKSDQR EGGHVLISSA EMPKTIVDVV VLNSDFVAEH PQVPAGLTRS WFRAIDWYRA
HPDKGNAIMA EAMGLSTEEM ASMAEGVRFI GEKGNKTFFD PSTSGNIYEV ADRALDFWRS
KGIIQSPVKA EELVTSEYVN QVAD
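
The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any computation such as a GC-content check. The following is a minimal Python sketch of that cleanup. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the demo uses only the first and last lines of the gene sequence rather than the full 975 bp.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence; upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First and last lines of the gene sequence as listed in the record.
first_line = "ATGCCGATCC GCCGAATTGT TCTGTTTGTT GTCCTGCTCT GTCTTTTCAG CGCCGGTGGC"
last_line = "CAGGTTGCTG ACTAG"

first_block = clean_sequence(first_line)
print(len(first_block), first_block[:3])   # block length and the start codon
print(clean_sequence(last_line)[-3:])      # final codon of the gene
print(round(gc_fraction(first_block), 2))  # GC fraction of the first line only
```

Applying `gc_fraction` to the full cleaned gene sequence, rather than one line, gives the value to compare with the GC content field in the record.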