Gene Dret_1847 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_1847 
Symbol 
ID8419688 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp2117767 
End bp2119398 
Gene Length1632 bp 
Protein Length543 aa 
Translation table11 
GC content57% 
IMG OID645038431 
ProductNa+/solute symporter 
Protein accessionYP_003198709 
Protein GI258405967 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG0591] Na+/proline symporter 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.145671 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGATCA AAATCGTGTG CATGCTCGGC TACATCACCA TCATCGCCCT GCTCGGGTAC 
AAAGGCTGGA AGGAAACCCA ACAGGCCAAA GACTACCTCG TCGCCGGCCG GGCCATGCAT
CCCTTTATCA TGGCTTTTTC CTACGGCGCG ACATTTATTT CCACTTCGGC CATCATCGGT
TTCGGCGGGG CCGCTGGCCT GTTCGGCTTC CCCCTGCTCT GGCTCACCTT CCTGAATATT
TTTGTCGGCA TCTTTCTGGC CATGCTGTTT TTCGGTAAAC GGACCCGCCG CCTCGGATTG
ACCCTGAACA GCATGACCTT TCCCGAACTG TTGGGACGCC GCTACCAATC CGGCTTCATC
CAGGGATTCG CCGGTGCGAT CATTTTCCTC TTCATTCCGG TCTACTCCGC CGCTGTCCTG
ATCGGAATCT CCAGGATCAT CGAGATTTCT TTGCATATCC CCTACAATCT CGTTTTAATC
CTGGTGACCC TGATCCTGAC CGCCTACGTC ATCACCGGCG GTCTGAAGGC GGTCATGTAC
ACCGACGCGT TCCAGGGATG CATCATGTTT ATCATGATGC TCATTCTTTT AGTTTCCACC
TACTCCATCC TCGGCGGCAT CACTCCGGCC CACCAGGCCC TGACTGACAT GGCCCCGCTC
ATGCCCGCCA AACTCCAGAA AGGCGGCATG CTCGGTTGGA CCCAGGGCGC TGCCTTTGGT
TCGCCGCTGT GGTTGGTCAT CTACACGACC ATTGTCTACG GCGTGGGCAT CGGAGTCCTG
GCCCAGCCCC AATTGGCCGT GCGCTTTATG ACTGTACCTT CGGACAAGAG CCTCAACCGC
GCAGTACTCT ATGGCGGGGT CTTCATTCTT TTCATGACCG GCACGGCCTT TATTGTCGGC
GCGCTCTCAA ACGCTGTATT CTACCAATAT TTCGAGAAAA TTGCCATTGC AGTGGCCAAA
GGAGACATCG ACAAGATCAT TCCCGTCTAC ATCGAACGGA TCATGCCGGC CTGGTTCGCG
GCCCTGTTCC TGGTGGCCAT GTTCGCCGCA GCCATGTCGA CCCTGTCTTC CCAATACCAT
GTCGGGGGGA CCTCCCTGGG ACGGGACCTC TTTGAAAAAG GGCTGCGCCT GTCACGGCAA
CGTTCGATCA CCCTCAACCG AGCCGGGGTC TTCCTGACAA TCATCGCCAC CCTGATTTGG
GCCTGGGTCC TGCCGGCCTC GGTCATCGCC CGGGCTACGG CCTTCTTCTT CGGCCTCTGC
GCTGCGGCTT TTCTGCCGGC CTATGCCTGC GCCCTGTACT GGAAGAAAAC AACCAAGGCC
GGCGCCGTGA CCTCTATGGT TGGCGGATTT TTTATCTCCA TGTTCTGGCT GTTGTTCATT
CACGAAAAAG AGGCCGCAGC CATCGGTCTG TGCAAGGCGC TGACCGGGCA GACCACGCTG
GTCGCTAGCG CCGAGGCGGG GAGTTGGATC TGGCTGCTGC AATGGGTCGA CCCCAATGTC
GTCGCATTGC CAGTCTCCTT TGCGCTCATT ATCACCGTTA GCCTGACGAC TCGGGGCTAC
AGCACAGACC ACCTCCACCA TTGCTGGAGC AACTTCGTCA GCGCACAACA GACCAGGGAA
TGGAGAAATT GA
 
Protein sequence
MLIKIVCMLG YITIIALLGY KGWKETQQAK DYLVAGRAMH PFIMAFSYGA TFISTSAIIG 
FGGAAGLFGF PLLWLTFLNI FVGIFLAMLF FGKRTRRLGL TLNSMTFPEL LGRRYQSGFI
QGFAGAIIFL FIPVYSAAVL IGISRIIEIS LHIPYNLVLI LVTLILTAYV ITGGLKAVMY
TDAFQGCIMF IMMLILLVST YSILGGITPA HQALTDMAPL MPAKLQKGGM LGWTQGAAFG
SPLWLVIYTT IVYGVGIGVL AQPQLAVRFM TVPSDKSLNR AVLYGGVFIL FMTGTAFIVG
ALSNAVFYQY FEKIAIAVAK GDIDKIIPVY IERIMPAWFA ALFLVAMFAA AMSTLSSQYH
VGGTSLGRDL FEKGLRLSRQ RSITLNRAGV FLTIIATLIW AWVLPASVIA RATAFFFGLC
AAAFLPAYAC ALYWKKTTKA GAVTSMVGGF FISMFWLLFI HEKEAAAIGL CKALTGQTTL
VASAEAGSWI WLLQWVDPNV VALPVSFALI ITVSLTTRGY STDHLHHCWS NFVSAQQTRE
WRN