Gene Dret_1044 details


Gene Information

Locus tag: Dret_1044
Symbol:
ID: 8418867
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfohalobium retbaense DSM 5692
Kingdom: Bacteria
Replicon accession: NC_013223
Strand:
Start bp: 1233687
End bp: 1235411
Gene Length: 1725 bp
Protein Length: 574 aa
Translation table: 11
GC content: 55%
IMG OID: 645037614
Product: Na/Pi-cotransporter II-related protein
Protein accession: YP_003197910
Protein GI: 258405168
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1283] Na+/phosphate symporter
TIGRFAM ID: [TIGR00704] Na/Pi-cotransporter
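
The coordinate and length fields above can be cross-checked against each other. The sketch below is illustrative only: it assumes that the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. Neither convention is stated in the record itself.

```python
# Values copied from the record above.
start_bp = 1233687
end_bp = 1235411
reported_gene_length = 1725     # bp
reported_protein_length = 574   # aa

# Assumption: start and end are 1-based, inclusive coordinates.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1725 0 574
```

Under these assumptions the computed values agree with the reported gene length and protein length, and the gene length is a whole number of codons.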


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.53118
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCACAG GATCGGTAGG CATAGCCACA TTAGGCGGAT TAGGCCTTTT CCTTCTCGGT 
ATGCGGATGA TGACCGACGG CCTGCAGATG ACAGCTGGCC AGCGTATTCG CTCCATTTTA
AAAACCTTGT CCGCAAACCG TGTGGTTGGA TGTTGCACCG GCGCGATTGT CACGGCAATC
ATCCAATCGT CTTCGGCGAC CACGGTCATG CTGGTGGGAT TTGTCGGCGC TGGGCTGATG
ACCCTGTATC AAGCCGTTGG CGTCATCCTT GGGGCCAATA TCGGCACCAC CCTGACCGCC
CAGTTGATCG CTTTCAAACT GTCCAGTCTT TCCCTGCCTG CCATTGCCCT TGGTGTGGGA
CTCAAATTTT TCGCCACCAA AAAACGCTTG CGCTATATTG GAGAAATCAT TCTCGGATTT
GGCCTGCTCT TTTTCGGTTT GACAGTCATG AAGGACTCGC TGGCCCCGAT CAAAGACGAC
CCGGCCTTCA TCGATTTTTT TACCAAGTTC GATCCCAGTA CCATCGGCGG CCTTCTGCTC
TGTATCGCCG TCGGCACGGT CCTAACGATC ATCGTCCAAT CCTCCTCGGC GACCATCGGC
CTGACCATGA GCCTGGCTTG GCAGGGACTG ATCGGCTTTC CAGCCGCCAT GGCCCTTGTG
TTGGGCGAAA ATATCGGGAC CACGTTGACG GCCCAGATTT CGACCATTGG TTCCCGCAAT
GCCGATGCTC ACCAGGTGGC CAACGCCCAC ACCCTATTTA ACGTGCTTGG TGTCGTCCTC
ATGATCCTGA TCTTCCCGTG GTTCGTCGAC GGAGTGCGGA TCACCAGTGA ATTTCTGGGC
GCTGGACCGA TCGATGCGGT GGTCGATGGA GAAAAAGCTA ATATTTCACG GTATATTGCC
AACGGTCACA CGATATTCAA TGTCGCCAAC GCCCTTTTCT TTTTGGCTAT CATGCCGTGG
CTGGTGCGCG CAGCCCGGTT ATTCACCAAG CGGGACCCCG AAGAAGACGA TCTCTTTCGT
CTCCCGGAAT TCAGCGACCG CTTTCAGGAC ACCCCCATGG CAGCCGTGGC CGAAGCGCGT
CAGGAAGTCC ACGCCCTGTC CCGGGTGGTT CGGGCCGGCC TGAACAACGC CCTGGAGGGG
GTCTGGCAGA ATGACTCCAA AAAAATCGGG CGCTGGCAAC GCTTTGAAGA ACACATCAAC
ACCGCTCACC GGGAAATCTT GCGCTACCTT TCAGGGATTT TCCAAAGCGA GGTCTCAGAA
GACATCTCCC GGGAAGTCAG CGTCTTGATG CGTATCAGCT ACAACCTCGA GCGCATCGGC
AACGGCGTGG CCAATATCGC CAAGCACTTT GAGGATGTCA TGGAACAGGA TCTCCCCCTG
TCCTCCCAGG CCTGGAAAGA AGTCGACCAG ATGGCCGAAG AGGTGCGCGC CTTGATGAAG
TTGGTCTCTG ATTCGATCAT CAATCCCAGC GACGATCTGC TCGACAAGGC TCAGGACCTG
GAACAGCACA TCGATAGCAT GCGCGAAGAA ATGCGCCAGA ATCATATCCA GCGGATCCGG
GAAGAACGGT GCCAGGTCGA TGCCGGGATC GCCTTCATGG ATATCCTGAC CCGGTTCGAA
AAGATCGGCG ACTGGACTTA CAATATTGCC AAGGGACTCA AAGAAATAGA AAATAATGTC
CCCCCTGATA AAGCCACCTA TGACAATGGG ACGGCTACGC CGTAA
 
Protein sequence
MSTGSVGIAT LGGLGLFLLG MRMMTDGLQM TAGQRIRSIL KTLSANRVVG CCTGAIVTAI 
IQSSSATTVM LVGFVGAGLM TLYQAVGVIL GANIGTTLTA QLIAFKLSSL SLPAIALGVG
LKFFATKKRL RYIGEIILGF GLLFFGLTVM KDSLAPIKDD PAFIDFFTKF DPSTIGGLLL
CIAVGTVLTI IVQSSSATIG LTMSLAWQGL IGFPAAMALV LGENIGTTLT AQISTIGSRN
ADAHQVANAH TLFNVLGVVL MILIFPWFVD GVRITSEFLG AGPIDAVVDG EKANISRYIA
NGHTIFNVAN ALFFLAIMPW LVRAARLFTK RDPEEDDLFR LPEFSDRFQD TPMAAVAEAR
QEVHALSRVV RAGLNNALEG VWQNDSKKIG RWQRFEEHIN TAHREILRYL SGIFQSEVSE
DISREVSVLM RISYNLERIG NGVANIAKHF EDVMEQDLPL SSQAWKEVDQ MAEEVRALMK
LVSDSIINPS DDLLDKAQDL EQHIDSMREE MRQNHIQRIR EERCQVDAGI AFMDILTRFE
KIGDWTYNIA KGLKEIENNV PPDKATYDNG TATP
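
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It is a minimal example, not the pipeline used to produce this record: the function name `translate` and the constant names are hypothetical, and the codon table is built by hand. Translation table 11 assigns the same amino acids to codons as the standard code (it differs in the permitted start codons), so the standard assignments are used here; that is sufficient for this gene because it begins with ATG.

```python
# Codon-to-amino-acid assignments in the conventional TCAG ordering.
# Table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First and last lines of the gene sequence above. Every full line holds
# 60 bases (20 codons), so each line starts in frame.
first_line = "ATGTCCACAG GATCGGTAGG CATAGCCACA TTAGGCGGAT TAGGCCTTTT CCTTCTCGGT"
last_line = "CCCCCTGATA AAGCCACCTA TGACAATGGG ACGGCTACGC CGTAA"

print(translate(first_line))  # MSTGSVGIATLGGLGLFLLG
print(translate(last_line))   # PPDKATYDNGTATP
```

The two outputs match the first 20 and the last 14 residues of the protein sequence listed above, and the final codon of the gene (TAA) is a stop codon.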