Gene Dret_2546 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_2546 
Symbol 
ID8420403 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013224 
Strand
Start bp40444 
End bp41844 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content33% 
IMG OID645039143 
Producthypothetical protein 
Protein accessionYP_003199400 
Protein GI258406659 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones71 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones116 
Fosmid unclonability p-value0.666829 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAGAAA TAGGATTCTT AAGAAATGTG AAGAAAAGTT CTGAACAAAA GATCAATACA 
AGAAATAATC AAAAAGATAT ATTTAGTTTC GACTCTGGAC AAAAAATAAC CCTTGAAGAG
GCAAATCACA ATTGCCTAGT TCTAGGAGCT ACTGGAACTG GCAAAACGTC CTCTTTTGTA
TCTCCAAAAT TATATAACTT AATAAAAAAA GGTTACGGTG GTCTAATTTT TGACGTTAAG
AATAATTTCA CGAAAGACAT TCAATCGATG GCTAATATGT GCGGTCGAGA AGCTGATATA
ATAGAAGTAG GTAGCCATAA CACAGCAACT CCAATAAATT TTTTATTAGG TTTAGACCTT
GAATCTATGC TGCAAAGCTT AGAAGACATA ATCGCAGGCA GCATACCAGC TAGCGGTAAC
TGGGAATTTA TACAAATGGG CATTAACAAT GTTAAACAAA TTGCAAAAGT TTTATACTAT
TTCAAAAAAA ACAAACCAGA TGAACGATCA ATTTTACCAA ACTTAGATTT AATAGTTAAT
ATGCTAAACA ATGAGTTACT GGCAACTCAA ATTTTTACAC ACTGGAAAGA AAATATAAAC
ACCTCAGATC GTGACCAAGT AAGATTAAAA GATGAAATTG AAAACGACAT ATTTAATTTT
ATGATGCCTC CTGAGAATAC AGGTTCGAGA AATTCAACCG ATTGGTACAA ACAAACTACC
TGGCGACTTT CAGTGCCGCG AAAGGTCCTT GGTGAATTCA GCAATGGAAT TTTAAAGAAA
AAACTATCTT CAAAATTAAA CAAAACATTA AATTTTGAAG AATTAGTCCT AAAACAAAAT
AAAATAGTGA TTATAAGATT CTCTGCAATT AGCGGAACTC CAGGGGTAAG ATTCTCCAAA
ATGGCAAAAG AAGAGTTTTA TAAAACAATA TACCGCAGAT TTGAACTACA AAAAGAAACA
AAAATACAAT ATGTTTTTAA TATCATGGAT GAATTCCAAG ATATAGTTAA CCTGGACGAA
AATTCACTTT TTGATGACTT CACGTGGGTG TCCAAGTCAA GAGAGTTCAA AGTCATAAAC
ATAGCAGCGA CGCAAAGCAT GTCTAGTCTT TACAAGCTTG ATTATAAAGA AAGAGTTTAT
GGAATGCTAA ATAACTTTGG CATAAAAATC TTCATGCAGA ATGACGACCC AAATACAATA
GCATGGATTG AACAATGTTA TAGAAAAAAT TATAATTTAA CTGAATTAGG CCCTGCAGAG
GCCTTGCTGA TTAAGAAAAA GATGCCTGAA CGAAAAATTC AAATTGGTAT AGATTCTGCT
CAAAAAAGCC ATGATAAGCT GCAGGACATA CTAAGGAATA GCAGTGAATT TTGCAAAAAT
GAAAATAGGA TTACTATATG A
 
Protein sequence
MSEIGFLRNV KKSSEQKINT RNNQKDIFSF DSGQKITLEE ANHNCLVLGA TGTGKTSSFV 
SPKLYNLIKK GYGGLIFDVK NNFTKDIQSM ANMCGREADI IEVGSHNTAT PINFLLGLDL
ESMLQSLEDI IAGSIPASGN WEFIQMGINN VKQIAKVLYY FKKNKPDERS ILPNLDLIVN
MLNNELLATQ IFTHWKENIN TSDRDQVRLK DEIENDIFNF MMPPENTGSR NSTDWYKQTT
WRLSVPRKVL GEFSNGILKK KLSSKLNKTL NFEELVLKQN KIVIIRFSAI SGTPGVRFSK
MAKEEFYKTI YRRFELQKET KIQYVFNIMD EFQDIVNLDE NSLFDDFTWV SKSREFKVIN
IAATQSMSSL YKLDYKERVY GMLNNFGIKI FMQNDDPNTI AWIEQCYRKN YNLTELGPAE
ALLIKKKMPE RKIQIGIDSA QKSHDKLQDI LRNSSEFCKN ENRITI