Gene Dret_2120 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_2120 
Symbol 
ID8419970 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp2410275 
End bp2411981 
Gene Length1707 bp 
Protein Length568 aa 
Translation table11 
GC content60% 
IMG OID645038713 
ProductProtein of unknown function DUF2064 
Protein accessionYP_003198982 
Protein GI258406240 
COG category[S] Function unknown 
COG ID[COG3222] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.413688 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCACAGC TCACCTCTCC CCTTTTTTGC TGTCTGGCCC GGACCCCAGA GCCAGGACGC 
GTCAAGACCC GTCTGGCAGC GGATCTCGGC GAGGAAGCCA CGTTTTACGT CTATACCTCC
ATGCTCCGCG ATACTGTGGG CGCCCTGCGC GCTACCGGAC ACCGCTTCCA GCTCTGGTAT
ACCCCTTCAG GAAGCGAAGC GGCCTTGCTG AACCTCCTGG GCGGCCCCCT CGAATTGGTG
CCCCAGACCT CGGGAGACCT CGGCCAGCGC ATGAACGCGA TCTGCCAAGC CGCCTTCCTC
GGCGGAGCTG ACCGGGTGCT CCTTTTGGGC AGCGACATTC CCGAACTGAC CAGTTCTCAC
CTTCTCCAGG CGGCCGAACG TCTGCACCGA TCGGATGCCG TCATGGTCCC GACTGCCGAC
GGCGGCTATT GTCTCCTGGG ACTGAAGCGG GCAAGCTATT CTCCCGAACT CTTTACGGAT
ATCCCTTGGA GCACCGCAAA GGTAGCGGCC ACGACCCTGG AACGGTTGCG CCGGCTGCGT
TGCACTACGT CGCTTTTCCC GCCCCAGCAG GACATCGACA CCCTGGACGA CCTGGCCGCT
TTCTGGCACC GCTGTGAAGA CACACCGTTC CATACCTCAC GGACCATCCG CGAACTCGGG
CTCTTCCCGG AAAGTATCCA GCCGGCTCCC GCCCCCCTTG GGGGCAAACC GAAAAGGGAA
TTACCTCCCA TGAATTCTAC AGGTTCGGTC GAAACCACCA TTCAGACTAT CAAGGACTTC
ATCCAGAAAA ATCGTTGGCT CGCCCCGCCG CTGGAGGTCA CCTTCCTTGC CGCCGGGGAA
TACAATGCCA ACTACACAGT CCACAGCCCG GCTGGGACCT TTGTTTTCCG GATCAACCAC
GGCACGCAAC TCGGGCTGGA AAACCAGATC GAGTATGAAT ATGCCGTCCT GACCGCTCTG
GCCGATTCCG GAGTAACCCC GCGTCCCCAC GCAGTTGCGC CCGCACCGAG TGCCTTTCAC
GGGGGCGTTT TGCTCATGGA CTTCGTGCCC GGACGGCCCC TGCGCTACGA AACCGATCTG
GCCACGGCCG CCACCATTTT TGCCCGCGTC CATGCCCAGC CATGCTCCGA GCATCTCCTC
TCGCAGCCCC AGCCTGTGCT GGACATTGTC GCCGAATGCG AGGAACTTCT AGACCGCTAT
CCCGATCACC CCCTGCCGCA GGCCCAAAAA ACCATCCGGA ACTATCTGGA AACAATCCGC
CGTATGGGGG AAGATTCGCG CGATTTTTTC GCTGCTGAGG ATCAATGCAT CGTCAATACC
GAAGTCAATG CCAACAATTT CTGCATCAAT CCTGGCGGAC GCTCCTTCCT GGTCGACTGG
GAAAAAGCGG TCGTCTCCTC ACGCTACCAG GATCTCGGGC ACTTTCTGGT CCAAACAACA
ACGCGCTGGA AAACGGCTAC CGTACTCACC GAGGCCCAGA AACGGGACTT CCTTACCTGC
TACCGCGACG CCTCCGGCCT GGACTGCGAC CTGGAGGAAC TCCACTACGG CACCCGCCTT
CTGGAAAAGA CCATTTTGCT GCGGGCCCTG TCCTGGTGCG CCATGGCCTA TTACGAATAC
ACCCAGACCG ACCGGCCCAT CCAGAACCAA GACACCTTCG CCAAGATTAC TGAATACCTG
GATAATGTGG AATGGATTTT AGGGTAG
 
Protein sequence
MAQLTSPLFC CLARTPEPGR VKTRLAADLG EEATFYVYTS MLRDTVGALR ATGHRFQLWY 
TPSGSEAALL NLLGGPLELV PQTSGDLGQR MNAICQAAFL GGADRVLLLG SDIPELTSSH
LLQAAERLHR SDAVMVPTAD GGYCLLGLKR ASYSPELFTD IPWSTAKVAA TTLERLRRLR
CTTSLFPPQQ DIDTLDDLAA FWHRCEDTPF HTSRTIRELG LFPESIQPAP APLGGKPKRE
LPPMNSTGSV ETTIQTIKDF IQKNRWLAPP LEVTFLAAGE YNANYTVHSP AGTFVFRINH
GTQLGLENQI EYEYAVLTAL ADSGVTPRPH AVAPAPSAFH GGVLLMDFVP GRPLRYETDL
ATAATIFARV HAQPCSEHLL SQPQPVLDIV AECEELLDRY PDHPLPQAQK TIRNYLETIR
RMGEDSRDFF AAEDQCIVNT EVNANNFCIN PGGRSFLVDW EKAVVSSRYQ DLGHFLVQTT
TRWKTATVLT EAQKRDFLTC YRDASGLDCD LEELHYGTRL LEKTILLRAL SWCAMAYYEY
TQTDRPIQNQ DTFAKITEYL DNVEWILG