Gene Dret_0180 details


Gene Information

Locus tag: Dret_0180
Symbol:
ID: 8417984
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfohalobium retbaense DSM 5692
Kingdom: Bacteria
Replicon accession: NC_013223
Strand:
Start bp: 228572
End bp: 230395
Gene length: 1824 bp
Protein length: 607 aa
Translation table: 11
GC content: 60%
IMG OID: 645036745
Product: extracellular solute-binding protein family 5
Protein accession: YP_003197060
Protein GI: 258404318
COG category: [E] Amino acid transport and metabolism
COG ID: [COG4166] ABC-type oligopeptide transport system, periplasmic component
TIGRFAM ID:
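
The start, end, gene length, and protein length fields in this record are consistent with one another, and a short sketch can confirm it. The sketch assumes 1-based inclusive coordinates (the record does not state its convention) and that the final codon is a stop codon not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(228572, 230395)
print(length, protein_length(length))  # prints: 1824 607
```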


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.0115715
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCTGGT TGTTGTACCG CCTTGCCATC CTGCTCATGG CGAGCACCTG CTGCGTCCTC 
CCTGTGCGCT CGGCCTTTGC CGCCCACGCC CTGGCCATGA ATGGTGAACC AAAATACAGC
GCTGACTTCA CCCATTTTGC CTACGCCAAT CCGCAGGCGC CCAAGGGCGG TCATGTCCGC
CGAGCCGCCA TCGGCACCTT TGACACCTTC AATCCCTATG TTCCCAAAGG CATGGCGCCG
CAGGGAATCG GCCTTATCTA CGACACCCTG TGTGTCCAGT CCATGGACGA ACCATTCACG
GCTTACGGCC TTCTGGCGCA AGACATAACC GTGCCCCCGG ATCGCTCCTG GGTGCGCTTT
ACGCTTCGCG AAAACGCCCG TTTCCACGAC GGCCACCCCG TGCACGCCGA GGATGTGGCC
TTCACTTTTG AGCTGCTCAT GGCAAAAGGG AGTCCAACAT ATGCAAACTA TTACGGGGAC
GTGAGCGAGG TCGAGGTTCT GGGCCCCAGA CAAATCCGCT TCACTTTCGA GCACGCCAAC
AACCGTGAAC TGCCGCTTAT CCTCGGCCAG CTGCCGGTGC TGCCCAAGCA TTTCTGGAAA
GGGAAGGATT TCACTTCAGC CGGATTGACA CGCCCCCTGG GCAGCGGCCC GTATACCATC
GAGACGTTCA AGCCCGGACA TTTTGTGCGC TACAAACGCG TTGCCTCCTA TTGGGGCGCT
GACCTTGCCG TGAACACCGG GCGCTACAAT TTTGATTCCC TGCAATACGA CTATTTTCGG
GACACCACTG TGGCCTTGGA GGCATTCAAG GCCGGTGAGT ACGATTTCCG ACAAGAAAAC
ACCGCCAAAC ACTGGGCCAC GGCCTACACC GGTCCAGCGG TGGACCAAGG GCATATTGTC
AAGGAACGCA TCCCCCATGA CCGACCTCAA GGTATGCAGG CCTTTATCTA CAACACCCGG
CGGCCTCTGT TCAGCGATCC CGAAGTCCGG CGGGCCTTGG CCTACGCCTT TGATTTCGAA
TGGACCAACT CCCAGCTGTT TTACGGCCAG TACACGCGGA CCAAAAGTTA TTTTTCCAAT
TCTGAACTCG CCGCACAGGG ACCGCCTGCC CCGGAGGAAC TGGCTCTGCT CGAACCGCAC
CGCAGCCACC TCCCCGAAGA AGCACTGACC TCGGCCTACA CCGTGCCCAG CACCGAAACA
ACGCCCCTGC GCCAAAACCT CCGGCAAGGT CTGCGCCTCT TGCGCCGGGC GGGGTGGACG
ATGCAAGACG GGCAACTGGT GCACAAGCAA ACGGGAAAAT CGTTTCAATT TACCATATTG
TTGCGTTCCC CGAGTTTCGA ACGAGTCGTG CTGCCCTTCA AACGCAACCT GGCCAAACTG
GGGATCACCA TGGAGATCCG CCGCGTCGAT GCCTCCCAAT ATGTCAACAG ATTGCGGAGC
TTCGATTTCG ACATGCTCAT AGCGACCCTG CCCCAATCCA ATTCACCAGG AAACGAGCAG
CGGTATTTCT GGACCTCCGA AGCCGCCTCC ACCCCCGGGA CCTACAACTA TATGGGCGTC
GACAACCCGG CCATCGACGC CCTGGTCGAA CAAGTCGTCA CCGCCCCGGA CAGGGAAAGC
CTCATCACCC GGTGCCGCGC CCTGGACCGC GCCCTGTTGT GGGGACACTA CGTCATTCCC
CAATGGCATC TCGGGGCCCT GCGCGTCGCG CGCTGGGATA TTTTCGGTCG CCCGGAAAAG
ATGCCCCGCT ACGGGCTCGA TTTTTTCACC TGGTGGGTCG ACCCGGACAA GGCCGCTGCC
GTAAGAGCCT TTCAGGGGCG CTAG
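
The gene sequence is printed in blocks of 10 bases separated by spaces, so it needs cleaning before any calculation. The following is a minimal sketch of stripping that layout and computing a GC fraction such as the one reported under GC content. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example input is only the first two blocks of the sequence, not the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; returns 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two 10-base blocks of the gene sequence, used only as a small example.
sample = clean_sequence("ATGTCCTGGT TGTTGTACCG")
print(len(sample), gc_fraction(sample))  # prints: 20 0.5
```

Passing the full sequence text (all lines concatenated) to `clean_sequence` should give a string whose length can be compared with the gene length field.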
 
Protein sequence
MSWLLYRLAI LLMASTCCVL PVRSAFAAHA LAMNGEPKYS ADFTHFAYAN PQAPKGGHVR 
RAAIGTFDTF NPYVPKGMAP QGIGLIYDTL CVQSMDEPFT AYGLLAQDIT VPPDRSWVRF
TLRENARFHD GHPVHAEDVA FTFELLMAKG SPTYANYYGD VSEVEVLGPR QIRFTFEHAN
NRELPLILGQ LPVLPKHFWK GKDFTSAGLT RPLGSGPYTI ETFKPGHFVR YKRVASYWGA
DLAVNTGRYN FDSLQYDYFR DTTVALEAFK AGEYDFRQEN TAKHWATAYT GPAVDQGHIV
KERIPHDRPQ GMQAFIYNTR RPLFSDPEVR RALAYAFDFE WTNSQLFYGQ YTRTKSYFSN
SELAAQGPPA PEELALLEPH RSHLPEEALT SAYTVPSTET TPLRQNLRQG LRLLRRAGWT
MQDGQLVHKQ TGKSFQFTIL LRSPSFERVV LPFKRNLAKL GITMEIRRVD ASQYVNRLRS
FDFDMLIATL PQSNSPGNEQ RYFWTSEAAS TPGTYNYMGV DNPAIDALVE QVVTAPDRES
LITRCRALDR ALLWGHYVIP QWHLGALRVA RWDIFGRPEK MPRYGLDFFT WWVDPDKAAA
VRAFQGR
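
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this with a hand-built codon table. Translation table 11 uses the same amino acid assignments as the standard genetic code and differs from it in the set of permitted start codons, so the standard assignments are used here. The `translate` function is an illustrative helper, not the tool that produced this record, and it does not apply any special handling of alternative start codons.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence.
print(translate("ATGTCCTGGT TGTTGTACCG CCTTGCCATC"))  # prints: MSWLLYRLAI
```

The printed residues match the first 10-residue block of the protein sequence above.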