Gene Dret_1137 details

Gene Information

Locus tag: Dret_1137
Symbol:
ID: 8418964
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfohalobium retbaense DSM 5692
Kingdom: Bacteria
Replicon accession: NC_013223
Strand:
Start bp: 1333395
End bp: 1334576
Gene Length: 1182 bp
Protein Length: 393 aa
Translation table: 11
GC content: 60%
IMG OID: 645037711
Product: Extracellular ligand-binding receptor
Protein accession: YP_003198003
Protein GI: 258405261
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component
TIGRFAM ID:
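
The coordinate and length fields in this record are mutually consistent. A minimal Python sketch of that check follows. It assumes the coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and do not come from the source database.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1333395
end_bp = 1334576

# Assumption: coordinates are 1-based and inclusive on the replicon.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is unspliced and ends with one stop codon,
# which is not counted in the reported protein length.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, protein_length, remainder)  # 1182 393 0
```

A remainder of 0 indicates the gene length is a whole number of codons, as expected for an intact CDS.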


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.0165472
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value: 0.816566
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCCAAT TGCCCAATTT TTCCCGCTGT GCCCTTTGTC TCGGCGCGCT TTTTCTCGCG 
CTGGCCCTGG CGGCCTGTCA GGACTCCACA TCGTCCCAGG ACCAGGCCCA GAACCAGGAG
GATCAGACCC AGGCCCTGAA GATAGGAGCC GTTTTACGTC TCTCCAAAGG CGCCTCCGAT
GGATTGCCGG CCCGCCACGG TATTGAAATC GCTGTGCAGG AAATCAACTC CCAGGGCGGC
ATCGACGGCC GGCCCCTGGA AGTTGTCTAC TACGACAGCA AGGACGACGC GACGACGGCT
GTGAACGCGG TCCAGAAACT CATTTCCGTG GACGAGGTCG AGGCCATCAT CGGTCCGATG
ATGAGCGGCA ATGTCCTGGC CGCTGCCCCA CTGTGTCAGC GCAACAATGT GGTTTTGCTC
ACTCCCACCG GCACCTCGCC GCGCATCTCC GAGGCCGGAT CGTATACCTT CCGCCTCTGT
TCCCGCATCG ACGATCAGGC CCGGGCCCTG GTCCAGGAAG CCCTGAGCCG AGTTGGAGCG
GACCCGACCG TGACTATCCT CTACAGCAAC GAACCCTACG GCAAGGGGTC CAAAGAACTC
TTCACGCGCT ACCTTGCCGA GCAGGACATC ACTCCGGCCA CTGTGGAATC GTTCCAGCGC
GGCGACAAAG ACTTCCAGGC CCAGCTGACC AAGATCAAAC AACTCAATCC GGACATCCTC
TTTGTCCCCG GATATCTCCA GGAAACCGCT CCGCTGATCA GCCAGGCCCG GCAGATGGGG
ATCAATGCCC TCAGCGTCGG TGTTTTCGGT GATATGGCCC CGAAATATAT TGAACTAGCC
GGCAAGGCCG CTGAAGGCCA CCTCATCGCT GGTGAATACA ATAAGCACAA GGACACCGAA
CACAACCAGG ACTTTGTCAA CGCCTATGAG GCGCTTCTGG CGGATCAGCC CAAGGCCCCG
GAAAACATCA TGTTCGCGGC TTTGACCTAC GACGCGGTCC ATCTTTTGCG GCAGTCCTTC
AGCACCGGGG CGACCACGGG CAGCGCCATC CAGTCCTTCC TGGACGAGTT GGAGGCCTTT
GACGGCATCA CCGGGACACT TTCCTTCGAT GCTAACGGGG ACGTCCAAAA AGGCGGGGTC
TACCTCTTTG AGGTCCAGAA CGGGACCTAC CGTAAACTGT AA
 
Protein sequence
MPQLPNFSRC ALCLGALFLA LALAACQDST SSQDQAQNQE DQTQALKIGA VLRLSKGASD 
GLPARHGIEI AVQEINSQGG IDGRPLEVVY YDSKDDATTA VNAVQKLISV DEVEAIIGPM
MSGNVLAAAP LCQRNNVVLL TPTGTSPRIS EAGSYTFRLC SRIDDQARAL VQEALSRVGA
DPTVTILYSN EPYGKGSKEL FTRYLAEQDI TPATVESFQR GDKDFQAQLT KIKQLNPDIL
FVPGYLQETA PLISQARQMG INALSVGVFG DMAPKYIELA GKAAEGHLIA GEYNKHKDTE
HNQDFVNAYE ALLADQPKAP ENIMFAALTY DAVHLLRQSF STGATTGSAI QSFLDELEAF
DGITGTLSFD ANGDVQKGGV YLFEVQNGTY RKL
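
The relationship between the gene sequence and the protein sequence can be spot-checked with a short Python sketch. It is not the database's own procedure. It assumes the gene sequence is given 5' to 3' in the coding frame, and it relies on the fact that translation table 11 assigns the same amino acids as the standard code (the two differ only in permitted start codons). The helper names `clean` and `translate` are hypothetical.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G; third base varies fastest).
# Translation table 11 uses the same amino-acid assignments as the standard
# code; it differs only in which codons may act as start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Drop the spaces and line breaks used to block the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate an in-frame sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 12 nucleotides of the gene sequence in this record.
first_30 = "ATGCCCCAAT TGCCCAATTT TTCCCGCTGT"
last_12 = "CGTAAACTGT AA"

print(translate(first_30))  # MPQLPNFSRC
print(translate(last_12))   # RKL
```

The two outputs match the first ten and last three residues of the protein sequence listed in this record. Passing the full gene sequence to `translate` would extend the same check to all 393 residues.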