Gene Dret_2027 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_2027 
Symbol 
ID8419872 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp2324906 
End bp2326096 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content59% 
IMG OID645038615 
Productaspartate aminotransferase 
Protein accessionYP_003198889 
Protein GI258406147 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0436] Aspartate/tyrosine/aromatic aminotransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000000277317 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.0077207 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCGATGT TGAGCCAACA GGTGCAGACC TATTTGGAAA GTTCCTCATG GATCCGCCGG 
ATGTTCGAGG CCGGGCGGGA GATGAAGGCC AAATACGGCG AAGACCAGGT GTATGATTTC
AGTTTGGGCA ATCCGGACCT CCCCCCGCCT GCAGCCGTGA CCAAAGGGTT GCAGCGCCTG
GCTGAACAGG CGCAGTCGTC CTATGCCTTC GGGTATATGC CCAATGCCGG GTATCCTGAT
GTCCGGCAAG CCCTGGCCCA GCGCTTGTCC CGCGAGCAGC AGGTCGCGCT TTCCGAGCAG
GAACTGTTGT TGAGCTGTGG GGCCGCCGGA GGGCTGAATG TGTTGTTCCG GGCGATCCTG
GAGCCCGGGG ACGAAGTGGT CTGTCCCGCG CCCTTTTTTG TGGAGTATAC CTTTTACGTC
CAAAACCACG GCGGCGTCCT GCGCACGGTC CCCTCACGCG AACCGGATTT CGCTCTGGAT
ATTGAAGGCA TTGAGGCGGC TCTCTCGGAG AAGACCCGGA TTGTGCTCAT CAATTCCCCC
AATAACCCGA CCGGGCGGGT CTATTCTGCA TCGGAGCTGC GCCAACTCGC GGCTGTCCTC
GACGCAGCAA GCCGCAAGTA CGGCCGGCCC ATCCTGCTTG TGTCCGACGA GCCGTACCGT
TTCTTGACTT TTGACGGAAC GCAGGTGCCC CCTGTTTTGC CGGCCTACCA ACACAGTGTG
GTGGTCAGTT CCTTTTCCAA GAATCTGTCC CTGGCCGGAG AACGGGTCGG GTATTTGGCT
TTGAATCCGG AGATGCCTGG AAAAGAGGAA CTTATGGACG GCTTGGTATT GACCAACCGC
ATCCTCGGTT TTGTCAATGC TCCAGCCCTT GGCCAGCGTC TTGTCGGCTA TTGCCTGGAG
GCCTCGGTGG ATCTCGAGGT CTATGAAAAA CGACGGGCGG CCATGGTCGA GGCCCTTGAC
GCCGGGGGCT ATACCTATGC CGTGCCCCAA GGGGCGTTTT ATTTCTTTGT CCAGGCCCCA
GGCGGCGACG ATGTCGCCTT TGTCCAGACC CTCCAGGAAG AGCGGGTTTT GGCTGTCCCA
GGTTCCGGAT TCGGTTTTCC CGGCTATTTC CGGTTGTCTT TTTGCGTTCC TGAAACGGTT
ATTCGCAATG GCGCCGCCTC CCTGGCCCGG GCCCGTCAAC GGTGGCAATG A
 
Protein sequence
MSMLSQQVQT YLESSSWIRR MFEAGREMKA KYGEDQVYDF SLGNPDLPPP AAVTKGLQRL 
AEQAQSSYAF GYMPNAGYPD VRQALAQRLS REQQVALSEQ ELLLSCGAAG GLNVLFRAIL
EPGDEVVCPA PFFVEYTFYV QNHGGVLRTV PSREPDFALD IEGIEAALSE KTRIVLINSP
NNPTGRVYSA SELRQLAAVL DAASRKYGRP ILLVSDEPYR FLTFDGTQVP PVLPAYQHSV
VVSSFSKNLS LAGERVGYLA LNPEMPGKEE LMDGLVLTNR ILGFVNAPAL GQRLVGYCLE
ASVDLEVYEK RRAAMVEALD AGGYTYAVPQ GAFYFFVQAP GGDDVAFVQT LQEERVLAVP
GSGFGFPGYF RLSFCVPETV IRNGAASLAR ARQRWQ