Gene Dret_0079 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_0079 
Symbol 
ID8417883 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp105309 
End bp106487 
Gene Length1179 bp 
Protein Length392 aa 
Translation table11 
GC content63% 
IMG OID645036644 
Productaminotransferase class I and II 
Protein accessionYP_003196959 
Protein GI258404217 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0436] Aspartate/tyrosine/aromatic aminotransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCATCG CCACACGCCT GTCTCAGATC AAACCTTCCC CCACCCTCGG CCTCAACGCC 
CAAATCCAGG AGATGAAAGC GGAGGGCCGT TCCGTTGTCA GCCTCGCCGT GGGCGAACCG
GACTTTCCCA CGCCGGCGCA CGTTTGCCAG GCTGCCAAGG CCGCTGTGGA CGAGGGATTT
ACAGGCTATA CCGCGGTGCC AGGCATGCCC TCGCTGCGCC AGGCAGTGGC TGGATACTAC
CAGACCCAAT ACCAGGTGCA GGCCCAGGCG GACAACACCA TCGTCAGCAA CGGCGGCAAG
CAGGTCCTCT ACAATCTCCT CCAGGCCTTG GTCGAACCGG GTGACGAAGT CCTGCTCCCG
GTCCCCTACT GGGTGAGTTA TCCCCCGCTG GTCGCTCTTG CCGGCGGCAT CGTCCAGGAG
GTCCCCAGTA CAGCCGCGGA CGATTTCCTG GTCACCGCAG ACCAATTGGA GGCCTGCCGC
ACCCCGCAAA GCCGTTTCCT CATCCTCAAC ACCCCGTCGA ACCCTACCGG CAGCCACTAC
AGCCAACAGC AGCTCGACGC GCTGATCGAA TGGGCGCTCA GTCACGATAT CTTCGTCATT
TGCGACGAAA TCTACGACCA GCTCATCTAT CCACCAGCCG GACCGGCTAC TGCAAGCCGC
TGGTGGCAGC AGGCCCCGGA TAAAATCGCT GTGGTCAACG GGTTGTCCAA AAGTTTTGCC
ATGACCGGGT GGCGGATCGG TTTCGGGTTG GCTCACGCCG ATTTGATCAA GGCCATGACC
AAACTGCAAG GCCAATCCAC ATCCAATATC TGCTCCATCG CCCAACGAGC CGCAGAAGCC
GCCCTGACTG GCACCTGGGA CCAGGTGCGG ACCATGCGCG ACTCCTTTGC CAAACGGCGC
GATCTGGCCC TGGAATATAT TGCCGCCTGG CCGGGAACCA CGTGCCCCCG CCCGGCCGGG
GCCTTTTACC TCTTCCCCCA CCTTGGCGAC TGCCTGGAAG GGTCCGCTAT CCCGGATTCG
GCGGCGCTGG CTTCACACAT CCTGGAACAG ACCGGCATTG CCGTGGTGCC CGGCAGCGCG
TTTGGCGACG ACGCCTGCCT GCGCCTGTCC TATGCCGTCG ACGAAACCAC ATTGACCGAC
GCCCTGGAGC GCATCGGAAA GGTCCTGCAG CAACTCTAG
 
Protein sequence
MRIATRLSQI KPSPTLGLNA QIQEMKAEGR SVVSLAVGEP DFPTPAHVCQ AAKAAVDEGF 
TGYTAVPGMP SLRQAVAGYY QTQYQVQAQA DNTIVSNGGK QVLYNLLQAL VEPGDEVLLP
VPYWVSYPPL VALAGGIVQE VPSTAADDFL VTADQLEACR TPQSRFLILN TPSNPTGSHY
SQQQLDALIE WALSHDIFVI CDEIYDQLIY PPAGPATASR WWQQAPDKIA VVNGLSKSFA
MTGWRIGFGL AHADLIKAMT KLQGQSTSNI CSIAQRAAEA ALTGTWDQVR TMRDSFAKRR
DLALEYIAAW PGTTCPRPAG AFYLFPHLGD CLEGSAIPDS AALASHILEQ TGIAVVPGSA
FGDDACLRLS YAVDETTLTD ALERIGKVLQ QL