Gene Dret_2010 details


Gene Information

Locus tag: Dret_2010
Symbol: argJ
ID: 8419855
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfohalobium retbaense DSM 5692
Kingdom: Bacteria
Replicon accession: NC_013223
Strand:
Start bp: 2308394
End bp: 2309572
Gene Length: 1179 bp
Protein Length: 392 aa
Translation table: 11
GC content: 63%
IMG OID: 645038598
Product: bifunctional ornithine acetyltransferase/N-acetylglutamate synthase protein
Protein accession: YP_003198872
Protein GI: 258406130
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1364] N-acetylglutamate synthase (N-acetylornithine aminotransferase)
TIGRFAM ID: [TIGR00120] glutamate N-acetyltransferase/amino-acid acetyltransferase
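
The gene and protein lengths listed above follow from the start and end coordinates. A minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon (the function names are illustrative and not part of the database):

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp):
    """Number of residues, assuming one terminal stop codon that is not counted."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(2308394, 2309572)
print(length, protein_length_aa(length))
```

With the coordinates of this record the sketch gives 1179 bp and 392 aa, matching the Gene Length and Protein Length fields.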


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value: 0.870192
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCCAGG TCCCGCAAGG GTTTTATTTT GCCGCCCACG GCGCGGGGTT TAAATATACC 
GGACGCGATG ACGTGGCCTG TCTCGCCAGC GACCGCCCGG CAGCCGCAGC CGGTGTCTTT
ACGCAGAATG TCTTCCAGGC CGCCCCGGTC CTTGTGGCCC GGGAGCAACT GCAAAAGCGC
CAAACAGCGC AAGCCGTTTT GGTCAATGCT GGTCAGGCCA ACGCCTGTAC CGGGCAGACC
GGCGTGGACA ATTGCCGGCA GACGCTGCGC TTGGCGGGTG AGGCCCTGGA GATGGATTCC
GCAGACATCC TTCCCGCCTC GACGGGCGTG ATCGGGGACC AATTGGTGAT GGATCTCTGG
CCCCAGGCTG TTCGCGGGTT GCGTACTTCC GCTGGCCAAG CGGGACCGCT GGACGCGGCT
AAAGCCATTG TGACCACTGA CAGCTTTCCC AAGATTGCCT GGCGCAGCGT GACCGTCGGG
GGGAGAGAGA TCCGTATTCT GGGCCTGGCC AAGGGGGCGG GCATGATCTG CCCGAATATG
GCCACCATGC TTGGCTTTGT CCTTTGCGAC GGGGCGATTG AGCCCCAGCT GTGGCAGACG
ATGCTGGCCC GGGCCGCAGA CCGGAGTTTC AACCGTATCA CGGTCGACGG CGACACGAGC
ACCAACGATT GTGTGCTGGG GTTGGCCAAC GGCGCCAGCG ATGTGACGGT GGACACTGCG
ACGCCCGGCT TTCAGGAGGC CTTGGAGGCG GTCTGCCGTG ATCTGGCCTA TTTCATCGTC
CAGGACGCTG AGGGTGGCAC CAAGGTGGTT CGTATCCATG TCCGGGGGGC GTCGTCCGAC
CAGGACGCCG AGCAGGCGGC ACGGACTATC GCCCATTCGC CGCTGGTCAA AACAGCGCTT
TTCGGGCGGG ATCCGAATTG GGGGCGCATC GTGGCGGCCC TGGGACGGTC CGGCGCGACT
GTTGCCCCGG AATCGACGTC GGTCTGGATG GGCGGATTGC CGCTCTTCGA AAACGGGGTC
CCGGTGGATG CGGATTTGGA CAGCCTTTTG GCACCGTATT TGGACCGCAG GGAAGTGCCA
CTGGAAATCG ACCTCGGTCT TGGGGACAAG AGCGCTGAAG TCCTGACTTC GGATTTGACC
CTGGACTACG TCCGCATCAA CGCCGAATAT CGGACGTAA
 
Protein sequence
MIQVPQGFYF AAHGAGFKYT GRDDVACLAS DRPAAAAGVF TQNVFQAAPV LVAREQLQKR 
QTAQAVLVNA GQANACTGQT GVDNCRQTLR LAGEALEMDS ADILPASTGV IGDQLVMDLW
PQAVRGLRTS AGQAGPLDAA KAIVTTDSFP KIAWRSVTVG GREIRILGLA KGAGMICPNM
ATMLGFVLCD GAIEPQLWQT MLARAADRSF NRITVDGDTS TNDCVLGLAN GASDVTVDTA
TPGFQEALEA VCRDLAYFIV QDAEGGTKVV RIHVRGASSD QDAEQAARTI AHSPLVKTAL
FGRDPNWGRI VAALGRSGAT VAPESTSVWM GGLPLFENGV PVDADLDSLL APYLDRREVP
LEIDLGLGDK SAEVLTSDLT LDYVRINAEY RT
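
The sequences above are printed in blocks of ten characters, so the spaces and line breaks have to be removed before any analysis. The following is a minimal sketch of how the gene sequence could be cleaned, checked for GC content, and translated. It is an illustration under stated assumptions, not the database's own method: the helper names are hypothetical, the codon assignments are those of the standard genetic code (which translation table 11 shares for sense and stop codons), and alternative start codons are not handled, which is acceptable here because the gene begins with ATG.

```python
# Codon table built from the conventional TCAG ordering of the genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text):
    """Remove the spaces and line breaks used for display; upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean_sequence(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    dna = clean_sequence(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First line segment of the gene sequence shown above.
print(translate("ATGATCCAGG TCCCGCAAGG GTTTTATTTT"))  # MIQVPQGFYF
```

Applied to the first thirty bases of the gene sequence, the sketch returns MIQVPQGFYF, the first ten residues of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the GC content and Protein sequence fields to be checked in the same way.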