Gene Hlac_0832 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_0832 
Symbol 
ID7400798 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp826167 
End bp827321 
Gene Length1155 bp 
Protein Length384 aa 
Translation table11 
GC content67% 
IMG OID643707898 
Productaspartate aminotransferase 
Protein accessionYP_002565501 
Protein GI222479264 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0436] Aspartate/tyrosine/aromatic aminotransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0198108 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGACG CATACGACTT TTCAGAGCGG ATCGGACGCG TAGAACCCAG CGCCACGCTG 
GCGATATCGA ACCTCGCGGC GGAGAAAGAG GCGGAGGGCG CTGACATCGT CGACCTGTCG
GTCGGCGAGC CCGACTTCGA TACTCCAGCG AACGTCGTCG AGGCAGGCAA GGATGCCCTC
GACGCCGGCC ACACGGGGTA CACCTCCTCG AACGGGATTC CCCAACTCAA AGAGGCGATC
GCGGCGAAGC TCCGCGACGA CGGACTCGAC GCCGACGCCG ACGAAGTGAT CGTCACCCCC
GGCGGCAAGC AGGCGCTGTA CGAGACGTTC CAGACCCTGA TCGACGACGG CGACGAGGTC
GTCCTGCTGG ATCCGGCGTG GGTCTCCTAC GAGGCGATGG CAAAGCTCGC CGGCGCCGAC
CTCTCGCGGG TCGATCTAGC CCCGCACGGG TTCCAGCTGG AGCCCGCGCT CGATGCGCTC
GCGGAGACGG TTTCTGACGA CACCAAACTG CTCGTCGTCA ACTCCCCGTC GAACCCGACA
GGTGCCGTCT TCTCGGAGAC CGCCTTGGAG GGCGTCCGCG ACCTCGCGGT CGAACACGAC
ATCGCCGTGA TCTCAGACGA GATCTACGAG CAGATCACCT ACGACGCAGA GCACGTCTCG
CTCGCGAGCC TTGACGGAAT GGCCGACCGG ACGATCACGA TCAACGGTTT CTCGAAGGCG
TACTCGATGA CCGGCTGGCG GCTCGGCTAC CTCCACGCGA CCGACGAGTT CGTCGGGCAG
GCCGGCAAGC TCCACTCGCA CTCCGTCTCG TGTGCGGTCA ACTTCGTCCA ACGCGCGGGC
GTCGAGGCCC TCGAAAACAC CGACGAGTCG GTCACGGAGA TGCGCGACGC GTTCCGCGAC
CGCCGCGACC TCCTCGTCGA CCTGTTCGAC GAGCACGGCG TCGACGTCGA CGTGGGCGAC
GGCGCCTTCT ACATGATGAT CCCGGTCGAC GAGGACGATC AGGCGTGGTG TGAGGCCGCC
ATCGAGGAGG CGTCGGTCGC CTGCGTCCCC GGGAGCGCGT TCAACGCGCC GGGCCACGCC
CGGATCTCGT ACGCTGCCAG CGAGGGGCGG CTTCGCGAGG CGGTCGACCG ACTCGTCTCG
AACGACCTGC TGTAG
 
Protein sequence
MSDAYDFSER IGRVEPSATL AISNLAAEKE AEGADIVDLS VGEPDFDTPA NVVEAGKDAL 
DAGHTGYTSS NGIPQLKEAI AAKLRDDGLD ADADEVIVTP GGKQALYETF QTLIDDGDEV
VLLDPAWVSY EAMAKLAGAD LSRVDLAPHG FQLEPALDAL AETVSDDTKL LVVNSPSNPT
GAVFSETALE GVRDLAVEHD IAVISDEIYE QITYDAEHVS LASLDGMADR TITINGFSKA
YSMTGWRLGY LHATDEFVGQ AGKLHSHSVS CAVNFVQRAG VEALENTDES VTEMRDAFRD
RRDLLVDLFD EHGVDVDVGD GAFYMMIPVD EDDQAWCEAA IEEASVACVP GSAFNAPGHA
RISYAASEGR LREAVDRLVS NDLL