Gene Hlac_1393 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_1393 
Symbol 
ID7400712 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp1401302 
End bp1402423 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content68% 
IMG OID643708454 
Productaminotransferase class I and II 
Protein accessionYP_002566051 
Protein GI222479814 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0436] Aspartate/tyrosine/aromatic aminotransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGAACT TCTCCGACAG GGTCGAACGG ATCTCCATCA GCGGGATCCG CGAGGTGTTC 
GAGGCGGCCG GCGACGACGC GATCAACCTC GGGCTCGGCC AGCCCGACTT CCCGACGCCG
GACCACGCGC GGCAGGCGGC GATCGATGCC ATCGAAGCCG GGAAGGCCGA CGCCTACACC
GAAAATAAGG GCACTCGGTC GCTTCGCGAG GCGATCGCCG AAAAACACCG GACGGATCAG
GGGATCGACC TCGACCCGAG TAACGTGATC GCCACTGCGG GCGGCAGCGA GGCGCTCCAC
ATCGCCTTAG AGGCGCACGT CGACGCCGGT GACGAGGTGT TGATTCCGGA CCCGGGGTTC
GTCTCCTACG ACGCGTTGAC GAAGCTGACC GGGGGCGAAC CGGTCCCGGT CCCGCTGCGC
GACGACCTCA CGATCGACCC CGCCGCGATC GAGGCGGCGA TCACCGACGA CACCGTCGCG
TTCGTGGTGA ACTCGCCCGG GAATCCCACC GGCGCGGTCT CCTCCGAGGA AGACGTGCGC
GAGTTCGCGC GGATCGCCGA CGAGCACGAC GTGCTCTGTA TCTCCGACGA GGTGTACGAG
TACACCGTCT TCGAGGGGGA ACACTACTCG CCGATGGAGT TCACCGAGAC CGACAACGTC
GTCGTGATCA ACTCGGCCTC GAAGCTGTTC TCGATGACTG GCTGGCGGCT CGGATGGGTG
TACGGCTCCG AGGAGCGCGT GGAGCGCATG CTGCGCGTCC ACCAGTACGC GCAGGCTTGC
GCGTCGGCGC CTGCCCAGTA CGCCGCTGAG GCTGCGCTGC GGGGCGATCA CGGGGTCGTC
GACGAGATGA CCGCCTCCTT CGAGCGCCGC CGCGACCTCC TCCTCGACGG CTTCGACGAA
ATCGGCATCG ACTGCCCGAC GCCGCAGGGA GCGTTCTACG CGATGCCGCG GGTGCCCGAG
GGGTTCGTCG ACGAGTGTCT TGACCGCGGC GTGGTCGTCG TTCCCGGCGA GGCGTTCGGC
GAGCACGGGC GCGGCCACGC TCGGATCTCG TACGCGACCG ACGAAAGCGA GCTCCGCGAG
GCGCTCGACG TGATGGCGGA CGCCTACGAG GCGGCGAAGT AG
 
Protein sequence
MPNFSDRVER ISISGIREVF EAAGDDAINL GLGQPDFPTP DHARQAAIDA IEAGKADAYT 
ENKGTRSLRE AIAEKHRTDQ GIDLDPSNVI ATAGGSEALH IALEAHVDAG DEVLIPDPGF
VSYDALTKLT GGEPVPVPLR DDLTIDPAAI EAAITDDTVA FVVNSPGNPT GAVSSEEDVR
EFARIADEHD VLCISDEVYE YTVFEGEHYS PMEFTETDNV VVINSASKLF SMTGWRLGWV
YGSEERVERM LRVHQYAQAC ASAPAQYAAE AALRGDHGVV DEMTASFERR RDLLLDGFDE
IGIDCPTPQG AFYAMPRVPE GFVDECLDRG VVVVPGEAFG EHGRGHARIS YATDESELRE
ALDVMADAYE AAK