Gene Hlac_0461 details


Gene Information

Locus tag: Hlac_0461
Symbol:
ID: 7400341
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 478779
End bp: 479714
Gene Length: 936 bp
Protein Length: 311 aa
Translation table: 11
GC content: 71%
IMG OID: 643707525
Product: aminotransferase class IV
Protein accession: YP_002565133
Protein GI: 222478896
COG category: [E] Amino acid transport and metabolism
              [H] Coenzyme transport and metabolism
COG ID: [COG0115] Branched-chain amino acid aminotransferase/4-amino-4-deoxychorismate lyase
TIGRFAM ID: [TIGR01121] D-amino acid aminotransferase
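
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS is complete and ends in a single stop codon; the function names are hypothetical and not part of any database tool.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 478779
end_bp = 479714
gene_length_bp = 936
protein_length_aa = 311


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one stop codon (assumes a complete, unspliced CDS)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(start_bp, end_bp)
computed_protein_length = expected_protein_length(computed_gene_length)

# Both comparisons print True for this record.
print(computed_gene_length == gene_length_bp)
print(computed_protein_length == protein_length_aa)
```

Under those assumptions the record is internally consistent: the span is 936 bp, which is 312 codons, or 311 amino acids once the stop codon is excluded.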


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value: 0.893728
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGTGAGG ACCCGGGCGG GGCGGACGAC GCGGCCGAGG AACTCCGCTA CCACGTCGAC 
GGCGAGATCG TCCCCGCCTC GCAGGCGACC GTCTCCGTCG AGGACCGCGG GTTCGCCTAC
GGCGACGCCG CCTTCGAGAC CCTGCGCGCG TACGGCGGCG AGGTGTTCCG GTGGGACGAC
CACGCCGCGC GACTCGCGGA CACCTGCGAG ACGCTCGGGC TCGACCACGG GCTCTCCGAG
ATCGACCTGA AAGCCCGGAT CGACGAGACG CTCGCCGCGA ACGACCTCGC TGAGGCGTAC
GTGAAGCTCT CTATCACGCG TGGGGTCCAG CCCGGCACGC TCGACCCGCG GCCCGAGGTC
GACCCCACCG TCGTCGTGAT CGCGAAGCCC CTCGCCCGCG GCGGCGTCGA CTCGACACCG
GTCCACGACG GCCCCGCCGC GCTCCAGACG ACGAAGACCC GAAAGCCCTC CTCGCGGGCG
CTCCCGGCCG ACGCGAAGAC GCACAACTAC CTCAACGGAA TCCTCGCACG GCTGGAACTG
CGCGTGACCG GTGCCGACGA GGCGCTGATG CTCGATCCGG ACGGCAACGT CGCCGAGGGG
GCGACCGCGA ACCTCTTCTT CGCCGACGGC ACCGCACTCA AGACGCCCTC GCTCGACGGG
CCGATCCTCC CGGGCGTGAC GCGTCGCACC GTGATCGAGA TCGCGGAGGC GGAGGGGATT
CCGGTCGAGG AGGGGACGTA CGCGCCGGAC GCGGTGCGCG AGGCGGACGA GGTTTTTCTC
ACCAACTCGA CGTGGGAGAT CCGGCCGGTC GAGACGGTGG ACGGTATCGG GGTCGACGGC
GACGGCGAGG GCGTCGAGGG ACCGCTGACC GCGCTGCTCT CGCGGCTGTT CGATCGGCGC
GTGGAGGAAG CGTACTACGA CGGCGAGCGG CTATAA
 
Protein sequence
MGEDPGGADD AAEELRYHVD GEIVPASQAT VSVEDRGFAY GDAAFETLRA YGGEVFRWDD 
HAARLADTCE TLGLDHGLSE IDLKARIDET LAANDLAEAY VKLSITRGVQ PGTLDPRPEV
DPTVVVIAKP LARGGVDSTP VHDGPAALQT TKTRKPSSRA LPADAKTHNY LNGILARLEL
RVTGADEALM LDPDGNVAEG ATANLFFADG TALKTPSLDG PILPGVTRRT VIEIAEAEGI
PVEEGTYAPD AVREADEVFL TNSTWEIRPV ETVDGIGVDG DGEGVEGPLT ALLSRLFDRR
VEEAYYDGER L
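
The relationship between the gene sequence and the protein sequence above can be illustrated with a short sketch. It assumes the standard codon assignments (which translation table 11 shares for sense codons) and uses only the first 60 bases of the gene sequence as input; `clean`, `translate` and `gc_content` are hypothetical helper names, not functions from an existing library.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence into blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a DNA sequence codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence from this record.
first_60_bases = "ATGGGTGAGG ACCCGGGCGG GGCGGACGAC GCGGCCGAGG AACTCCGCTA CCACGTCGAC"

print(translate(first_60_bases))  # MGEDPGGADDAAEELRYHVD
```

The printed residues match the first 20 residues of the protein sequence listed above. Passing the full gene sequence to `gc_content` gives a fraction that can be compared with the GC content field of the record.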