Gene Hlac_2304 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_2304 
Symbol 
ID7401921 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp2296444 
End bp2297583 
Gene Length1140 bp 
Protein Length379 aa 
Translation table11 
GC content72% 
IMG OID643709377 
Productaminotransferase class V 
Protein accessionYP_002566950 
Protein GI222480713 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0520] Selenocysteine lyase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00866664 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.420378 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGGGAC TCGAAGTCGA CGACCGGATG ACCCCGCGCG AGCTGCGCGC GGACGTGCCG 
GCGCTCGGCG AGGCCGCGTA CTTTAATTTC GGCGCGCACG GGCCGAGCCC CGAGTACGTC
GTCGAGGCGG CGTCTGAGTT CCTCGCCGAC CACGAGTACG GCTCGGCGAC GACGGACCCG
TACAGTCGCG CCTTCGAGAC GTACGAGACG GTCCGCGAGC GCGTCGCCGA CTTCGTCGGC
GCCGAGCCCG ACGAGATCGC CCTGACCGAG AGTACGACCG ACGGGATCAC CCGGATCGCG
GGTGCCATCG ACTGGGAGCC CGGCGACGTG GTCGTCCGGA CCGACCTGGA ACACCCAGCC
GGCATCCTCC CGTGGAAGCG GCTCGAACGC GAGGGCGTCG AGGTGCGCGT CGTCGAGACC
GAAGAGGGCC GGATCGATCG CGAGGCGTAC GCCGAGGCCG TGGCGGACGC GCGGCTCGTC
TGCTTCAGCG CGATCACGTG GACCCACGGG ACGCGGCTCC CGGTCGCGGA CCTCGTCGAG
ATCGCGGACG AGGCGGGCGC GTTCACGCTC GTCGACGCGG TCCAGTCGCC CGGACAGGTC
GCGATGGACG TGTCGGCGTG GGGCGCCGAC GCGGTGGCGG CGGCGGGCCA CAAGTGGGTG
CTCGGCCCGT GGGGAGCCGG GTTCCTCTAC GTCGACCGCG AGGCCGCGAC CGAGCTGGCG
CCGCGCGCGG TCGGCTACCG GAGCGTCGAG GATCCCAACG CCGACGAGAT CGTGTTCAAG
GAGGGTGCGA AGCGCTTCGA GGTCGGATCG ACGACCCCGG CCGCCCACGT CGGGCTGATC
GAGGCGCTCG ACGCGATCGA CGCGGTCGGG ATCGCGACGA TCGAAGACCG GATCGCGTCG
CTCACGGACC GGCTCAAAGA CGGCGTCCCC GACGACCGTC TGCTGAGCCC CCGCGAGTAC
GAGTCCGGGC TCGTCACGAT CGACGTCGAC GACCCCGAGG CGACGGTCGA CCGGCTCGCC
GACGAGGGGA TCGTCGTTCG TTCTCTCCCG CACCCGGACG GGGTCCGGGC GTCGGTCCAC
GCCGTCTCCA CCGAGGCCGA GATCGACCGG CTCGTCGAGC GGCTCGCCGT CGAGTGGTGA
 
Protein sequence
MAGLEVDDRM TPRELRADVP ALGEAAYFNF GAHGPSPEYV VEAASEFLAD HEYGSATTDP 
YSRAFETYET VRERVADFVG AEPDEIALTE STTDGITRIA GAIDWEPGDV VVRTDLEHPA
GILPWKRLER EGVEVRVVET EEGRIDREAY AEAVADARLV CFSAITWTHG TRLPVADLVE
IADEAGAFTL VDAVQSPGQV AMDVSAWGAD AVAAAGHKWV LGPWGAGFLY VDREAATELA
PRAVGYRSVE DPNADEIVFK EGAKRFEVGS TTPAAHVGLI EALDAIDAVG IATIEDRIAS
LTDRLKDGVP DDRLLSPREY ESGLVTIDVD DPEATVDRLA DEGIVVRSLP HPDGVRASVH
AVSTEAEIDR LVERLAVEW