Gene Hlac_1546 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_1546 
Symbol 
ID7401477 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp1566749 
End bp1567957 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content66% 
IMG OID643708613 
ProductCys/Met metabolism pyridoxal-phosphate-dependent protein 
Protein accessionYP_002566204 
Protein GI222479967 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0626] Cystathionine beta-lyases/cystathionine gamma-synthases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGATC GATCCGACTC CCACGAGGGC GACGCCGACG ACGAGGGCGA CGCCGACGAC 
CGCCGCTTCG AGACCCGCGC GATCCACGCC GGACAGGAAC CCGACCCGGA GACCGGCGCG
CTGATGACAC CAATTCACGC AAACTCCACG TATAAACAGG ACGCGCCGGG CGAGCACCGC
GGCTACGAGT ACAGTCGAAC CGGAAACCCG ACTCGAACCG ACCTCGAAGC CAACCTCGCC
TCGCTGGAAT CGGGGAGTCA CGCGCGCTGC TTCTCCTCCG GGATGGGCGC AATAAACACC
GTCCTCAACC TGCTTGAGGC GGGTGACCAC GTCGTCGCCG GCGACGACGT GTACGGCGGG
ACCCACCGCA TCCTCACGCA GGTGTACGAG CAGTACGACT TGGAGACCAC CTTCGTCGAC
ACGACCGACC ACGACGCGGT CCGGGATGCG ATGCGTGAGG AGACGGAGTT AGTGTGGGTC
GAGACGCCGA CGAACCCCCT GTTGAACGTC AACGACATCG GCGCACTTGC CGATATCGCC
CACGAGGCAG ATGCGCTCTG CGCGGTCGAC AACACGTTTG CGACCCCGTA TCTCCAGCGC
CCGCTCGAAC ACGGCGCCGA CATCGTCTGC CAGTCGCTGA CGAAGTACCT CGGCGGTCAC
TCCGACACCA TCGGCGGGGC GCTCGTCGTC GACGACGCAG AACTGGACGA GCGGCTCGGC
TTCTACCAGA ACTCGGTGGG CGCGACGCCC GGCCCGTTCG ACTCCTTCCT CGTGTTACGC
GGGACGAAGA CGCTCCCGGT CCGGATGGAC CGCCACTGCG AGAACGCGAT GGAACTGGCT
CAGTGGTTAG AGGACCACGA CGACGTGAGC CGCGTCTACT ACCCCGGACT AGAGAGCCAC
CCGGACCACG ACCTCGCGGC CGAGCAGATG GACGCCTTCG GCGGGATGCT CTCCTTCGAG
TTCGACGGCA CCCTCGAACA GGGCTCGACA GTCGTCAGTG AGACGGAAGT GTTCACCCTC
GCGGAGTCGC TCGGCGGCGT CGAGAGCCTG ATCGAGCAGC CGGCAGCGAT GACCCACGCC
GCGATCCCCC GCGAGGAGCG GCTCGCAGCC GGGCTCACGG ACGGCCTCAT TCGAGTGTCG
GTGGGGATCG AGCACGTGGA CGACATGAAG GCCGACTTCC AGCAGGCGTT CGACGCGGCG
CTGGAGTAG
 
Protein sequence
MSDRSDSHEG DADDEGDADD RRFETRAIHA GQEPDPETGA LMTPIHANST YKQDAPGEHR 
GYEYSRTGNP TRTDLEANLA SLESGSHARC FSSGMGAINT VLNLLEAGDH VVAGDDVYGG
THRILTQVYE QYDLETTFVD TTDHDAVRDA MREETELVWV ETPTNPLLNV NDIGALADIA
HEADALCAVD NTFATPYLQR PLEHGADIVC QSLTKYLGGH SDTIGGALVV DDAELDERLG
FYQNSVGATP GPFDSFLVLR GTKTLPVRMD RHCENAMELA QWLEDHDDVS RVYYPGLESH
PDHDLAAEQM DAFGGMLSFE FDGTLEQGST VVSETEVFTL AESLGGVESL IEQPAAMTHA
AIPREERLAA GLTDGLIRVS VGIEHVDDMK ADFQQAFDAA LE