Gene Hlac_1639 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_1639 
Symbol 
ID7399588 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp1660291 
End bp1661412 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content72% 
IMG OID643708705 
Producthypothetical protein 
Protein accessionYP_002566294 
Protein GI222480057 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0006] Xaa-Pro aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.264137 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCGATC TCGCCGCCCG CACCGAGAGG CTCGACGCGT ATCTCGACGA GCGCGGGCTC 
GAAGCGGTCT GGTTCGCCAA GCCGAACGGG TTCGCGTGGC TCACCGGCGG CGACAACGTC
GTCGACGCCG ACGCCGACTT CGGGGTCGGG GCCGCCGGCT ACGACGGCGA TCTCCGGGTG
ATCACAGACG ACATCGAGGC GGACCGCCTC GCCGACGAGG AGCTCCCCGA CGCCGTCGCC
GTCGAGTCGT TCCCGTGGCA CGCGAACTCG CTGGCTGAGG CGGTCACTGA GCGCTCTCCC
GCGCCAGCGG CCGCCGACTT CGACGTACCG GGCTTCGAGC GCGTCGACGG GAGCCGGCTT
CGGCAGCCCC TCACCGACGA TGATGTCGAG CGCTACCGCG AACTCGGTCG GGAGGCCGCC
GCCGCCGTCG AGACCGTCTG CCGCAACCTC GAACCTGAGG ACCCGGAGTA CGAAGTGGCC
GCCGGCATCG ACATCTCGCT CGCGTCCCGC GACGTCGACA CCCCGGTCGT GCTCGTCGGC
GGCGCTGAGC GCGCCCAGCG CTACCGCCAC TACACCCCGA GTGACGCGAC GCTCGGCGAC
TACGCGCTCG TGTCCGTCAC CGCCGAGCGG GCCGGCCTCT ACGCCTCGCT CACCCGAACC
GTCGCGTTCG ACGCCCCCGA CTGGCTAGAG GAGCGCCATC GCGCGGCCGC GCGCGTCGAG
GCAACCGCGC TCGCCGCGAC CGAGGCCGCC GCGGCCGGAG AGCTAACGGG CTCCGATGGC
CCGGACACCG CCGGCGACGT GTTCGATACG ATCCGAACAG CGTACGACGC CGTCGGCTTC
GCCGGAGAGT GGCGCGAGCA CCACCAGGGC GGCGCGGCGG GCTTTGCGGG CCGCGAGTGG
ATCGCGACAC CCGAGAGCAA CGAGCCGGTT CGGTGGCCCA TGGGCTACGC GTGGAACCCC
ACCGTACAGG GAGCCAAAAG CGAGGACACC CACCTCGTGG CGCCCGACCG GACCGAGACG
CTGACGAAGA CCGGGCAGTG GCCGACACAC GAGGTTGAAC CGGTCGACAT CGAGGGAGTC
GCGACGGAGC CGCGAGAGCT GTCCGCACCG GTCATTCGGT AG
 
Protein sequence
MVDLAARTER LDAYLDERGL EAVWFAKPNG FAWLTGGDNV VDADADFGVG AAGYDGDLRV 
ITDDIEADRL ADEELPDAVA VESFPWHANS LAEAVTERSP APAAADFDVP GFERVDGSRL
RQPLTDDDVE RYRELGREAA AAVETVCRNL EPEDPEYEVA AGIDISLASR DVDTPVVLVG
GAERAQRYRH YTPSDATLGD YALVSVTAER AGLYASLTRT VAFDAPDWLE ERHRAAARVE
ATALAATEAA AAGELTGSDG PDTAGDVFDT IRTAYDAVGF AGEWREHHQG GAAGFAGREW
IATPESNEPV RWPMGYAWNP TVQGAKSEDT HLVAPDRTET LTKTGQWPTH EVEPVDIEGV
ATEPRELSAP VIR