Gene Hlac_1773 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_1773 
Symbol 
ID7399646 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp1791620 
End bp1793083 
Gene Length1464 bp 
Protein Length487 aa 
Translation table11 
GC content66% 
IMG OID643708839 
ProductTrkA-N domain protein 
Protein accessionYP_002566422 
Protein GI222480185 
COG category[R] General function prediction only 
COG ID[COG0618] Exopolyphosphatase-related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.0116553 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTAGTG GGGTATCCGG ATCGGCGAGA ACGCGATACG CGATCCTCGG TTGCGGGAGC 
GTCGGGCACG CCGTCGCGGA GGACCTGACC GGCCGCGGCA AGGACGTGCT CATCCTCGAT
CGCGACGAGA GCCGGGTCGA GGCGCTCCGC GACCAAGACC TCAACGCCCG CGTGCAGGAT
ATCGCCGACC CGGACGTGGT CGAGGCGATC GACGACCGGG ACATCGTGTT GATCTTGGCG
AGTGACGTCG AGGCGAACAA GGCGGCCGTC TCGGCGGTGC GCGACACCGG TGGCGACCAC
TACGTCGTCG TCCGCGCCTC GGACCCCGTC AGCGAAGACG AACTGCGCGA GCGCGGCGCC
GACGTGGTGA TAAACCCCTC GACGGTGATC GCCGACAGCG CGCTCCAGTC ACTGGAGACC
GGCGAGCTGG AGTACATGGC GCGTCAGCTC GCGGAGATTA TCGAGGACGG GGATGGGCGG
ATGGCGATCT TGACCCACGA CAACCCGGAG CCGGACTCAA TCGCGTCGGC GACCGCGCTG
CAGGCCATCG CCGGCGCGTT CGGCGTCGAG GCGGACATCC TCTACACCGG AGACGTCGGC
CACCAGGAGA ACCGCGCGTT CGTCAACCTC CTTGGGATCG ACCTAGTGGC CCGCTCGGAG
GCACCCGACC TCTCGGAGTA CGGGACGGTC GCTGCAGTCG ACCTCGCAAA GTCGGCCGAG
GACAGCTTCG ACTTCGAGAC CGATATCGAC ATTTATCTCG ATCACCTCGA AGCCGACGTC
CCCTTCGACG CCCGGTTCGT CGACGTACGG ACCAACGTCT CCTCGACCTC GACGATCCTC
ACGAAGTACC TCCAAGAGTT CGACCAGTCG CCGACGGAGG CGGTTGCGAC CGCCCTCCTC
TACGGGATCC GCGCCGAGAC GCTCGACTTC AAGCGGGACA CGACACCCGC GGACCTGACC
GCCGCCGCGT ATCTCCACCC GTTCGCGAAC CACGACACGC TCGAACAGGT GGAGTCGCCG
TCGATGAGCC CCGAGACGCT GGACGTACTC GCGGAGGCGA TCCAGAACCG CGAGGTACAG
GGGAGCCACC TGTTCTCGAC GGCCGGGTTC ATCCGCGATC GTGAGGCGCT CGCGCAGGCG
GCCCAACATC TCCTCAACTT AGAGGGGATC ACGACGACCG CCGTCTTGGG AATCGCTGAC
GACACGATCT ACCTCGCGGC TCGCTCGAAG GATATCCGGC TGAATATCGG TAACGTCCTC
GACGAGGCAT TTTCCGAGAT GGGCGACGCC GCCGGTCACT CAACACAGGG CTCGTTAGAG
ATTCCGCTCG GCATCTTTAC CGGGATCGAG TCGAGCGGCG ACAACCGAGA CACCCTGCTC
AACCTCACCG AGGAGGCCGT CCGCCGAAAG CTGTTCGACG CGCTCGGCGT CGAGGGCGGG
AGCGAGAGCG GGAACACCTC GTGA
 
Protein sequence
MSSGVSGSAR TRYAILGCGS VGHAVAEDLT GRGKDVLILD RDESRVEALR DQDLNARVQD 
IADPDVVEAI DDRDIVLILA SDVEANKAAV SAVRDTGGDH YVVVRASDPV SEDELRERGA
DVVINPSTVI ADSALQSLET GELEYMARQL AEIIEDGDGR MAILTHDNPE PDSIASATAL
QAIAGAFGVE ADILYTGDVG HQENRAFVNL LGIDLVARSE APDLSEYGTV AAVDLAKSAE
DSFDFETDID IYLDHLEADV PFDARFVDVR TNVSSTSTIL TKYLQEFDQS PTEAVATALL
YGIRAETLDF KRDTTPADLT AAAYLHPFAN HDTLEQVESP SMSPETLDVL AEAIQNREVQ
GSHLFSTAGF IRDREALAQA AQHLLNLEGI TTTAVLGIAD DTIYLAARSK DIRLNIGNVL
DEAFSEMGDA AGHSTQGSLE IPLGIFTGIE SSGDNRDTLL NLTEEAVRRK LFDALGVEGG
SESGNTS