Gene Hlac_0902 details

Gene Information

Locus tag: Hlac_0902
Symbol:
ID: 7401273
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 892898
End bp: 894379
Gene Length: 1482 bp
Protein Length: 493 aa
Translation table: 11
GC content: 71%
IMG OID: 643707967
Product: hypothetical protein
Protein accession: YP_002565570
Protein GI: 222479333
COG category: [S] Function unknown
COG ID: [COG1650] Uncharacterized protein conserved in archaea
TIGRFAM ID:
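
The start, end, and length fields above are related by simple arithmetic. The following minimal Python sketch shows that relationship, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon (neither assumption is stated in the record itself; the variable names are illustrative only).

```python
# Values copied from the Gene Information fields above.
start_bp = 892898
end_bp = 894379

# Assumption: coordinates are 1-based and inclusive on the replicon.
gene_length = end_bp - start_bp + 1

# Assumption: one codon (3 bp) per residue, with one terminal stop codon
# that is not counted in the protein length.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the computed values agree with the listed Gene Length (1482 bp) and Protein Length (493 aa).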


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.757817
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.396557
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGATAGCGA TCGTCGTCAG CCGGGCCGAC AGCGCCTCGG AACACATCGG CGAGCACCTG 
CTCGACCTCG GCGACTGGGA GCGCCGCGAC GACCCGAGTC GTCCCGACGC CGACGGCGGC
GGAACGTACT ACCGGACCGA CGGGTTCGAG CTGCGGGAGT TCGACGACCT CCACATCTAC
CTGGACGATC CCGCGGCCGC GTTCGGTGGT GGGGCAGGTG ACGAGACGAA CGACGCGGCA
AGCGACGACA CCGACGAGAC CCCCGAGTTC CTCGCGTTCG TCTCCCGCCA TTCCGGCGAG
ACGGGAGAGC TACTAACGGC TCACGTCACC GGGAACTTCG GCCCTGCGCC ATACGGGGGC
GAGCCGGACA CGCTGGCTCG GGCGGCGCCG GGAGCCGAGA AGCGCGTCGT CGAGGCGCTG
GCGGCGCACG CTCCCGAGGG GTACGACGTG GGGATCGAGT GCACTCACCA CGGCCCGACG
GACACGTCCG TCCCGTCGCT GTTCGTCGAA CTCGGCTCCG ACGAGCCGCA GTGGACCGAC
GCGGATGCGG CCCGGGCGGT CGCGCGGGCG GTGCTCGACC TGCGCGGGAC CGACGCGGAT
CTGGTCACTG ACGCAGGGGA AACGACTGAC GAGATCGACG ACGACCCCCA CCCCCGCCAC
GTCGTCGGCT TTGGCGGCGG CCACTACGCC CCGCGGTTCA CCCGAATTGT CCGCGAGACC
GAGTGGGCGG TGGGACACGT CGGCGCCGAC TGGGCGCTCG GAGAACTTGG CGCGCCCGAC
GCGAACCGAG ACGTGATCGA GCAGGCGTTC GCGCGGAGCA AGGCGAATGT GGCGGTTATC
GAGGGTGAAA AGCCCGATCT CGAAGCGACG GTCGAGGCGC TCGGCCACCG TGTCGTGAGC
GAGACGTGGG TGCGTGCGGT CGGCGATCGC CCCTTGCCGC TGGTCGAGCG GCTGGAGTCC
GACCTCGCGA CGATCGACGA GGGGCTCCGG TTCGGTGAGG TCGTCCCCGC GTCACCCGAC
GCGATCCGCG TCAGGGGCCT CCCGGAAGAC CTGCTCTCGC GGGCACAGGG CGTGGACGCG
GACGCGGCCC GCGTGGCCGT GGAGACGAAC GCGGTCGCCT TCGACACCGA GCAGGCCGGA
ACGCGAGCGG CCGGGTCGGT CGCGTTCGCT GACGACGAGG TGTCGCCCGG ATACGACGAC
CTCGTCGCAG ACCTCGCGGG CGTGTTGGAG CGCGGGTACG ACACGGTCGA CATTACCGAC
GGCGCCGTGA TAGCGCGCGA GACCGCGTTC GATCCCGAGC TCGCCGCCAA GCGTGGGGTC
CCGGAGGGGC CGGCGTTCGG GCGGCTCGCG AGCGGGGAGT CGGTCGAAGT CGACGGCGAA
ACGATCGCGC CGGCGGACGT GTCGCGAGAG CGGACAAACC GATTCCCGAT CGACTCCCCC
ACTGACTCCG CCGCCGAGCC CCCTACCGAA CCCTCTGAGT GA
 
Protein sequence
MIAIVVSRAD SASEHIGEHL LDLGDWERRD DPSRPDADGG GTYYRTDGFE LREFDDLHIY 
LDDPAAAFGG GAGDETNDAA SDDTDETPEF LAFVSRHSGE TGELLTAHVT GNFGPAPYGG
EPDTLARAAP GAEKRVVEAL AAHAPEGYDV GIECTHHGPT DTSVPSLFVE LGSDEPQWTD
ADAARAVARA VLDLRGTDAD LVTDAGETTD EIDDDPHPRH VVGFGGGHYA PRFTRIVRET
EWAVGHVGAD WALGELGAPD ANRDVIEQAF ARSKANVAVI EGEKPDLEAT VEALGHRVVS
ETWVRAVGDR PLPLVERLES DLATIDEGLR FGEVVPASPD AIRVRGLPED LLSRAQGVDA
DAARVAVETN AVAFDTEQAG TRAAGSVAFA DDEVSPGYDD LVADLAGVLE RGYDTVDITD
GAVIARETAF DPELAAKRGV PEGPAFGRLA SGESVEVDGE TIAPADVSRE RTNRFPIDSP
TDSAAEPPTE PSE
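
The gene sequence begins with GTG while the protein sequence begins with M, which is consistent with GTG acting as an initiation codon under translation table 11. The sketch below illustrates this with a hand-written translator in Python. It is not the tool that produced this record: the function name `translate`, the `is_cds_start` parameter, and the `ASSUMED_STARTS` set are hypothetical, and the start-codon set is a deliberately short list of common alternatives rather than the complete list for table 11.

```python
# Standard amino-acid assignments in the conventional T, C, A, G codon order.
# Translation table 11 uses the same assignments as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Assumption: a short set of common start codons, not the full table 11 list.
ASSUMED_STARTS = {"ATG", "GTG", "TTG"}


def translate(dna, is_cds_start=True):
    """Translate a DNA string, ignoring whitespace, and stop at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        # An initiation codon is read as methionine regardless of its usual meaning.
        if pos == 0 and is_cds_start and codon in ASSUMED_STARTS:
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First and last rows of the gene sequence, copied from the record above.
first_row = "GTGATAGCGA TCGTCGTCAG CCGGGCCGAC AGCGCCTCGG AACACATCGG CGAGCACCTG"
last_row = "ACTGACTCCG CCGCCGAGCC CCCTACCGAA CCCTCTGAGT GA"

print(translate(first_row))
print(translate(last_row, is_cds_start=False))
```

The two rows translate to the first 20 and last 13 residues of the protein sequence listed above; the final TGA codon is read as a stop and contributes no residue.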