Gene Hlac_2965 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_2965 
Symbol 
ID7398944 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012028 
Strand
Start bp218416 
End bp219738 
Gene Length1323 bp 
Protein Length440 aa 
Translation table11 
GC content62% 
IMG OID643706777 
Producthypothetical protein 
Protein accessionYP_002564399 
Protein GI222475878 
COG category[S] Function unknown 
COG ID[COG4983] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGAATTC CCCCGACCGC CGACACCCTC CCGGACGCGC TCGTCGATCG TGACCAGTGG 
GTGTGCTGGC GAAGCCAACA GCGAGAGGAC AAGCACACCA AAGTCCCGAT TATTCCCGGC
ACAACGCAGT TTGCGTCGAC GACGGATCCG GATACGTGGC GAGACTTCCA GACGGCCCGC
AAGGCGGTGA CATCAACCCC AGTCGATGGA TTGGGGTTTG TGTTTACGAC AGATGACCCG
CTCATCGGGA TCGATCTGGA TGACTGCCGC GATCCTGACA CCGGCGAGCC GACCGCGTGG
GCCAGCCAGA TCATCACCCA ACTCGACTCG TACACGGAAG TGAGCCCCTC TGAAACCGGC
TATCACATTC TCGTCACGGG GGCGCTCCCT GAGGGACGCA ACCGCGCTGG CAACCTTGAG
CTGTATGACC GATCGCGATT TTTCACTGTA ACCGGCACAC AACTTGCAGA AACACCCGCA
ACGGTAGCCG AGCGGACGGC AGCGGTGGCG TCGCTTCACG CTGAGAAAGT GGCTTCCGAG
TCGTCCAGCG AGCCGCCCAG TACCACCACG GATCGATCCC CGAACGATGA CACCACCGCA
CCTGAGACAG CCACCGCCGG CGTCGGTGCG CTTTCAGACG AGGAGCTACT CAATCGGGCA
ACAGCCGCTG CCAACGGTGC AAAGTTTCGT CGGCTCTGGG GAGGTGACAC CAGCGGCTAC
GAGAGCCACT CTGAGGCAGA CATGGCACTG TGTCGGCTGC TTGCTTTCTG GACGGGCCGG
GATCGCGGTC GAATGGATCG ACTGTTTCGA CAGTCGGGGC TGGCCCGTGA CAAGTGGGAC
GAGGTCCACT ACGCTGACGG CAGTACCTAC GGTGAGAAGA CGCTTGAGCG GGCGATCGCT
CGTACAGACG ACGTGTACAC GCCACCGGGG ACAGCCGACG CATCACCGTC GGCCGGTGAC
ACTGACACCG CTCCCGATGA CTCAGCGGCT CCCTCGACAC CAACGGCGAC CGATCCCGCA
CAGCACGCGT CGACGTCACC AGCCGAATCG ACACCACCCA GCCGAACCGA GACATCATCA
GCCGGTGACT CCACCGCAGG GTCTCCCGCG ACGGCAGCAA CGACACTCAA CTCCGGGCCA
GCAGCTCACC CACAGCATGC CCGCGCACGT CTCGAACGAC TCGAAGAGCT GACAGCCCGA
ATTGAAACGC TCATCGAAGA AAACGAACAG CTTCGAGCGG ATTTAGCGGC CGAACGGAAC
CGACGACAGG CGCTCGAACG CGCCACAGAT GAGGATGAAA CAGCCAGTTG GTGGCCGCTG
TAA
 
Protein sequence
MGIPPTADTL PDALVDRDQW VCWRSQQRED KHTKVPIIPG TTQFASTTDP DTWRDFQTAR 
KAVTSTPVDG LGFVFTTDDP LIGIDLDDCR DPDTGEPTAW ASQIITQLDS YTEVSPSETG
YHILVTGALP EGRNRAGNLE LYDRSRFFTV TGTQLAETPA TVAERTAAVA SLHAEKVASE
SSSEPPSTTT DRSPNDDTTA PETATAGVGA LSDEELLNRA TAAANGAKFR RLWGGDTSGY
ESHSEADMAL CRLLAFWTGR DRGRMDRLFR QSGLARDKWD EVHYADGSTY GEKTLERAIA
RTDDVYTPPG TADASPSAGD TDTAPDDSAA PSTPTATDPA QHASTSPAES TPPSRTETSS
AGDSTAGSPA TAATTLNSGP AAHPQHARAR LERLEELTAR IETLIEENEQ LRADLAAERN
RRQALERATD EDETASWWPL