Gene Hlac_1416 details


Gene Information

Locus tag: Hlac_1416
Symbol:
ID: 7400735
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 1427355
End bp: 1429010
Gene Length: 1656 bp
Protein Length: 551 aa
Translation table: 11
GC content: 67%
IMG OID: 643708477
Product: ABC transporter related
Protein accession: YP_002566074
Protein GI: 222479837
COG category: [R] General function prediction only
COG ID: [COG3845] ABC-type uncharacterized transport systems, ATPase components
TIGRFAM ID:
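
The gene and protein lengths listed above follow from the start and end coordinates. The short Python sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive and that the CDS ends in a single untranslated stop codon; the function names are illustrative and not part of any database API.

```python
# Coordinates copied from the record above.
# Assumption: 1-based, inclusive coordinates on the replicon.
START_BP = 1427355
END_BP = 1429010


def gene_length(start: int, end: int) -> int:
    """Length in bp of an inclusive coordinate range."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a complete CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1656 551
```

Both values agree with the Gene Length and Protein Length fields of this record.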


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.168356
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACCCGG CGGTCCACAT GGACGGTATC ACGAAGCGAT TCCCCGGGGT CGTCGCGAAC 
GACGACGTTG ATCTCGCGGT GGAGCGCGGG AGTGTCCACG CCCTGCTCGG CGAGAACGGG
GCCGGCAAGA CCACGCTGAT GAACGTGTTG TACGGGCTCT ACGAGCCGAC CGAGGGGACC
GTCTTCCTCG ACGGCGAGCC GCAGTCGTTC GACTCGCCGC GCGACGCAAT CGACGCGGGC
GTCGGCATGA TCCACCAGCA CTTCATGCTC GTCGACCCGA TGACGGTGTG GGAGAACGTC
GTCTTGGGCA ACGAGCCGAA GACGTGGGGC GGGCTCCGCG TCGACGAGGC CGCCGCCCGC
GAGGCGGTCG TCGAACTGAG CGAGCGCTAT GGTTTCGACG TGGACCCCGA CGCCCGGATC
GAAGACGTCT CGGTCGGCGT CCAACAACGC GTCGAGATCC TAAAGGCGCT GTACCGCGGC
GCCGACGTGC TCATCCTCGA CGAGCCGACC GCGGTGTTGA CGCCTCAAGA GGTCGAGGAC
CTGTACGGCG TCTTCGAGGA GCTCACCGAG CAGGGGAAGA CGATCATCTT CATCTCGCAC
AAGCTCGGCG AGGCGCTGTC GGCGGCCGAC GAGATCACCG TCCTCCGCGA CGGCGTCAAC
GTCGGGACGG TCGCGTCCGC AGACGTGACC CGTGAGGATC TTGCGGAGAT GATGGTCGGC
CGCGAGGTGC TGATGGAGCC CGCGACGACG CCGCAGAAGC CGGGCGACCG CGTCCTTGAG
GTGACCGAGG TCCACGCCGA CGACGACCGC GGCGTTGAGA CGGTGTCGGG GATCTCCTTC
GAGCTCCGTG CAGGCGAGGT GTTCGGCATC GCCGGTGTCG ACGGAAACGG TCAGTCCCAA
CTCGTCGAGA CAATCACGGG GATGCACAAT CCGACCGACG GGTCGATCAC CTACCTCGGC
GAACCGATGG CCGACGAGAG CCGTCGGGAA CACATCGACC GCGGGATGGC CTTCATCCCC
GAAGACCGGC AGGAGCGAGG GCTGGTGATG TCGTACGACC TGACCGAGAA CGGTATCCTC
GGAAGCCAGC ACGACCCGCC GTTCGCCGAG GGCGGCCGGC TCGACTGGCG CGCCTCTCGT
GATCACGCCG AGTCCGTGAT CGAGCAGTAC GATGTGCGGC CCCCGAACGC CGACGCCGAG
GCGGAATCGC TCTCTGGCGG CAACCAGCAG AAGTTCATCG TCGGCCGGGA GTTCGAGCGC
GATCCGGAAC TGGTCGTGGC GATGCACCCG ACGCGGGGGG TCGACATCGG CTCCACGGAG
TTCCTCCACG ACCGCCTGCT CGAACTGCGG ACCGAGGGGA AGGCGGTGCT TCTGGTCTCC
TCGAAGTTAG ACGAAGTACA GGGGCTCTCC GACCGGCTCG CCGTGATCCA CGAGGGAGAG
TTCACTGGAG TCGTCGACCC CGCCACGGTG ACCGAAGAGG AAATCGGCCT GCTGATGGCG
GGCGAGACGC TCGACGACGA CTCCGTCGAG GGGGCTGCGG TCCACAACGC CGGACCGGAC
CCCGTTGGCA ACGGCGGCGT GGAGACGGAC ACCGGGACAG ATGATACCCT CGAGAAGACA
GATGACGACA TTGAGAACGA GGAGGTGGAC GCGTGA
 
Protein sequence
MNPAVHMDGI TKRFPGVVAN DDVDLAVERG SVHALLGENG AGKTTLMNVL YGLYEPTEGT 
VFLDGEPQSF DSPRDAIDAG VGMIHQHFML VDPMTVWENV VLGNEPKTWG GLRVDEAAAR
EAVVELSERY GFDVDPDARI EDVSVGVQQR VEILKALYRG ADVLILDEPT AVLTPQEVED
LYGVFEELTE QGKTIIFISH KLGEALSAAD EITVLRDGVN VGTVASADVT REDLAEMMVG
REVLMEPATT PQKPGDRVLE VTEVHADDDR GVETVSGISF ELRAGEVFGI AGVDGNGQSQ
LVETITGMHN PTDGSITYLG EPMADESRRE HIDRGMAFIP EDRQERGLVM SYDLTENGIL
GSQHDPPFAE GGRLDWRASR DHAESVIEQY DVRPPNADAE AESLSGGNQQ KFIVGREFER
DPELVVAMHP TRGVDIGSTE FLHDRLLELR TEGKAVLLVS SKLDEVQGLS DRLAVIHEGE
FTGVVDPATV TEEEIGLLMA GETLDDDSVE GAAVHNAGPD PVGNGGVETD TGTDDTLEKT
DDDIENEEVD A
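
As a rough consistency check, the gene sequence can be translated codon by codon and compared with the protein sequence. The Python sketch below builds a codon table for that purpose. It assumes that the amino acid assignments of translation table 11 match the standard genetic code (the tables differ in permitted start codons, which this sketch does not model). The helper names are hypothetical, and only the first and last few codons of the sequence are used, to keep the example short.

```python
from itertools import product

# Codon table in the conventional TCAG ordering.
# Assumption: table 11 uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Drop the spaces and line breaks used to group the sequence into blocks."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate complete codons from position 0; '*' marks a stop codon."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


first_codons = "ATGAACCCGG CGGTCCACAT GGACGGTATC"  # first 30 bases of the gene
last_codons = "GACGCGTGA"                          # last 9 bases of the gene

print(translate(first_codons))  # MNPAVHMDGI
print(translate(last_codons))   # DA*
```

The two fragments translate to the first ten and last two residues of the protein sequence, followed by the stop codon. Passing the full gene sequence to `gc_fraction` would give the figure to compare against the GC content field.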