Gene Hlac_0162 details

Gene Information

Locus tag: Hlac_0162
Symbol:
ID: 7402091
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 171606
End bp: 172763
Gene Length: 1158 bp
Protein Length: 385 aa
Translation table: 11
GC content: 67%
IMG OID: 643707225
Product: ABC-type iron(III) transport system, substrate-binding protein
Protein accession: YP_002564837
Protein GI: 222478600
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1840] ABC-type Fe3+ transport system, periplasmic component
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
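
The start, end, gene length and protein length fields above agree with one another, and a short sketch can confirm this. It assumes that the coordinates are 1-based and inclusive and that the final codon is a stop codon, which is not counted in the protein length. The function names are illustrative only and are not part of the database record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


print(gene_length(171606, 172763))  # 1158
print(protein_length(1158))         # 385
```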


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCAACT ACCTACCAGA CGGCGTAGAC CGCCGGCAGT TCCTCGCAGC CACGGGCGCG 
CTCGGCGTCG CCGGCCTTGC CGGGTGTACG GGTGACGATA CCGACGGCGG CAGCGGCAAC
AGCAGCGACG GCGGCGACGG CGGCGACGGC GGCGACGGAT CGATCGGACA GATCGGTTCC
GGGCGCGAAG GTCGCGGAGC CCCCGGTGGC ATCCCGATGG CCGAGATGCC CGATCTGGAA
GGCGAGCTCA CGGTCTACTC CGGCCGCGGC GAGTTCCTCG TCGGCGAGCT CGTTGAGTAC
ATCGAGGACC AGTACGACGA TTTCGACCTG ACCGTCCGGT ACGCTGGTTC TACCGACCTG
GTAAACCAAA TTCTCAATGA GGGTGACGGC TCCCCCGCCG ACGTGTTCTA CTCGGTCAAC
GCGGGCTCGC TGGGCACGCT CGCCGGCGAG GGCCGTTCCC AAGCGCTCTC CTCGGAAATC
ACCGACATGG TTCGGTCGGA GTTCCGCACC GAGCAGTGGA TCGGCACGTC GGGACGCGCC
CGGACCGTTC CGTACAACAC CGGAGAGTTC TCCGACGACG ACCTCCCCGA CGACATCATG
GCGTACCCGG AGGAGTTCGC CGGGAGTCTC GGCTGGGCGC CTTCCTACGG CTCCTGTCAG
GCGTTCATCA CCGCGATGCG ACTCATCGAG GGCGAGGAGG CGACGCTGGC GTGGCTGGAG
TCCGTGGTGG AGGCCGGAAT CAGCAGCTAT CCCGACGAGT TCGCCGCCTG TCAGGCGATC
GCTGACGGAG AGATCGACGC GGCGTTCACG AACCACTACT ACATCCAGCG CGTCCTCGAC
GGCAACCCCG ACGCATCGAT CGGCACCGCC TTCACCAGCG GCGACGCCGG TGCGGTGTTC
AACGTCGCCG GCGCCGCAGT CGTCGACACG GCGAGCGACG CGACGCTCGC AGAGAACTTC
ATCCGCCACC TGCTGTCGGC CGAGGCGCAG GACTACTTCG CCCGGTCGAC GTTCGAGTAC
CCACTCATCC CCGATGTCGA GCCGATCGGC GACCTGCCGA CGATCGACGA ACTCGACGTG
CCAGATATCG ATCTGACGGA GCTGTCCGAC CTCGAACCGA CGATCGACCT CATGCGTGAG
GCCGGCGTCG AGGTGTAG
 
Protein sequence
MTNYLPDGVD RRQFLAATGA LGVAGLAGCT GDDTDGGSGN SSDGGDGGDG GDGSIGQIGS 
GREGRGAPGG IPMAEMPDLE GELTVYSGRG EFLVGELVEY IEDQYDDFDL TVRYAGSTDL
VNQILNEGDG SPADVFYSVN AGSLGTLAGE GRSQALSSEI TDMVRSEFRT EQWIGTSGRA
RTVPYNTGEF SDDDLPDDIM AYPEEFAGSL GWAPSYGSCQ AFITAMRLIE GEEATLAWLE
SVVEAGISSY PDEFAACQAI ADGEIDAAFT NHYYIQRVLD GNPDASIGTA FTSGDAGAVF
NVAGAAVVDT ASDATLAENF IRHLLSAEAQ DYFARSTFEY PLIPDVEPIG DLPTIDELDV
PDIDLTELSD LEPTIDLMRE AGVEV
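
The sequences above are printed in space-separated groups of ten characters. As a minimal sketch of how the gene sequence could be joined, translated and checked for GC content, the following assumes the standard amino acid assignments, which translation table 11 shares with the standard genetic code (the two differ in which start codons are permitted). The helper names are hypothetical, and alternative start codons are not handled specially because this gene begins with ATG.

```python
# Codon table in NCBI order (bases T, C, A, G); the amino acid
# assignments are identical for translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to group the printed sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First two groups of the gene sequence shown above.
first_groups = clean_sequence("ATGACCAACT ACCTACCAGA")
print(translate(first_groups))  # MTNYLP
```

Passing the full gene sequence through `clean_sequence` and then `translate` or `gc_fraction` allows the result to be compared with the protein sequence and the GC content field listed in the record.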