Gene Hlac_0853 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_0853 
Symbol 
ID7400819 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp846056 
End bp847114 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content67% 
IMG OID643707919 
ProductABC-3 protein 
Protein accessionYP_002565522 
Protein GI222479285 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1108] ABC-type Mn2+/Zn2+ transport systems, permease components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGGGG AATCAGCCGC CGACACGAAA AATCGCCCGC GCCCCGACCG TGGCCTCCGC 
CGAACCGCCG AACTCGTCGG AATCGCGATC ACGGCCGTCG TCGCGGCCGT CATGCTCGGA
TTCGTTCTGC TCTACTGGGC GCAGGACCTC CCGGTAGCGA GCGAACTGTA CGCCGCGTTC
CGCTCGTTCG GACGCGGGAT GGACGCCGCG TTCGGCACGA ATGTGTTCCG GCACCCGATC
ATGTGGCAGT CGATGGCGGT CGGCGTGCTC GTCGGGGTCG TCGCTCCGCT GGTCGGTTCC
TTCCTCGTCC ACCGCGAGAT GGCGCTGATC GGCGAGACGC TCGCGCACAC CGCATTCGCC
GGCGTCGCGA TCGGCATCCT CGTCACGTCT TCGACCGGGT GGAACGGTTC GCTGCTGCTC
GTCGCGCTCG CGGTCGGGAT CCTCGGCGCG CTTGTAGTCC AGTGGCTTAC CGAGCGCACT
GACGCCTACG GGGACGTGCC GATCGCGATC ATGCTCAGCG GGAGCTTCGC GGTCGGAACC
CTGATCATCA GCTACGGTGA CGGACTCACC GGGGTCAACA TTCAGGGATA CCTGTTCGGG
AACCTCGCGG TCGTCACGCC GGAGGGAGCG CGCCTGATGG GCGCGCTCTC CCTGATCGTC
GTCGCTGGCG TGGCGCTGAC GTACAAACAG CTGCTCTTCA TCACCTTCGA CGAGCAGGCC
GCGCGGGTCG CTCAGCTGAA CGTCACCGGA TACAACACCC TGCTCGTGGT GTTGACCGCG
GTCGTCGTCG TCGGCGCGAT GCAGGTGCTC GGCGTCATCC TCGTCGCCGC GATGCTCGTC
GTTCCGGTCG CGGCCGCCTC CCAGATCGCT CGGAGCTTCC GCGAGACGAT GTACCTCGCG
GTGATCTTCG GACAGCTGTC GGTGATCGGC GGCTTCGCCG TCTCGATCGG CTTCGGACTT
CCCTCCGGGG GGTCGATCGT CATCACCGCG ATCGCAATCT ACCTCGCGAG CATCGTCGGC
TCCGGCTTCT CGGTGAAGGC GATCTCGGCG CACGGGTGA
 
Protein sequence
MSGESAADTK NRPRPDRGLR RTAELVGIAI TAVVAAVMLG FVLLYWAQDL PVASELYAAF 
RSFGRGMDAA FGTNVFRHPI MWQSMAVGVL VGVVAPLVGS FLVHREMALI GETLAHTAFA
GVAIGILVTS STGWNGSLLL VALAVGILGA LVVQWLTERT DAYGDVPIAI MLSGSFAVGT
LIISYGDGLT GVNIQGYLFG NLAVVTPEGA RLMGALSLIV VAGVALTYKQ LLFITFDEQA
ARVAQLNVTG YNTLLVVLTA VVVVGAMQVL GVILVAAMLV VPVAAASQIA RSFRETMYLA
VIFGQLSVIG GFAVSIGFGL PSGGSIVITA IAIYLASIVG SGFSVKAISA HG