Gene Hlac_2329 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_2329 
Symbol 
ID7401946 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp2326021 
End bp2327070 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content66% 
IMG OID643709402 
ProductTRAP transporter solute receptor, TAXI family 
Protein accessionYP_002566975 
Protein GI222480738 
COG category[R] General function prediction only 
COG ID[COG2358] TRAP-type uncharacterized transport system, periplasmic component 
TIGRFAM ID[TIGR02122] TRAP transporter solute receptor, TAXI family 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.809783 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTTCAC ACCAAACACG ACGTAGATTC CTCGAAGCGA CCGGTGTCGC TGGCGTGGCC 
GCACTTGCCG GCTGTAGCGG AAACGGCGGC GATGGCAGCG ACGGAAGCGA CGGAAGCGAC
GGAAGCGACG GTAGTGACGG AAGCGATGGC AGTGACGGAA GCGATGGCAG CGACGGAAGC
GATGGCGGCG ACGGCGAGAC GCGCCTGACC TGGCACGCGG GCGGGACCGG CGGGACCTAC
TTCCCCCTCT CGAACGAGAT CAAGACCATC GTCGACGCCA ACACCGACTT CACGCTGAAC
GTCCAGTCCA CGGGCGCGAG CGTCGAGAAC GTCGGCAGCC TCGCCGACGG GTCGGCCGAC
TTCGCGCTGA TCCAGAACGA CATCGCCTCG TTCGCGAGGA ACGGTACGGG CATCGACGCC
TTCATCGACA ATCCGATCGA GAACCTTCGG GGCGTCGCGA CGCTGTACCC GGAGACGATC
ACGCTCGTCA CGCTGGCGGA GAACGACATC TCCTCGGTCG ACGACCTCAG CGGCGCGACG
ATCAACACCG GCGACCTCGG GTCGGGGACG CAGGTTAACG CGGTACAGAT CCTGGACTCG
CTCGGAGTCA CCGACTACAA CGAGCAGAAC GCCGGCTTCT CGCAGGCGTC CGAACAGCTC
GCCAACGGCG ACATCGACGC GGCATTCGTC GTCGGCGGCT GGCCGGTCGG CGCGATCGAG
GAGCTCGCGA ACACGAACGA CATCGAGATC GTTCCGATCG GCGGCGACAG CCGCGAGGCC
GTCAAGGAGG ACGCCTCCTG GTTCGCGGAC GACACCATCC CCGGCGGCAC GTATAGCGGA
ATCGATGAAG ACGTCGAGAC GGTCGCCGTG CAGGCGATGA TCGCCACGAA CGCCGAGGTG
CCGGACGAAA CCGTCCGGAC GGTCACCGCG GCCATCTTCG ATAACCTCGA CGAGCTCTCG
ATCAAGACCG AGTTTATCAC CGTCGACACC GCACAGGACG GGATGTCCAT CGAGCTCCAC
GACGGCGCCG CGGCCTACTT CGACGCGTAG
 
Protein sequence
MSSHQTRRRF LEATGVAGVA ALAGCSGNGG DGSDGSDGSD GSDGSDGSDG SDGSDGSDGS 
DGGDGETRLT WHAGGTGGTY FPLSNEIKTI VDANTDFTLN VQSTGASVEN VGSLADGSAD
FALIQNDIAS FARNGTGIDA FIDNPIENLR GVATLYPETI TLVTLAENDI SSVDDLSGAT
INTGDLGSGT QVNAVQILDS LGVTDYNEQN AGFSQASEQL ANGDIDAAFV VGGWPVGAIE
ELANTNDIEI VPIGGDSREA VKEDASWFAD DTIPGGTYSG IDEDVETVAV QAMIATNAEV
PDETVRTVTA AIFDNLDELS IKTEFITVDT AQDGMSIELH DGAAAYFDA