Gene Hlac_2523 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_2523 
Symbol 
ID7401575 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp2501705 
End bp2502856 
Gene Length1152 bp 
Protein Length383 aa 
Translation table11 
GC content62% 
IMG OID643709595 
ProductABC transporter related 
Protein accessionYP_002567166 
Protein GI222480929 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3839] ABC-type sugar transport systems, ATPase components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.299517 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAACGC TTGAACTCAA CTCGATAACG AAGACGTTCC AGGACGGCGA CGAGGAAATC 
GTCGCGGTCG ACGATGTGTC GATGTCGATC GACGACGGCG AGTTCCTCGT CGTCGTCGGT
CCGTCCGGCT GCGGGAAGTC CACGACGCTC CGGATGATCG CTGGCTTGGA GACGATCACC
TCCGGCACGC TCAGCATCGA CGATCGCGTC GTTAACGACG TGAAATCACA GGACCGGGAT
ATCGCGATGG TGTTCCAGTC GTACGCGCTC TACCCGCACA TGAGTGTCCG GCAGAACATG
TCATTCGGAC TGGAGGAATC GACGGACCTC CTCGACGACG AGATTAACCG GATGGTGTCT
GAAACCGGCG AGATGCTCGG AATTTCGCCG CTGCTCGATC GAAAGCCGAG CGATCTCTCG
GGCGGGCAAC AGCAGCGCGT CGCACTCGGC CGCGCCATCG TCCGCGACCC TGAGGTGTTC
CTGATGGACG AGCCCCTCAG CAACCTCGAC GCGAAGCTCC GTGCGGAGAT GCGGACGGAA
CTACAGCGGC TTCAGAACGA TCTCGGTGTG ACGACGGTGT ACGTCACGCA CGACCAGACC
GAGGCGATGA CGATGGGCGA TCGGATCGCC ATCCTCGACG GCGGCAAACT GCAGCAGATC
GCGTCGCCGC TGAAGTGCTA CCACGAGCCG GCCAACCAGT TCGTCGCGAG CTTCCTCGGC
GAGCCCTCGA TGAACTTCTT CGACGTGACG CTCGACGGCG ACCGGCTGGT CGGTGACGTC
TTCGAGTATC CCATCGGCGA CGACGTGCGT GTGGACCTCG GAGAGACGGC CGATCTTGTC
ATGGGGATTC GTCCCGAGGC GATCAAGCTC GTCGCGAGCA AGTCCGACGC ACACGAGTTC
GAAATGACCG TCGACGTCGT CGAACCGATG GGCGACGAGA ACACGGTGTA CCTCCATTTC
GACCCCGACG CAGACCCCGA GTCCGCTGCG ACCCTCGTCG CGACGATCGA CGGGTTCACG
CAGGTCAGTG AGGGAGACTC GGTCGTCGCG CAGATTCCGG AGGACGCGAT CCACATCTTT
GATCGCGTCA CCGGCGAGGC GCTCCACAAC CGGTCGATGA AGGACGCCGC TGATCAGGTC
AACCTCGCCT GA
 
Protein sequence
MATLELNSIT KTFQDGDEEI VAVDDVSMSI DDGEFLVVVG PSGCGKSTTL RMIAGLETIT 
SGTLSIDDRV VNDVKSQDRD IAMVFQSYAL YPHMSVRQNM SFGLEESTDL LDDEINRMVS
ETGEMLGISP LLDRKPSDLS GGQQQRVALG RAIVRDPEVF LMDEPLSNLD AKLRAEMRTE
LQRLQNDLGV TTVYVTHDQT EAMTMGDRIA ILDGGKLQQI ASPLKCYHEP ANQFVASFLG
EPSMNFFDVT LDGDRLVGDV FEYPIGDDVR VDLGETADLV MGIRPEAIKL VASKSDAHEF
EMTVDVVEPM GDENTVYLHF DPDADPESAA TLVATIDGFT QVSEGDSVVA QIPEDAIHIF
DRVTGEALHN RSMKDAADQV NLA