Gene Hmuk_1036 details

Gene Information

Locus tag: Hmuk_1036
Symbol:
ID: 8410554
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halomicrobium mukohataei DSM 12286
Kingdom: Archaea
Replicon accession: NC_013202
Strand:
Start bp: 987970
End bp: 989853
Gene Length: 1884 bp
Protein Length: 627 aa
Translation table: 11
GC content: 68%
IMG OID: 645019371
Product: TrkA-C domain protein
Protein accession: YP_003176870
Protein GI: 257387097
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0471] Di- and tricarboxylate transporters
TIGRFAM ID:
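
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon; the helper names `span_length` and `expected_protein_length` are hypothetical and not part of any database tooling.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 987970
END_BP = 989853
GENE_LENGTH_BP = 1884
PROTEIN_LENGTH_AA = 627


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons hold for this record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 989853 - 987970 + 1 gives 1884 bp, and 1884 / 3 - 1 gives 627 aa, matching the listed values.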


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.131112
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGTTCG TCTTCGCCGT CGTCGCCGGT GCGCTCGTCC TCTTCGCGAC CGAGCGCGTG 
CCCGTCGACG TGACCGCCAT CGGCGTGATG GTTGCACTGC TGGTCGTCGA ACCGCTGACC
GCGCTGCTGG CCGACGCCGG AGTGCTGGCC GGGCGACTCT ACGTCCTCCA CGAGCCCGGC
GACGCCGTCG ATCCGGTGGC CGTGGGGCTC TCGGGCTTTG CCTCTCCGGC GACGATCACC
GTCCTCGCGA TGTTCGTCCT CTCGGCGGGG GTCCAGCGGA CCGGGGTAAT CCAGATTCTG
GGGGCCAAGG TCGCCGCTCT GACCGGTGAC AGCGAGTCCA GACAGCTCGG GGCGACCGTC
GGCATCGTCG GGCCGATCTC CGGGTTCATC AACAACACGG CGGCCGTCGC CATCCTGCTG
CCCATGGTGA CAGACCTCGC ACACAAGGGC CAGACCTCTC CCTCGAAGCT GTTGATGCCG
CTCTCTTTTG CCTCGATGTT CGGAGGGATG CTCACGCTGA TCGGCACCTC GACGAACATC
CTCGCCAGCG ACCTCGCCGG CCGACTGGCG ATCGAGGACC CGGCGCGGTA CGGCGACCTC
CACGCGTTCT CCATGTTCGA GTTCACACAG CTCGGCGTGA TTCTCCTGGT GGTCGGCTCT
CTGTACCTCA TGACGGTCGG TCGCTGGCTC ACTCCCGAAC GCATCAAACC GCGTGGCGAC
CTGACCCAGG AGTTCGAGAT GGCCGACTAC CTCACCGAAG TCGTCGTCCG CGAGGACTCG
CCGATCGTCG GCCAGACCGT CCACGACGCG CTGGAAGCGA CCGACCTCGA CATCGACATC
GTCCAGCTAA TCCGAGATCG CCGGACCTTC CTCGAACCGC TCGGTGCGAA GTCGATCCGG
GCCGGTGACG TGTTCGCGAT CCGGACCGAC CGGGACACGC TCGTCGAACT GCTCGACGCC
GAGGGCCTGG ACGTGGTTCC CGACGCGGTC GTCGGCGAGG CGGAACTCGA AGCGGCCGAG
GAACGACAGA ACCTCGTCGA GGTCGTGATC GCACCGGGCT CGGAGCTGGT CGGCGCATCG
CTCCGGTCGA CGAACTTCCG ACAGCGCTAC GACGCCAACG TCCTCGCGCT CCGACGCGGC
GGCGAGTTGA TCCGCCAGCG GATGGACCGG ACGACGCTTC GCGTCGGCGA CACGCTCCTG
ATTCAGGGGG CCGGCGACAG CATCGACCGC CTGAACAACA ACCCGAACTT CATCGTCGCC
CGCGAGGTCG AACGCCCCGA CTTCCGGAAG TCGAAAGTCC CCGTCGCCGT CGGTATCGTC
GCCGCCGTCG TCGCCGTCGC GGCACTCACG CCGGTCCCGA TCGTCGTCTC GGCGCTGGCC
GGCGCGCTCG GGATGATCCT CTCTGGCTGT CTGCGCTCCT CGGAGATCTA CGACGCCGTC
CAGTGGGACG TGATCTTCCT GCTCGCGGGC GTCATCCCGC TGGGGCTCGC CCTGGAGGCG
ACCGGCGGAG CGACGCTGCT GGCCGACCTC CTCGTGCTCG CCGCGCCGTC GTTCCCGCCG
CTCGTGGTGC TCGGGCTGAT GTACGTCGTC ACGGCGGTCC TGACGAACAT CATCTCGAAC
AACGCCAGCG TCGTCCTCAT GATCCCCGTC GCCGCCGAGG CCGCCGTCCA GCTCGGAGCC
AACGCCTTCG CGTTCGTGCT GGCCGTGACC TTCGCCGCCT CGACGGCCTT CATGACGCCC
GTTGGCTACC AGACGAACCT CTTCGTCTAC GGCCCCGGTG GCTATCGATT CACCGACTAT
CTGCGGGTCG GCGCGCCGCT ACAGGCGGTC TTTGCCGTCG TCACGACTCT GGGCATCGCC
TACTTCTGGG GCCTGACTCC GTGA
 
Protein sequence
MAFVFAVVAG ALVLFATERV PVDVTAIGVM VALLVVEPLT ALLADAGVLA GRLYVLHEPG 
DAVDPVAVGL SGFASPATIT VLAMFVLSAG VQRTGVIQIL GAKVAALTGD SESRQLGATV
GIVGPISGFI NNTAAVAILL PMVTDLAHKG QTSPSKLLMP LSFASMFGGM LTLIGTSTNI
LASDLAGRLA IEDPARYGDL HAFSMFEFTQ LGVILLVVGS LYLMTVGRWL TPERIKPRGD
LTQEFEMADY LTEVVVREDS PIVGQTVHDA LEATDLDIDI VQLIRDRRTF LEPLGAKSIR
AGDVFAIRTD RDTLVELLDA EGLDVVPDAV VGEAELEAAE ERQNLVEVVI APGSELVGAS
LRSTNFRQRY DANVLALRRG GELIRQRMDR TTLRVGDTLL IQGAGDSIDR LNNNPNFIVA
REVERPDFRK SKVPVAVGIV AAVVAVAALT PVPIVVSALA GALGMILSGC LRSSEIYDAV
QWDVIFLLAG VIPLGLALEA TGGATLLADL LVLAAPSFPP LVVLGLMYVV TAVLTNIISN
NASVVLMIPV AAEAAVQLGA NAFAFVLAVT FAASTAFMTP VGYQTNLFVY GPGGYRFTDY
LRVGAPLQAV FAVVTTLGIA YFWGLTP
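
The gene and protein sequences above are printed in space-separated blocks of ten characters. As a hedged illustration of how the two listings relate, the sketch below strips that layout, computes GC content, and translates the DNA codon by codon. It assumes the standard codon assignments, which translation table 11 shares with table 1 for elongation (table 11 differs in which codons may act as start codons; this gene begins with ATG, so that difference does not matter here). The function names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq_text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq_text.split()).upper()


def gc_percent(seq_text: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq_text)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq_text: str) -> str:
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    seq = clean(seq_text)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First row of the gene sequence and the matching start of the protein sequence.
first_dna_row = "ATGGCGTTCG TCTTCGCCGT CGTCGCCGGT GCGCTCGTCC TCTTCGCGAC CGAGCGCGTG"
first_protein_row = "MAFVFAVVAG ALVLFATERV"
print(translate(first_dna_row) == clean(first_protein_row))  # True
```

Passing the full gene sequence to `translate` and `gc_percent` in the same way would allow the listed protein sequence and GC content to be checked against the DNA.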