Gene Huta_1046 details


Gene Information

Locus tag: Huta_1046
Symbol:
ID: 8383320
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhabdus utahensis DSM 12940
Kingdom: Archaea
Replicon accession: NC_013158
Strand:
Start bp: 1010918
End bp: 1012114
Gene Length: 1197 bp
Protein Length: 398 aa
Translation table: 11
GC content: 68%
IMG OID: 644972111
Product: major facilitator superfamily MFS_1
Protein accession: YP_003129962
Protein GI: 257052129
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
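
The coordinates and lengths listed above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration that assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(1010918, 1012114)
print(length_bp, protein_length(length_bp))  # 1197 398
```

With the values from this record, the result matches the listed gene length (1197 bp) and protein length (398 aa).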


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCGAGC GGACGCGGCT CGCGGTGGTC GTCTGGGCAG TGCTGGTCTC GCAGGTACTG 
CTCTACCCTG GCCTCGAAGA CACCGTGATC GCACTGGGTG GCGGCGACCA CCTCCTCGCC
GGAACCTGGT TTCTCGTTGC GGAGTTCGGA GCCTTCGTCG GGATGGCCGT CCTCTGGGGA
TTGCTCAGTG ACGCACTCGG TCGACGGACG CCACTGGTCG TCGCCGGGGC GGCCGGTGGA
GCGGTGAGTT ACCTCGCCGT CGCGGCGGTA CCAGGTCTCG GCGGTAGCTT CGACGTGGTG
CTCGTGCTAC GGGTGATCGG CGGCGGGTTC ACGATCGGGG CGTTCTCGTT GTCGATCACG
AAGCTGATGG ATCTCGCCGG AGGGCACGGC AGAAACATGG GAGCAGCGGG GACGGCGATC
GGCTTCGGCG CGGCGCTTGG CTCGATCGTC GGCGGGGGAC TCGCGACGCT GGATCCGCTC
GCCCCACTCT ACGCCGGGGC AGTTGTCCTC GCAGGGGCGG CACTGCTGGC AGCGACGGTC
CCGGACCGGG GCGTCGGCGG TGGGCTAGCC CTCGAGACCG TCTTCGCTCG CATCCGTACC
CGTCCAGCAC TGCTCGTCCC CTACGCGTTC GGGTACATCG ACCGCCTGAC AGCCGGCTTC
TTCGCGCTGG CCGGCGTAGC GTACTTCCGT GACGCCTTCG ACGTTGGGCC CGCACTGGCC
GGGGTGACAC TCGCGCTGTT TTTCCTCCCG TTCGCCGCGC TCCAGTACCC GATGGGGAAC
CTCTCGGATC GGATCGGCCG GTTCGTGCCC GTCGTCGCCG GATCGCTCTG TTACGGAGTG
GCGATTATCG CTGTCGGGCT CGCCCCGGTG TACGCACTCG CCGCGCTCCT CATGGTCGTC
GTCGGCATCT GTGGCGCGGC GGTCTCACCG GCAACGATGG CGCTCGTGAC TGACCTCGTT
CCGGCGAGCG AACGCGGCGC GGCCATGGGC GGGTTCAACG TCTTTGGCAG TCTAGGCATG
CTGACCGGCT TCCTCCTCGG TGGCGTCGTT TCCGGCGTCT TTGGCTATCT CCCGGCGTTC
GTCGCGGTCG GCGGCCTCGA AATTGCGATC GCCCTGCTCG CGGCACGGGC CGTCTTTCGA
ATGACGGCCG GCCAGCCGGG CGCTGAATGG TTTCGACATG CGATTCGGGA TGGATGA
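
The gene sequence above is laid out in space-separated blocks of 10 bases. A rough sketch of how one might strip that layout and recompute the GC fraction is given below; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example uses only the first line of the sequence, so its GC value is not expected to equal the whole-gene figure reported in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block spaces, line breaks) and upper-case."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGACCGAGC GGACGCGGCT CGCGGTGGTC GTCTGGGCAG TGCTGGTCTC GCAGGTACTG"
seq = clean_sequence(first_line)
print(len(seq))                       # 60
print(round(100 * gc_fraction(seq)))  # GC percentage of this line only
```

Passing the full 1197-base sequence through the same two functions would give a value to compare against the GC content listed in the Gene Information section.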
 
Protein sequence
MTERTRLAVV VWAVLVSQVL LYPGLEDTVI ALGGGDHLLA GTWFLVAEFG AFVGMAVLWG 
LLSDALGRRT PLVVAGAAGG AVSYLAVAAV PGLGGSFDVV LVLRVIGGGF TIGAFSLSIT
KLMDLAGGHG RNMGAAGTAI GFGAALGSIV GGGLATLDPL APLYAGAVVL AGAALLAATV
PDRGVGGGLA LETVFARIRT RPALLVPYAF GYIDRLTAGF FALAGVAYFR DAFDVGPALA
GVTLALFFLP FAALQYPMGN LSDRIGRFVP VVAGSLCYGV AIIAVGLAPV YALAALLMVV
VGICGAAVSP ATMALVTDLV PASERGAAMG GFNVFGSLGM LTGFLLGGVV SGVFGYLPAF
VAVGGLEIAI ALLAARAVFR MTAGQPGAEW FRHAIRDG