Gene Huta_2119 details

Gene Information

Locus tag: Huta_2119
Symbol:
ID: 8384413
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhabdus utahensis DSM 12940
Kingdom: Archaea
Replicon accession: NC_013158
Strand:
Start bp: 2157122
End bp: 2159311
Gene Length: 2190 bp
Protein Length: 729 aa
Translation table: 11
GC content: 64%
IMG OID: 644973188
Product: hypothetical protein
Protein accession: YP_003131019
Protein GI: 257053186
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
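
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record; it assumes the start and end positions are 1-based and inclusive, and that the final codon of this unspliced CDS is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 2157122
end_bp = 2159311

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: unspliced CDS whose last codon is a stop codon,
# so the protein has one residue fewer than the codon count.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the result is 2190 bp with no leftover bases and 729 aa, which agrees with the Gene Length and Protein Length fields.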


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGGAGAC CCAATAATCC GCTGGATGAA CTGTCCAGAC GTAAATACGT TCAGGCAATT 
GTCGCAGCCA GCGTCGCCGG TGCCGCTGGT TGTTCCGACG ACTCGGGGGA AAATACCGAC
GGACCCGGAG GCAACGGAAA CGGCAATGGC AATGGCAACG GCAATGGTAA TGGCAACGGC
AACGGCAATG GCAACGGCAA TGGCAACGGG GATGGCCAAC AGCCTGTCGA CGAGGTGTTC
ACCGTCGTCG ACAACAACAT TCCCGAGCAG GCCAACATGT CCACCTGGCA GACGGGGGAC
CGCTCCACCG GCATCAACTG GATGACGGAG ATCACGTCGG CGCGGACCCA GGGGCTCAAC
ATCATCATCG ACGGGCACAC CTACGAGATG CCCCACGTCG ATGGCGTCGA GGAGGTCGAG
ATCCCGACGC TACTCACGGA CTATACGGTC GAGCCCCCGT ATGACATGTA CAACACCTAC
AACCAGGACA TGTACTACTG GGACGAGGAA ACGAACATCG ACGCCGAGGC CCGGGTGACA
CACGACTACG TGTACTACGG GTACGATGGG AACATCTTCG CCTCGGACGT CTCGTTCGCC
AGCGAGGCTG TCGATCAGTT CCGGCGTCAC TTCTGGTACG AGGACGGCAC GCGTCGACTT
GAGCCGCGCA ACTCCAACGC GCCGACGGGC GAGTCCGACC TGCCCGACGA CGGGACCAAC
GCGTACGTCA TGGAGACCGA CACGGTCGAA GGGGTCGCCC AGACCGAGGC CCAGCCGATG
CACCCGGCGT TCACCACGCC GTACGCCGAG CGGTACGCGG ACGCCGCGGA CTCGGACGCC
GTGACGACGA TCACCGACGA GCTCGAAGGC GACCGCGTCT CCATGCAGCG CTACGCCGAC
GAGGGCTGGG GTGGCAGCGT CTACAAGATC CCGTCCGCCG ACGCCATCTC CGGGACCGAC
GCAACCCTGA CCCTGCGGGA CAACCATCCC AACGAGCACA TCAACATCCC AACACTCCGC
ATCCGCTTTG CGACCGAAGA CCGTGCACAG GTCATGCGGG CGCGTGGTCA GATCGATCTC
GAGAACGGCG TCCTGCCGAT GTCGACGGGG AACATCAACC GGAACTCCGT GCCTGACTAC
ACCCAGGAGA TCGCCCGCTG GCTCCAGATC GGTGGGGACC AGCTCATCTT CAACTTCAAC
AACAAACACC TCGGTCGCCT GTGGGTCCGG CGTGCGGCTG TCGCCGCGAT CGACTGGAAC
CAGGTCGGGG CCAACGGATG GGGTCCCGAA GTCTCGGAAG CCAACCCCCA CCACGTCGGC
ACACTCGAGT CCGTCGCCGA AGGGAACTTC TCCGACGAGT TCCTCGATCA GATGTACTCC
TACCCGATCG AGGCCGACCA GGAACTGGCC GGCCAGTGGA TGCGCCGGGC GGGCTACGAG
AAACAGGGCG GCTCGTGGGT CGGCCCGGAC GGCGACCGAG TCGACTGGAA CCTGTCGTTC
AACTCCGGCG AAGCCTCCTG GATCGGCGGC GTCCAGACCG TGATGGCCAA CCTGGAGGAC
TTCGGCCTGG GCGTTACGCT CGACGGCAAC GCCTGGTCGA CCTACACCTC GCGGCTCGAC
TCGCCGAGCT ACGACTACGA CATCGCGCTG CAGTGGGCGA ACTTCCAGAC CATCACCGGC
GCCTACGACT ACCAGGGCGC ATGGTGGTCG AACCCGCTGC TCAAGGGTAG TCCTGACGAC
GCCCCGTACT ACGACATCAC GGATGACGAC GAAGTCGACG GGCTGGGTCG GCCGGTCCAG
GAAGCCCCGA TCCCCTCGGA GCCCGGCTCG ATCGAAGCGC CGGATGGCGC TTACAAGATC
CCGGACAGTA TTCCGGGCGG CTCGGAGACC TACGACATGA AGGAAGTCGT CGAAGGTCTG
CGTGAGCCCG GGATCACCAT CGAGCAGGTT CGCGAGCGCG CGCAAGTTCC GGCCAGGTAC
TACAACTACT ACCTGCCGAA CTTCGTGTTC CACTCCTACT ACAACGGCGT CTTCGGCAAC
GTCCGGGATC ACAACTTCCC GCCGGCGGAC CACGACGTCT GGGGCTCGAC CAAGGAGTAC
GGATCGCGCA ACTACAGCGT CCTCACCGGG ATGCCACAGC TGAAGTACGA CTCGGACTAC
CCCGACCCGC CCGCGGATCA CCGAAGCTAA
 
Protein sequence
MRRPNNPLDE LSRRKYVQAI VAASVAGAAG CSDDSGENTD GPGGNGNGNG NGNGNGNGNG 
NGNGNGNGNG DGQQPVDEVF TVVDNNIPEQ ANMSTWQTGD RSTGINWMTE ITSARTQGLN
IIIDGHTYEM PHVDGVEEVE IPTLLTDYTV EPPYDMYNTY NQDMYYWDEE TNIDAEARVT
HDYVYYGYDG NIFASDVSFA SEAVDQFRRH FWYEDGTRRL EPRNSNAPTG ESDLPDDGTN
AYVMETDTVE GVAQTEAQPM HPAFTTPYAE RYADAADSDA VTTITDELEG DRVSMQRYAD
EGWGGSVYKI PSADAISGTD ATLTLRDNHP NEHINIPTLR IRFATEDRAQ VMRARGQIDL
ENGVLPMSTG NINRNSVPDY TQEIARWLQI GGDQLIFNFN NKHLGRLWVR RAAVAAIDWN
QVGANGWGPE VSEANPHHVG TLESVAEGNF SDEFLDQMYS YPIEADQELA GQWMRRAGYE
KQGGSWVGPD GDRVDWNLSF NSGEASWIGG VQTVMANLED FGLGVTLDGN AWSTYTSRLD
SPSYDYDIAL QWANFQTITG AYDYQGAWWS NPLLKGSPDD APYYDITDDD EVDGLGRPVQ
EAPIPSEPGS IEAPDGAYKI PDSIPGGSET YDMKEVVEGL REPGITIEQV RERAQVPARY
YNYYLPNFVF HSYYNGVFGN VRDHNFPPAD HDVWGSTKEY GSRNYSVLTG MPQLKYDSDY
PDPPADHRS