Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Huta_2119 |
Symbol | |
ID | 8384413 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | - |
Start bp | 2157122 |
End bp | 2159311 |
Gene Length | 2190 bp |
Protein Length | 729 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644973188 |
Product | hypothetical protein |
Protein accession | YP_003131019 |
Protein GI | 257053186 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGGAGAC CCAATAATCC GCTGGATGAA CTGTCCAGAC GTAAATACGT TCAGGCAATT GTCGCAGCCA GCGTCGCCGG TGCCGCTGGT TGTTCCGACG ACTCGGGGGA AAATACCGAC GGACCCGGAG GCAACGGAAA CGGCAATGGC AATGGCAACG GCAATGGTAA TGGCAACGGC AACGGCAATG GCAACGGCAA TGGCAACGGG GATGGCCAAC AGCCTGTCGA CGAGGTGTTC ACCGTCGTCG ACAACAACAT TCCCGAGCAG GCCAACATGT CCACCTGGCA GACGGGGGAC CGCTCCACCG GCATCAACTG GATGACGGAG ATCACGTCGG CGCGGACCCA GGGGCTCAAC ATCATCATCG ACGGGCACAC CTACGAGATG CCCCACGTCG ATGGCGTCGA GGAGGTCGAG ATCCCGACGC TACTCACGGA CTATACGGTC GAGCCCCCGT ATGACATGTA CAACACCTAC AACCAGGACA TGTACTACTG GGACGAGGAA ACGAACATCG ACGCCGAGGC CCGGGTGACA CACGACTACG TGTACTACGG GTACGATGGG AACATCTTCG CCTCGGACGT CTCGTTCGCC AGCGAGGCTG TCGATCAGTT CCGGCGTCAC TTCTGGTACG AGGACGGCAC GCGTCGACTT GAGCCGCGCA ACTCCAACGC GCCGACGGGC GAGTCCGACC TGCCCGACGA CGGGACCAAC GCGTACGTCA TGGAGACCGA CACGGTCGAA GGGGTCGCCC AGACCGAGGC CCAGCCGATG CACCCGGCGT TCACCACGCC GTACGCCGAG CGGTACGCGG ACGCCGCGGA CTCGGACGCC GTGACGACGA TCACCGACGA GCTCGAAGGC GACCGCGTCT CCATGCAGCG CTACGCCGAC GAGGGCTGGG GTGGCAGCGT CTACAAGATC CCGTCCGCCG ACGCCATCTC CGGGACCGAC GCAACCCTGA CCCTGCGGGA CAACCATCCC AACGAGCACA TCAACATCCC AACACTCCGC ATCCGCTTTG CGACCGAAGA CCGTGCACAG GTCATGCGGG CGCGTGGTCA GATCGATCTC GAGAACGGCG TCCTGCCGAT GTCGACGGGG AACATCAACC GGAACTCCGT GCCTGACTAC ACCCAGGAGA TCGCCCGCTG GCTCCAGATC GGTGGGGACC AGCTCATCTT CAACTTCAAC AACAAACACC TCGGTCGCCT GTGGGTCCGG CGTGCGGCTG TCGCCGCGAT CGACTGGAAC CAGGTCGGGG CCAACGGATG GGGTCCCGAA GTCTCGGAAG CCAACCCCCA CCACGTCGGC ACACTCGAGT CCGTCGCCGA AGGGAACTTC TCCGACGAGT TCCTCGATCA GATGTACTCC TACCCGATCG AGGCCGACCA GGAACTGGCC GGCCAGTGGA TGCGCCGGGC GGGCTACGAG AAACAGGGCG GCTCGTGGGT CGGCCCGGAC GGCGACCGAG TCGACTGGAA CCTGTCGTTC AACTCCGGCG AAGCCTCCTG GATCGGCGGC GTCCAGACCG TGATGGCCAA CCTGGAGGAC TTCGGCCTGG GCGTTACGCT CGACGGCAAC GCCTGGTCGA CCTACACCTC GCGGCTCGAC TCGCCGAGCT ACGACTACGA CATCGCGCTG CAGTGGGCGA ACTTCCAGAC CATCACCGGC GCCTACGACT ACCAGGGCGC ATGGTGGTCG AACCCGCTGC TCAAGGGTAG TCCTGACGAC GCCCCGTACT ACGACATCAC GGATGACGAC GAAGTCGACG GGCTGGGTCG GCCGGTCCAG GAAGCCCCGA TCCCCTCGGA GCCCGGCTCG ATCGAAGCGC CGGATGGCGC TTACAAGATC CCGGACAGTA TTCCGGGCGG CTCGGAGACC TACGACATGA AGGAAGTCGT CGAAGGTCTG CGTGAGCCCG GGATCACCAT CGAGCAGGTT CGCGAGCGCG CGCAAGTTCC GGCCAGGTAC TACAACTACT ACCTGCCGAA CTTCGTGTTC CACTCCTACT ACAACGGCGT CTTCGGCAAC GTCCGGGATC ACAACTTCCC GCCGGCGGAC CACGACGTCT GGGGCTCGAC CAAGGAGTAC GGATCGCGCA ACTACAGCGT CCTCACCGGG ATGCCACAGC TGAAGTACGA CTCGGACTAC CCCGACCCGC CCGCGGATCA CCGAAGCTAA
|
Protein sequence | MRRPNNPLDE LSRRKYVQAI VAASVAGAAG CSDDSGENTD GPGGNGNGNG NGNGNGNGNG NGNGNGNGNG DGQQPVDEVF TVVDNNIPEQ ANMSTWQTGD RSTGINWMTE ITSARTQGLN IIIDGHTYEM PHVDGVEEVE IPTLLTDYTV EPPYDMYNTY NQDMYYWDEE TNIDAEARVT HDYVYYGYDG NIFASDVSFA SEAVDQFRRH FWYEDGTRRL EPRNSNAPTG ESDLPDDGTN AYVMETDTVE GVAQTEAQPM HPAFTTPYAE RYADAADSDA VTTITDELEG DRVSMQRYAD EGWGGSVYKI PSADAISGTD ATLTLRDNHP NEHINIPTLR IRFATEDRAQ VMRARGQIDL ENGVLPMSTG NINRNSVPDY TQEIARWLQI GGDQLIFNFN NKHLGRLWVR RAAVAAIDWN QVGANGWGPE VSEANPHHVG TLESVAEGNF SDEFLDQMYS YPIEADQELA GQWMRRAGYE KQGGSWVGPD GDRVDWNLSF NSGEASWIGG VQTVMANLED FGLGVTLDGN AWSTYTSRLD SPSYDYDIAL QWANFQTITG AYDYQGAWWS NPLLKGSPDD APYYDITDDD EVDGLGRPVQ EAPIPSEPGS IEAPDGAYKI PDSIPGGSET YDMKEVVEGL REPGITIEQV RERAQVPARY YNYYLPNFVF HSYYNGVFGN VRDHNFPPAD HDVWGSTKEY GSRNYSVLTG MPQLKYDSDY PDPPADHRS
|
| |