Gene Information |
Locus tag | Huta_1305 |
Symbol | |
ID | 8383582 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | + |
Start bp | 1277663 |
End bp | 1279744 |
Gene Length | 2082 bp |
Protein Length | 693 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 644972366 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003130214 |
Protein GI | 257052381 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
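The coordinate and length fields above are consistent with one another, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon (the gene sequence in this record ends in TGA). The function names are hypothetical and are not part of any database tool.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a complete CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values copied from the Huta_1305 record above.
length = gene_length_bp(1277663, 1279744)
print(length)                     # 2082
print(protein_length_aa(length))  # 693
```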
| |
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGACAG ATGACGACGT TCACCCGGCA GCATCACGGG AGGACAAAAA AGTGTCCAGT GTGGAGCAAT CCGACCCGGC AAACCAACAA GAGACCGACT CATCACGAGA CAGAGCCCTG GGGACAGATA TGCAGACGAC GCGACGATTG TTCCTCAGTC TCGCGGGGGT CGGAGTGAGT GCCGGGGCAG GATGCACCAC GCTGGGGCCG GACGATGATG GGAACACGGA GACACCCACG AGCACGCCGA CAGGAACTGA AACGGACACA CCGACGACAA CAGACACGAA CACACCGACA GAGACCGACA CGCCGACGGA GACTGACACG GACACACCGA CTGAAACCGA GTCCGACACA CCGACTGAAA CCGAAGCGGA AGAGACGACT GAGGAAGGAA GAGAGACGGA GGAAGAGACG AACGAGGAAG CGGACGATGA GACGGTTTAC GACGTCACGG TTGACTCCGT CGACGCCCGT TCGATGGACA TGCTGGACTG GAACTCCCAG TACGCGGGCT GGCCATACAT TTGGGGGCGG TGGGCTGCCT ACGAGCGCTT TGTGCAGTAT GATCTGGAGA ACAACGTCTG GATCCCCCGC CTCATCGATA ACTGGTCGAT TGACGGGACG ACAATCACGC TGGACATCCG TGATCCACAT AAATACGAGG ACGGGGACGA GGTGACGGCC GAGGATGTCA AGGCCAACAT CGTGATGAAT CTCGCGACCG GGGCACCGTT CTCGGAGATC TTCGAATCGT TCGACGAACC CGACGACAAA ACCCTCGTGA TCGAGACGAA GAAGTCTGTC AACAAGACGA TCCTCGAGTT TTCGATCTTC TCGCAGTTGC AACAGGCCAA AATGGCACCA CCGTACGACG AGTTCTATCA GCGCTTTTGG GTAGACGGCG AAGAGGGCGT CGGCAGAGAT ATTCAGGCCA TGGAACCCAA CGAGCCCCAC TACGTCTCGG GTATCTTCGG CCAAATGAGG AAGGACTCCG AGCAGTACCT CATGGAACGC AACCCCGAGC ACCCGGACGC CGATAACGTC AACTTCGAGC GGTACCGGTT CCGGACCTAC CCGGGGAACC AGGCCAAGTG GGACGCGATG TTAGCCGATG AGGTGGACAC GGTGATGAGT GCGTTCACCC CGGCGAATGT CAAAGCGGAG CTCCCGGATC ACTGGCAGGA GTACAACTTC CCCGGCAACT GGGGCGTCGG GCTACTGCCC CAGCACGATC CGGACACCGC GCCCCACATT TCAAAGCGAC CGGTTCGCCA GGCCATCACG CATGCGATTA GCCGCGAAGC TGTCCGGAGC GCCGGTGGTC CGCGGGTCAA GACGGCATTC CCGACCCCAG CCGCGATCTC CGCGAGCGTG CAGGATGAAT GGATTGACGT CGATGGGACG TTCGGCCCAA TGCAGGGAGG CAAAGCGAAA GCCGCCGAAC GCATGCAAGA CGCCGGCTAC GAGAAAAACG CCAACGGTAT CTGGGCGATG GACGGCGACA CCGTTTCCTT CGACATCTCA GTGCCGGGTA GCTGGAGTGA CTGGGTGACG GTGATCCAGG CAACGGTTTC CCAGCTAACC CAGGCCGGGT TCGACGCACG ACTGAACCGC GTCAACAACA TCTATAGGAT TGTCGCCAGC GGCGAGTTCA AGATGGCTGC CCGCCCGTGG TCGTCGGGTT ACGCACGGTC GTCGTTCCCG TATTTCCCGC TGGACTGGGT GTTCGGGCGA GCCTACGGCA ACGCTCACAG CTACCCCGGT GGCGAGGAAG GCACCGAGAT CACGGTTCCC 
GCCATGGACG GAGACGGCAC CATGTCGGTC GATGTCCAGC AGCACCTCGC GGATCTCTCG ATGGCTCGTG GCGACGACGT CAAGCCGATC GTCGAGGAAC TCGCGTGGGT TTCCCATCAG GATCTACCGA TGATACCGAT CGTCGAAAAG CTGGAGCAGT CGTTCATCAG CACGAACAAT CTCTCTGCGC CCGATCCTGA CTCTTTGGCC GGCAACGTCA AATGGCCGTG CTTTTACGCT CCACGGGAGG GCGAGATGCA GTGGCAGGGC GGCCAAGAGT GA
Protein sequence | MATDDDVHPA ASREDKKVSS VEQSDPANQQ ETDSSRDRAL GTDMQTTRRL FLSLAGVGVS AGAGCTTLGP DDDGNTETPT STPTGTETDT PTTTDTNTPT ETDTPTETDT DTPTETESDT PTETEAEETT EEGRETEEET NEEADDETVY DVTVDSVDAR SMDMLDWNSQ YAGWPYIWGR WAAYERFVQY DLENNVWIPR LIDNWSIDGT TITLDIRDPH KYEDGDEVTA EDVKANIVMN LATGAPFSEI FESFDEPDDK TLVIETKKSV NKTILEFSIF SQLQQAKMAP PYDEFYQRFW VDGEEGVGRD IQAMEPNEPH YVSGIFGQMR KDSEQYLMER NPEHPDADNV NFERYRFRTY PGNQAKWDAM LADEVDTVMS AFTPANVKAE LPDHWQEYNF PGNWGVGLLP QHDPDTAPHI SKRPVRQAIT HAISREAVRS AGGPRVKTAF PTPAAISASV QDEWIDVDGT FGPMQGGKAK AAERMQDAGY EKNANGIWAM DGDTVSFDIS VPGSWSDWVT VIQATVSQLT QAGFDARLNR VNNIYRIVAS GEFKMAARPW SSGYARSSFP YFPLDWVFGR AYGNAHSYPG GEEGTEITVP AMDGDGTMSV DVQQHLADLS MARGDDVKPI VEELAWVSHQ DLPMIPIVEK LEQSFISTNN LSAPDPDSLA GNVKWPCFYA PREGEMQWQG GQE
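Both sequences are displayed in space-separated blocks of 10 characters, so they need to be joined before any analysis. The sketch below shows one way to do that and to compute a GC fraction; it is illustrated on only the first two blocks of the gene sequence, and the helper names are hypothetical. Running it on the full gene sequence would be one way to check the reported GC content of 60%, though the method used to produce that figure is not stated in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two 10-base blocks of the gene sequence, as a small illustration.
prefix = clean_sequence("ATGGCGACAG ATGACGACGT")
print(len(prefix))          # 20
print(gc_fraction(prefix))  # 0.55
```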