Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Huta_1068 |
Symbol | |
ID | 8383342 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | + |
Start bp | 1042073 |
End bp | 1043197 |
Gene Length | 1125 bp |
Protein Length | 374 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 644972133 |
Product | Capsular polysaccharide biosynthesis protein- like protein |
Protein accession | YP_003129984 |
Protein GI | 257052151 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG4421] Capsular polysaccharide biosynthesis protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTGGGA GAAAGTACCT TAGTCAGGTT CTCAACCTTT CGTCTACCCT TCTATCTACG GTTGGGAAGC CGGTGGTGAT GCAGCGTCCC GAACTGAAAC AGTATGCACG GGAACGGGGT CAGTTATTCC ACTTTGGTTC ATCTGAGACC TATTCTTTCG GCCCGCCCCA ATATCCCGAC GAAGTGCTGT ACTCCCTGCA ACGCAGATTT GGCGAATATC AAACTCCGAA GCCGTTCGTA GCCGAAGTGC GCGACGTTTC ACTAATCGGT CTCTATCCAA TTCCGTTCAA GAGGGGAAAT GCCCTTGTAG AACCTGTCGT CAGCGAACGG AACTTAATCC TCAATCTCTT CTATTCGGTT GTTGACTCAC GCCTGCAACA CCGCAGCCTG ACGTCGAAGG ATTTCGATTG CGCCTGCCTC CTATTCAACT CCCAGAGCCG GGGGGTGTTT CATTGGATGG TAGAGGACGC CTTGAGAGCC GAAGGCGTCC TGAGATACGA AGAGAAAACG GGTCGCCGTC CGACTCTTAT TATACCCCCA AACCCGAAAT CGTGGCAGAC CGAGACACTG GAACTGCTTG GATTCGAGTC GGAAGACTGG GTCGAATGGG ACGCTTTTAG AGGGAAGGTG GACCGATTGG TCGTCCCCTC CGTCCGGCGG ATATACGATG ATGGAGTGGT CTCCCCGGCA CAAACGGAGT GGTTCAGTGA GCGGATGGTG GGCGGTGCAG AAGGCCAAGT GGAGGCTTCC AAAACCTCCT CACGGGTGTA TATCTCCCGT GACGATGCCG GGAGACGTCG GTTGACGAAC GAAGACGATC TTATGGATCA ACTCGGTGAT CTCGGGTTCG AACGCCACTA TCTTGAGCGC ATGTCCACCG CCCAGATCGT GAGCCTGTTC AATAACGCCA ACATAATAGT CGCTCCTCAC GGAGCAGGCT TGACCAACAT CATGTTCGCA ACGGATGCCA GCGTCATCGA ACTGCGCCCC AACGACTCCT ATTCCTGGGT ATATTACGTA TTGAGCGAGC AAAATGGACT CGACTACTGC TACGTGATGG GTGACGACGA TAAGGAAGGA ACGGACTTCC GGGTAGAGCC TGCGAAGGTC ATTGATGCTC TGTAA
|
Protein sequence | MIGRKYLSQV LNLSSTLLST VGKPVVMQRP ELKQYARERG QLFHFGSSET YSFGPPQYPD EVLYSLQRRF GEYQTPKPFV AEVRDVSLIG LYPIPFKRGN ALVEPVVSER NLILNLFYSV VDSRLQHRSL TSKDFDCACL LFNSQSRGVF HWMVEDALRA EGVLRYEEKT GRRPTLIIPP NPKSWQTETL ELLGFESEDW VEWDAFRGKV DRLVVPSVRR IYDDGVVSPA QTEWFSERMV GGAEGQVEAS KTSSRVYISR DDAGRRRLTN EDDLMDQLGD LGFERHYLER MSTAQIVSLF NNANIIVAPH GAGLTNIMFA TDASVIELRP NDSYSWVYYV LSEQNGLDYC YVMGDDDKEG TDFRVEPAKV IDAL
|
| |