Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Huta_0481 |
Symbol | |
ID | 8382748 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | + |
Start bp | 477922 |
End bp | 479049 |
Gene Length | 1128 bp |
Protein Length | 375 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644971543 |
Product | protein of unknown function DUF201 |
Protein accession | YP_003129401 |
Protein GI | 257051568 |
COG category | [E] Amino acid transport and metabolism [F] Nucleotide transport and metabolism |
COG ID | [COG0458] Carbamoylphosphate synthase large subunit (split gene in MJ) |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGAAT CATCGACAGC CACACGAACC AACAGAAGCG ACGTGACCGT TCTCATGACC GGGGCCGGCG CACCAGGTGC CTGGGGCATC ATCAGGAGCC TTCGATTGAC CGAGGAACGC GACGTCCGCA TCGTCGGCGT CGACATGGAT CCCGACGCCT ACGGGTTCTC GCTCGTGGAC GCGTCCTACC GGGTTCCAGC AGGGACTGAC GACGGGTACG TGACCCGGAT CGCCGACATC GTCACGAAGG AAGACGTCGA CGTCGTCCTG CCGCTGACGA CCGACGAGCT ACAGCCGCTG GCGACCCACC GCGAGGACGT TCCCGCGAGC GTCATGGTTT CCGCCGCGGA GATACTTTCG ATCGCGAACG ACAAGGCCGC ACTGTACGCG TTTCTCGACA AGCACGGTTT CGACTCCGCG CCGCGGTTCT GCCGGGTTGA GGACGAGGCG TCGTTCGTCG ATGCGGTGCA AGCGCTCGAA TATCCCGACA ACCCGGTCTG TTTCAAGCCG GTCGTCGGGA GCGGCATGCG GGGCTTCCGG GTGCTCGACG AGGATGCCGA TCAGCTAACC CAGTTGCTCG ACGAGAAGCC GAGTGCGACG ACCACGACGT TCGAGGAGAT CCGTCCGGTA CTGGCCGAGG CCGATCCCTT CCCCAAACTC GTCGTCATGG AGTACCTCCC GGGTGAGGAG TACAGCGTCG ACGCGCTCGC GATGGGTGAT TCCGTCGGCC CCGTGGTCCC GCGCTCGCGG GCCAAGACAC GGGCCGGCAT CTCGTTTCAG GGCGTCGTCG AAGAGAACGA TCGCCTCATC GAGGAAGCGG GCGAGATATG CCAGAAGCTC GGCCTGGAGT ACAACGTCAA CCTCCAGTTC AAGTACGACG CCGACGGGAA TCCGAAGCTC ATCGAGATCA ATCCCCGTGT CGCCGGGACG ATCATCATGT GTGTCGGTGC CGGGGTGAAC CTGCCGTATC TGGGTCTCAA GCACGCACTC GAAGAGCCGA TTCCGTCGGT CGATATCGAG TGGGAAACGT CGATGACCCG ATACTGGAAC GAGGTGTTCC GATCACCCGG TGGCGACTCC TTCCACGTTG ATCCGGACGG GGTCACGAAC AGGATGGCGA CCCGATGA
|
Protein sequence | MSESSTATRT NRSDVTVLMT GAGAPGAWGI IRSLRLTEER DVRIVGVDMD PDAYGFSLVD ASYRVPAGTD DGYVTRIADI VTKEDVDVVL PLTTDELQPL ATHREDVPAS VMVSAAEILS IANDKAALYA FLDKHGFDSA PRFCRVEDEA SFVDAVQALE YPDNPVCFKP VVGSGMRGFR VLDEDADQLT QLLDEKPSAT TTTFEEIRPV LAEADPFPKL VVMEYLPGEE YSVDALAMGD SVGPVVPRSR AKTRAGISFQ GVVEENDRLI EEAGEICQKL GLEYNVNLQF KYDADGNPKL IEINPRVAGT IIMCVGAGVN LPYLGLKHAL EEPIPSVDIE WETSMTRYWN EVFRSPGGDS FHVDPDGVTN RMATR
|
| |