Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Huta_2966 |
Symbol | |
ID | 8385275 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | + |
Start bp | 3054166 |
End bp | 3056067 |
Gene Length | 1902 bp |
Protein Length | 633 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 644974044 |
Product | copper-binding protein |
Protein accession | YP_003131860 |
Protein GI | 257054027 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3420] Nitrous oxidase accessory protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.58958 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGAGTCA CGCTCGTTCT GGCCGGTCTC GCTGTCGTGC TCGCGGGGGC CAGCCTGGCG TTCGCCGCTG ATCCGGGCGC GAGCGCGAGT GCAGAGCCGG TTCCGTTCTC CGATACCCTC ACGACCGGGC TTGCGGGTGT CGATGTCCAG CAAGCCTACG CCGCCGGCCA CGAGATCCCC CGCGTCGAAG TGTTCTACTC GCACTATCAG TACGTCGTGG GCTACTATGG GGTAGCGTCT GCGGTCACGG CAGTCGATCG CGAGGAGACG ACCCGGCAGT TCGGTCGCCC CATCGGGATA TACGTGAGCG ACTACTCCGG TGCCGGCCCT CGGCTCACTG ACGAGGGTTA TCTCACCGTC ACGTCCGATC CCGCAGTCGG GTGGACGCGA GCCAGCGAGG CTTCGTTCGT CGTCGGAAGC GGCGCACGGA CCCCTGGCAG TCCGGCGGTT GTCCCGTTCA GTGACGCGGC TGACGCGAGC GACTTCGCCG AGCGCTACGG AGGAGCCGTC CGCGAGTGGG GGGCGCTCGA GCCCCGCGAA GACGAACCGA GGGCATCGTG GCTACGTGAG GCGGCGAGCG ATCGCAAAGC GTGGGCCGAC CGGACCGTGC AGGATCGTCG ACCCCTCCTG GATCGGCCGG TCTCGATCAC GGTGGGTGGC GACGTTCCGA CGCTCGCCGC GGCGATCGAG GCCGCACCGC CGAACACCAC CGTTCGACTC CCGCCCGGAA CCTACGACGG AAACGTCACC ATCGGGAAGC CGCTCACGGT CCAGGGAGCA GGCGAACGGA CGCACGTCGT CGGTGATAAC GGAACCGTGC TTCGAGTGAC CGCGCCGCGG GTCGGTCTCG CCGATCTGTC GATCAGTGGC GTGGGTAAGT CGAACACGGG CACGCCGGCC AACGCCTCCG AAACAGCCGA CGACTGGGAC GAGAGCATCC GGACAACCTA CGGGTATGGC GATGCCGGGG TCGTCTTCGA CGGGGCGAAC CGATCGCTGG TGTCCCGGGT CACCATTCAG ACACCAGCGA GCGGCGCGAT CGTCCGAGAC AGCGATGGCG TCGTCGTCGA GAACCTCTCG ATCGACGGTA CCGCCGACTG GCAGGACGGA TTCATGGGCG TCCTTGCGAT GGACTCCCGG ATCGTGGTTC AGCACAGCAC GTTCACCGGC GGCCGGGACG CGGTCTACAC CCACCACGCC GATGGACTCG TGGTGCGGGA TAACCGAATG ACTGGCATGC GATTTGCGGT CCACGAGATG TACACATCCG AGACACTGGT GGCGAACAAC ACCGCCCGCG ATACCGATAT CGGGATCGTC GTCATGACGC GGCCGCGATC GAACGTGATC ATCGAAAACC GGGTTTCCGC GAGCGACGTC GGGATCTCAG TCGGCGGGAG TAGCTCACTC GCGGCCAACA ACACGCTCGT CGCCAACCGC TACGGGATGG ACCTCGGTGC CCAGCGATCG ACCTTCGCGC ACAACGTCCT GGTCGGCAAC GAGGTCGGAC TGCGAACGGG CACGATCGTT CCGACAAACC GCGTGACGGA CAACGACCTC GTGGACAACG ACCGCTACGT CGACACCGGA CGTGGGCCGG TCCGGGTGTG GACGGGCAAT CACTGGGGCA CCCTCCCGGG CCGCGATACC GATGCCGACG GTCGGATCGA TCGCGCGTTC CGCCCGACTG GACCGGTCGA TAGCGTCGTC GGGCGCTCCG ACGGGGCCGC AACGCTCGCG ACGTCGCCCG CCGTCACCAT GTTACGGCAG TTCCAGGCGG CGGTTCCGGG ATTGCGATCC GCGAACGTGA TCGACGACGA GCCGCGGACC GATCCGGTCC ATCCCGACCG GGTCGCGGCG GCACAGAATG CTACCGCCGC TGGTGCGGGG GTTTCGCCAT GA
|
Protein sequence | MRVTLVLAGL AVVLAGASLA FAADPGASAS AEPVPFSDTL TTGLAGVDVQ QAYAAGHEIP RVEVFYSHYQ YVVGYYGVAS AVTAVDREET TRQFGRPIGI YVSDYSGAGP RLTDEGYLTV TSDPAVGWTR ASEASFVVGS GARTPGSPAV VPFSDAADAS DFAERYGGAV REWGALEPRE DEPRASWLRE AASDRKAWAD RTVQDRRPLL DRPVSITVGG DVPTLAAAIE AAPPNTTVRL PPGTYDGNVT IGKPLTVQGA GERTHVVGDN GTVLRVTAPR VGLADLSISG VGKSNTGTPA NASETADDWD ESIRTTYGYG DAGVVFDGAN RSLVSRVTIQ TPASGAIVRD SDGVVVENLS IDGTADWQDG FMGVLAMDSR IVVQHSTFTG GRDAVYTHHA DGLVVRDNRM TGMRFAVHEM YTSETLVANN TARDTDIGIV VMTRPRSNVI IENRVSASDV GISVGGSSSL AANNTLVANR YGMDLGAQRS TFAHNVLVGN EVGLRTGTIV PTNRVTDNDL VDNDRYVDTG RGPVRVWTGN HWGTLPGRDT DADGRIDRAF RPTGPVDSVV GRSDGAATLA TSPAVTMLRQ FQAAVPGLRS ANVIDDEPRT DPVHPDRVAA AQNATAAGAG VSP
|
| |