Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Huta_2786 |
Symbol | |
ID | 8385093 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | + |
Start bp | 2860074 |
End bp | 2862425 |
Gene Length | 2352 bp |
Protein Length | 783 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 644973862 |
Product | hypothetical protein |
Protein accession | YP_003131680 |
Protein GI | 257053847 |
COG category | [S] Function unknown |
COG ID | [COG5427] Uncharacterized membrane protein |
TIGRFAM ID | [TIGR03662] Chlor_Arch_YYY domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.520143 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGGAGC TCGGACTCGT CGCACTGTGG CTGGGGCTGT ACCTCCTGTT GCTGTTTGTC GGTCTGACGA TCTCGCGTCG AATACTGCCC GGGCTGGCAG ACGAAGGAGC CGGGATCGCC ATCCCACTCA CGCTGACCAT CGTCTGGGTG GTCACTTTCC TGGTTGGCCA CGCCTCGATC GTTCTCGGGC TCTGGGCGGG GCTGCTCGTG CTTGGACTCG TCGCGGGTGC CGCCCGGTAC CGACAGGGGG GCGTGGACAC CCGGATCTAC CTGGAGACGG CTGCAGTGTT TTCGATCGCG TTCTGCTTCG TCGTGGCGAT CAGGGCTGTC GACCCGGCCG TCCACGCTAT CGCCGGTGAG AAATTTCTCG ACTTCGGGCT GCTCCAGACG TTGCTGCGGG CCGAAAGCCT GCCCCCCGAG GACATGTGGT TCGCCGGCGA ACCCGTCCAG TACTACTACG GCGGCCACCT CCTCGCGTCG CTGCTTGCCC GCATGACGGA CACTGCCGGT CGGTTCGCCT ACAATCTCAA CGTCGCCGGT TTTTACGCCA TGTTCGTCAC GGCTGCCTAC GGCCTCGCGA AGAACGTCGC TGCCGATCGC GATCTTCCGG CCCGCCTCGC CGGCGTGTTC GCGGCCGTCT TCGTCGGGTT CGCGAGCAAC CTCGTCTCGG CGGTCCACGT GATCGTTTTG ATGCTCCCCG ACGACGTCGC AACGCAACTC GCGGCGGTCG CCGGCCACGA ACTCGACGGC CTGGGGACCG GATTCGCCAA CTTCAGTTAC TGGACGGCGA GCCGGGTCAT CTCGGGGACG ATCAACGAGT TCCCGCTGTT CGCGTGGCTC AACGGCGACC TCCATCCTCA CATGTTGAGT CCGGCGTTCC TGTTGCTCGC GGCGACGCTC CTCTATGGCT ATTACCGAAC GCCAGCCAGC CACCGCGCTC GGCGGATCGC GCTGCTTGTC GCCCTCGGTC CGATCGCGGC GCAACTCGCC GCCAGCAACA CCTGGTCGTT CCCCTCGATC GGTGGACTCA CGATCCTGAC CGTCGCACTG GCACCGGCGA GTCCGGTGAC CCTCCTGCCG TCGTCAGTCC AGGCCCGCCT CGACGCTCGG AGCCGGATGC GACGGGAAGT GTGGCGACAC GCTCTCGCCG GGGTCCTGGG AATGTCGGTT CTCCTGCTCG GCGGGCTCCT GTCGCTGCCG TTCTGGCTGC AATCCGTGAG TAGCCAACAG CACCTGGCAC TGTTCCCGGA GCGGAGTGGA CTCGGCCCAC TCCTGCTGGT TCACGGGGCC TTCCTGGCGG CGTTCGTCTG CTATTACGTC CGATACACGC GTCCGCACGG GACACGACGA ATCCGGAGAT TGCTGTTGGT GGGTGTGATC GGTCTCTGTG CCGTCTTGGC ACCGTTCGAT CTGTCCGCAA TCGCTCTGTT CGGGCCACTG ATCCTGTTGG GCTGGTTCCT CCGTCGCGTC GAGGCGTACG AGGAGGTCCC GACGCCGGGA TACGAGACCG TCTTGATCGT GGCCGGGGCT GGGCTCGTGG TGCTCGTCGA GTTCGTCTAC GTGAGTGAAC AGGCCGGTCC GGGACGGATG AACACCGTGT TCAAGTTCTA CGCGCAGGTG TGGGCGCTCT GGTCGGTTGC CATCGGTGTC GTCCTCGTCG AACTCCTTGC TGATCAGCGT CCATCCCTCG GTCTCTCCAG TGACGACTGG CACCGTGGGC TTCGGGTCCT CGTTGCCGTG TTGCTGGTCT CGACGTCGAT CTACGGTGCG TTGGCCCTCT CCCAGCACTT CTCGGGATCG TCGGGGACAG CACCACCGGA CGAACCGACG CTTGACGCGC TTGCGTTTCT CGAGACACAC CATCCGAACG AAGCTCCGGC GATCCACTGG CTGAACGACA ACGTCGACGG GCAACCCACC CTGCTGTCCC GACCAACAAC CGGGTCGCTG AGTGGGTATT GTACCGCCGA CCGGGACCTC CCGAACGGCG TCACGCCGTG GGATTGGGAC GTCTACCACT GGGGCAACGC CCCGTCGACG ATGACCGGGA TTCCGACGGT CGCCGGCTGG AGTCACGAGG TCGGCTATCG CGACGCCTCG GTGTACTGTG ACCGCGTCCA GGACACCATC CAGCTGTTCA CTGGTGACCC CACAAATCAG CGCCAACTGC TCGCCCGATA CGATGTCACG TACGTCTACG TCGGCCCGCT CGAACGGGGG GCCTTCCCGG AGATAACGAT CCAGGAACTC GACGTCGTGA CGGTCGAAAA GCAGTGGGAC GACGTGACGA TCTACCGCGT CAACCAGTCG ATGCTCGGCT CGAACTATCT CAGGCCGACT CGACGACAGT AA
|
Protein sequence | MMELGLVALW LGLYLLLLFV GLTISRRILP GLADEGAGIA IPLTLTIVWV VTFLVGHASI VLGLWAGLLV LGLVAGAARY RQGGVDTRIY LETAAVFSIA FCFVVAIRAV DPAVHAIAGE KFLDFGLLQT LLRAESLPPE DMWFAGEPVQ YYYGGHLLAS LLARMTDTAG RFAYNLNVAG FYAMFVTAAY GLAKNVAADR DLPARLAGVF AAVFVGFASN LVSAVHVIVL MLPDDVATQL AAVAGHELDG LGTGFANFSY WTASRVISGT INEFPLFAWL NGDLHPHMLS PAFLLLAATL LYGYYRTPAS HRARRIALLV ALGPIAAQLA ASNTWSFPSI GGLTILTVAL APASPVTLLP SSVQARLDAR SRMRREVWRH ALAGVLGMSV LLLGGLLSLP FWLQSVSSQQ HLALFPERSG LGPLLLVHGA FLAAFVCYYV RYTRPHGTRR IRRLLLVGVI GLCAVLAPFD LSAIALFGPL ILLGWFLRRV EAYEEVPTPG YETVLIVAGA GLVVLVEFVY VSEQAGPGRM NTVFKFYAQV WALWSVAIGV VLVELLADQR PSLGLSSDDW HRGLRVLVAV LLVSTSIYGA LALSQHFSGS SGTAPPDEPT LDALAFLETH HPNEAPAIHW LNDNVDGQPT LLSRPTTGSL SGYCTADRDL PNGVTPWDWD VYHWGNAPST MTGIPTVAGW SHEVGYRDAS VYCDRVQDTI QLFTGDPTNQ RQLLARYDVT YVYVGPLERG AFPEITIQEL DVVTVEKQWD DVTIYRVNQS MLGSNYLRPT RRQ
|
| |