Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Huta_0356 |
Symbol | |
ID | 8382620 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | - |
Start bp | 343857 |
End bp | 346892 |
Gene Length | 3036 bp |
Protein Length | 1011 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644971415 |
Product | hypothetical protein |
Protein accession | YP_003129276 |
Protein GI | 257051443 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.161093 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATAGAGC ACAGCCGACG CACGTTCCTG CGTGCATTTG GGGCAGGGGG CATCGGGGCA CTGGGGGAAA CGAATGCGGT CAATCCTGCG AGGGCAGCGA CACCGATCAC GATCGAAGGT GGCGGTGACG ATATCTGGGA CGCGGCCGAC GCGTTTCAGT ACTACTATAC GGCGGTCTCG GGCGACTTCG ACGTCATCGT GAAGGTAACC TCACAGGAAG GGACCGACGA GTGGGCGAAA GCTGGGTTGA TGGTCCGCCA GACGTTGAAC GCCGACGCCG AGACGGCGAT GGTTCGACAG ACACCTGGCC ACGAGACATC CTTCCAGTGG CGTTCGGACG ACGGTGACCA GATGGTCAGT ACGACGTCCG AGGGCGGGAG CGACGAGGGC GAGGTCGCCG GCGGCACAAT GCCTGCGACC TGGCAACGAC TCGTCCGCGA CGGCGATACC ATCGAGGCCT ACGGATCGAC GGATGGCTCG AACTGGACGC GAATCGCTGC CATCTCGCCG AGCGACATCG ACTTCGCCGA GTCGGCGTAC GTCGGGCTGG CAGTCTGTAG CGGCGAGGAA GGCACGCTCT GTACGGCAAC GTTCTCGAAC CTTTCGGGCA TCTCACTCGC GAGTAACATT GACGTCGGGG ACGTGACTGT TCCAGGCAGT GTCTCGGAAC GTACCGACAC CGACACGAGC GTCGTCGTCT CGACGGGGTC GGCGTCGACC GTCGCGAGCA ACCGGGCGAC CGTCTCGGGG TCACTTGACG ACCTCGGTGG CGCGTCCTCG GCGACCGTCT CCTTTGAGTA TCGGGAGCGC ACGGGGGGAT CTTGGCAGAA TGTGGGGACT CAGACCCTGT CCAGGACCAG GTCCTACAGC AAACCCATTA CTGGACTCGA ACCGGGCACC GCTTACGAGT TCCGGGCGGT TTCGACGGCA AGCGACGGCG ATACCGACAC CGGTTCCGTC AACACGTTCA GGACCTCGGT CGGGGGTGGC GGGTCCGGCG TCGTCACCGT CGAGGGTGGC GGTGCAGACA TCTGGAACGA GGCCGACGAG GGCCATTTCT ACTACGCGCC GGTTGAGGGC GACTTCGACG TGGCGGTCAG CGTGGACAGT CTCGACGACA CCGATGAATG GGCCAAGACC GGGTTGATGG TCCGTGAGTC GCTGTCCGCC GACGCGGTGA ACGCGATGGT CAGGAAAACC CCGGGCCACG AAACGTCGCT CCAGTGGCGT GAGGGCGCGG GTGCGGAAAC CACGAGTACG ACCGCCGCTG TCGGCGAGGA CGAACGCGAG ATCGCGGGTG GGACGATGCC TGCACAGTAT CAGCGACTGG TCCGGACCGG CGACGTCATC GAGGCCTACG CGTCCACCGA TGGGAGCGAC TGGACGCTGA TCGCCGCTCT GGATGACTCG CGGGTATCCG TCTCCGAGAC CGCTTACGTC GGGCTGGCGG TCACCAGCCA CAACTCGGGC ACGCTGTGTA GGGCCGAATT TTCGGAACTG TCGGGACTCG CGCCGACGGA CAACCGGGAC ATCGGCGATC CGGACATTTC GGGGAGCGTC ACCCACGAAC GCGACCCCGG CGATGCTGAT CCAGTGGTTT CGACGGGTTC GGTCTCGAAC GTCGGAACAC ATTCGGCGAC GCTGTCAGGT TCACTCACAG ACCTCGGTGG GGCCACGTCA GTCGAGGTCG CCTTCGAATA CCGTGCGCGC GACAGTGCCT CGTGGTCCAC GACCGCGGCG CAGACGCTGT CCTCGACGGG ATCGTTTAGC GAGACACTTT CCGGCCTCGA TCCGGATACC GCCTACGAGT TCCGGGCGAT TGGGGACGCG AGCGACGGTG ATCCGGTCAC TGGCTCGGCG ACGATGTTCA GTACATCGAC CAGCACTGAC ACCGCCGGCG GATCATACTT CGACCCGTCG GACGGGTTCG CCGATCCGGC ACCGTGGCTG GACGACAGTA CGCAAGTCAT TCGGATCCAG AACGCCACCC GCAGCGAGGT CGAACGTGCG TTCAGCACTT CTGGCCCGCG CGTGATCGTC TTCGAGACCA GCGGGACCAT CGACCTCGGC GGTGAGGAAC TGGCGATTAC CGAGGACAAG TGCTGGGTCG CGGGCCAGAC CGCGCCCTCG CCCGGGATTA CGTTCATCAA AGGCCAGCTC CAGATCGACG CAAACGACTG CGTCGTCCAG CACATTCGGT CCCGTCATGG ACCGGGGTCC GAGGGCAGCA TCCAGAGCAA CGACGCGGTC AACACGCAGG ATGACACCAC CAACAACGTC GTCGATCACG TCTCGGCCTC GTGGGGTACT GACGAGTGCA TGTCCGTCGG GTACGACACC GACCGGACGA CCTACACGAA CTGTCTCATC TACGAGGGGC TGTACGATCC CTACGACGAC GGCGCCGATC ACAACTACGG GACGCTCATC GGTGACGGCG CTGAGAACGT CGCGCTGCTG GGTAACGTCT GGGCGAAGGT CCGTGGCCGG GTCCCACGGC TCAAAAGTGG GACTCGATCC GCCCTCGCCA ACAACGTGAT GTACTTCTTC AACGAGGCGA CGAACATGGA CGGCGACACG GAAGCGTCTA TCGTCGGGAA CGTCTACGTC CCCCAGGACC TCGAGGACAC GGTGATCGAA GACGGCACCG CCTATCTGGC GGACAACGTT ACCGATCCCA CGTCGACGCC GCTGACCGGC GACACGTCCG AGCTATCGTC TCGGCCGCTG TGGCCCGATG GGCTCTCCGC GATGGACTCG AGCGACGTCG AGAACCACAA CCTGAATTAC GCTGGCGCAC GCCCGGCCGA TCGCACCGAG GACGACTCAC GGATCATCTC CGAGATCGAA ACCCGCGCTG GCGATCCAGA CACCGAATCG CCGTACGACT ACTGGATTCC TGACCACGAA GCAGTCGGCG GCTACCCGCA ACTCCCGGAG AACACCCACT CGCTGACCGT TCCCGATTCA GGCCTTCGGG AGTGGCTCGA ACAGTGGGCA CTCGCCGTCG AAGCGGACGA CGCGAGCCCA CCCTGA
|
Protein sequence | MIEHSRRTFL RAFGAGGIGA LGETNAVNPA RAATPITIEG GGDDIWDAAD AFQYYYTAVS GDFDVIVKVT SQEGTDEWAK AGLMVRQTLN ADAETAMVRQ TPGHETSFQW RSDDGDQMVS TTSEGGSDEG EVAGGTMPAT WQRLVRDGDT IEAYGSTDGS NWTRIAAISP SDIDFAESAY VGLAVCSGEE GTLCTATFSN LSGISLASNI DVGDVTVPGS VSERTDTDTS VVVSTGSAST VASNRATVSG SLDDLGGASS ATVSFEYRER TGGSWQNVGT QTLSRTRSYS KPITGLEPGT AYEFRAVSTA SDGDTDTGSV NTFRTSVGGG GSGVVTVEGG GADIWNEADE GHFYYAPVEG DFDVAVSVDS LDDTDEWAKT GLMVRESLSA DAVNAMVRKT PGHETSLQWR EGAGAETTST TAAVGEDERE IAGGTMPAQY QRLVRTGDVI EAYASTDGSD WTLIAALDDS RVSVSETAYV GLAVTSHNSG TLCRAEFSEL SGLAPTDNRD IGDPDISGSV THERDPGDAD PVVSTGSVSN VGTHSATLSG SLTDLGGATS VEVAFEYRAR DSASWSTTAA QTLSSTGSFS ETLSGLDPDT AYEFRAIGDA SDGDPVTGSA TMFSTSTSTD TAGGSYFDPS DGFADPAPWL DDSTQVIRIQ NATRSEVERA FSTSGPRVIV FETSGTIDLG GEELAITEDK CWVAGQTAPS PGITFIKGQL QIDANDCVVQ HIRSRHGPGS EGSIQSNDAV NTQDDTTNNV VDHVSASWGT DECMSVGYDT DRTTYTNCLI YEGLYDPYDD GADHNYGTLI GDGAENVALL GNVWAKVRGR VPRLKSGTRS ALANNVMYFF NEATNMDGDT EASIVGNVYV PQDLEDTVIE DGTAYLADNV TDPTSTPLTG DTSELSSRPL WPDGLSAMDS SDVENHNLNY AGARPADRTE DDSRIISEIE TRAGDPDTES PYDYWIPDHE AVGGYPQLPE NTHSLTVPDS GLREWLEQWA LAVEADDASP P
|
| |