Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_3891 |
Symbol | |
ID | 8744519 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013744 |
Strand | + |
Start bp | 125902 |
End bp | 127890 |
Gene Length | 1989 bp |
Protein Length | 662 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 646514475 |
Product | FG-GAP repeat protein |
Protein accession | YP_003405422 |
Protein GI | 284167144 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0201045 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTAATC CAGACACGAA TGAGACGCGA CTGCGACGGT ACCGACGCGA GATTCTGAGC GGAGTGGCGG CCGGGACGGC AGCGTTCGCG GCTAGCGGTG CGGCGACTGC CAGCGGTCGG AACGGCCGAT CCACCGTTGG AAACGGAAAA GGCGACGAGA AGAACGAGTC GACGCGTCAG ATGGAGGCAC TCGGCCGGGG CCTCGTGGCC GTTCCGGCGG AAGATGGAGT GCTCGTCCGC TGGCGGCTGT TGGGAACCGA ACCCGCCGAT CTGGGATTCC ACGTGTATCG CGACGGTGAA CGGGTGAACG ACGAACCGAT CACCGAGAGC ACGAACTTCC TCGATCCCGA GGGAACGACG GACTCGACGT ACGCGGTTCG AGCGGTCGGA AACGGTCGAG CCAGCGGCAG GAAAGACGGC GGCCGGAAAC CCGGCATGTC GGAGTCTGTC GAAGTGTGGG ACAACCAGTA CACGGAAATC CCGCTGAACA AACCCGATCC GGTCGAAGGC GAGAACGGGG AGACGGTCAC GTACCACGCG AACGACGCGA GCGTAGCCGA TCTCACGGGC GACGGAACGC TCGACATCGT CCAGAAGTGG TCGCCGTCGA ACGCGAAGGA CAACGCCTTC GAGGGTCACA CAAGCGACGT TCTGATCGAC GGCTACACGA TCGAGGGCGA GCACCTCTGG CGGATCAACC TCGGGCAGAA CATTCGCGCC GGCGCACACT ACACGCCGTT TGCCGTCTAC GACTTCGACG GCGACGGGAA GGCGGAGCTG GCCGTGCGGA CGTCCGACGG CGCGACGGAC GGAACCGGGG CGGTCATCGG CGATCCCGAC GCCGACTACG CGAACGAAGC GGGACGGATC CTCGAGGGGC CGGAGTACCT CACCGTCTTC GACGGCGAAA CGGGCGAGGA ACTCGCGACG AAGGACTTCG AGCCCGCGCG CGGAGACGAC TGCGACTGGG GAGACTGCTA CGGGAATCGC GTCGATCGAT TCCTCGCCGG CGTCGCCTAC CTCGACGGCG AGCGACCGAG CATCGTCATG ACGCGGGGCT ACTACGCGAA GTCGATGCTG GCCGCCTGGG ACTTCCGCGA CGGCGACCTC GAGACCCGCT GGATCTTCGA CAGCGACGAC GGCAACGAGG AGTACGAGAG CCAGGGAAAC CACCAGCTCG CCACTGCCGA CGTCGACGGC GACGGGAAAG ACGAGATCGT CTACGGCGCG ATGGTCGTCG ACCACGACGG TACCGGACTC TACTCGACCG GATGGAACCA CGGCGACACG CTCCATGTGA GCGACTTCGT CCCGAGCCGG GAGGGACTCG AGGTCTTCCA GCCACACGAG TGGGGACCCC ACGGGGCGAC GCTCCGTGAC GCCGGCACGG GCGAGTTACT GTGGAGCGAG GAGGCGGACG CGGACGTCGG CAGAGGGATG ATCGCGGACG TCGATCCCAA CTACGAGGGC GCGGAGCTCT GGGCGTCCCA CGGGGTCGGC TTCTGGTCTA ACGAGGGCGA CAGACTCGGA CCGGCGCTCG ACTCGATCAA CTCCGCGCTC TGGTGGACGG GCGATCTGCA CCGGGAACTG CTCGACCACG ACTGGGACTC GGAAGAGAAC TACGGCGTCG GTCGACTCAC GAAGTGGAAC CCCGAGACGC AGGAACTCGA CCTGCTGAAG TCCTTCGACG GCACGCGGTC GAACAACTGG ACGAAGGGCA ACCCGTGTCT CTCGGGAGAC ATCCTCGGGG ACTGGCGCGA AGAGGTGATC TGGCGTCGAG AAGACGACGA AGCACTGCGA CTGTACGCGA CGCCCCACGA GACCGACCAC CGGCTGTACA CGCTGTTGCA CGACTCGCAG TACCGGACCG CGCTCGCGTG GCAGAACGCC GGCTACAACC AGCCGCCGTG GCCGAGTTAC TTCCTCGGAC ACGGGATGGA CGACCCGCCG AAGCCAAACA TCGACCCCGT CTCGGCGGAT CGCGACTGA
|
Protein sequence | MTNPDTNETR LRRYRREILS GVAAGTAAFA ASGAATASGR NGRSTVGNGK GDEKNESTRQ MEALGRGLVA VPAEDGVLVR WRLLGTEPAD LGFHVYRDGE RVNDEPITES TNFLDPEGTT DSTYAVRAVG NGRASGRKDG GRKPGMSESV EVWDNQYTEI PLNKPDPVEG ENGETVTYHA NDASVADLTG DGTLDIVQKW SPSNAKDNAF EGHTSDVLID GYTIEGEHLW RINLGQNIRA GAHYTPFAVY DFDGDGKAEL AVRTSDGATD GTGAVIGDPD ADYANEAGRI LEGPEYLTVF DGETGEELAT KDFEPARGDD CDWGDCYGNR VDRFLAGVAY LDGERPSIVM TRGYYAKSML AAWDFRDGDL ETRWIFDSDD GNEEYESQGN HQLATADVDG DGKDEIVYGA MVVDHDGTGL YSTGWNHGDT LHVSDFVPSR EGLEVFQPHE WGPHGATLRD AGTGELLWSE EADADVGRGM IADVDPNYEG AELWASHGVG FWSNEGDRLG PALDSINSAL WWTGDLHREL LDHDWDSEEN YGVGRLTKWN PETQELDLLK SFDGTRSNNW TKGNPCLSGD ILGDWREEVI WRREDDEALR LYATPHETDH RLYTLLHDSQ YRTALAWQNA GYNQPPWPSY FLGHGMDDPP KPNIDPVSAD RD
|
| |