Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tgr7_1012 |
Symbol | |
ID | 7316589 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. HL-EbGR7 |
Kingdom | Bacteria |
Replicon accession | NC_011901 |
Strand | + |
Start bp | 1093563 |
End bp | 1095011 |
Gene Length | 1449 bp |
Protein Length | 482 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643615899 |
Product | protein of unknown function DUF404 |
Protein accession | YP_002513087 |
Protein GI | 220934188 |
COG category | [S] Function unknown |
COG ID | [COG2308] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0541998 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCATCG ACTGGAAGAG TTACGACCCC AACGGCTTCT ACGACGAACT GATCAGTTCG CCCGGCTATC CCCGCGCAGC AGCCCGCACA CTGACCTCCT ATCTGCGCTC CCTGAGCGAC GAGGAGATCC ACGAGCGCAA GGCGGCCGCG GAGCATGCCA TCGTGGAAAT GGGCATCACC TTCACGGTGT ATACGGAAGG CGGCAACATC GACCGCGCCT GGCCCTTCGA CATTATCCCG CGCATCATCC CGCGCAAGGA ATGGATGCGC GTGGAGGAGG GCCTCAAGCA GCGGGTCACC GCGCTCAACC TGTTCATCAA CGATCTCTAC AACGAGCAGA AGATCGTCAA GGACAAGGTG TTCCCCAAGG AGGTGCTGGC CCAGTCCAAA AATTTCCGTG AGCAGTGCGT CGGCATCCAG CCGGCCCACG GGGTCTGGGC GCACATCTGC GGCTCCGACC TGGTGCGCGA CAAGGACGGC ACCCTATACG TGCTGGAGGA CAACCTGCGC GTGCCTTCGG GGGTCTCCTA TATGCTGGAG AACCGCCAGG TCACCAAGCG GGTGTTCCCG GAACTGTTCG AGAACTACTC CATCCTGCCG GTGGACGAAT ACCCATCGCA GCTGTTCGAC ATGCTGGCCT CCCTGTCGCC CCGGCCCCTG GATTATCCGG AGGTGGTGGT GCTCACCCCG GGCATCTACA ACTCCGCCTA CTTCGAGCAC TCCTTCCTGG CCCAGCGCAT GGGCGCGGAA CTGGTGGAGG GCAGTGACCT GCTGGTGGGT GAGGACGACT GCGTGTACAT GCGCACCATC GGCGGCCTGG AACGGGTGGA TGTGATCTAC CGTCGCGTCG ATGACCTGTT TCTGGACCCG GAGGCCTTCA ACCCCGACTC CATGCTGGGG GTGCCCGGCC TGATGCGCGC CTGGCGCAAG GGCAACGTGG CACTGGCCAA TGCGCCCGGG GCGGGCGTGG CCGACGACAA GGTGGTCTAC GCCTTCGTGC CGGAGATCAT CCGCTACTAC CTGGACCAGG ATCCCATCAT CCCCAACGTG CCCACTTTCC GCTGCATGTA CCCCGACGAG CGCGAGCACG TGCTCAAGCA CCTGGACGAA CTGGTGATCA AACCCGCCAA CGAGTCCGGC GGTTACGGCA TGCTGATCGG CCCCCATGCC AGCAAGCGCC AGCGCGCGGA GTTCGCCGAC CTGATCAAGA AGGATCCGCG CAATTACATC GCCCAGCCCA CCCTGGCCAT CTCCACCACG CCGACCCTGT GCGACGGACA CCTGGAGCCC CGTCACGTGG ACCTGCGCCC CTTCATCCTC CAGGGGGTGC GCACCGACGT GACCGCCGGC GGACTGACCC GGGTGGCCCT GCGCAAGGGT TCCCTGGTGG TCAATTCCTC CCAGGGCGGG GGCAGCAAGG ACACCTGGAT CGTGGATACG GAGTCATAA
|
Protein sequence | MSIDWKSYDP NGFYDELISS PGYPRAAART LTSYLRSLSD EEIHERKAAA EHAIVEMGIT FTVYTEGGNI DRAWPFDIIP RIIPRKEWMR VEEGLKQRVT ALNLFINDLY NEQKIVKDKV FPKEVLAQSK NFREQCVGIQ PAHGVWAHIC GSDLVRDKDG TLYVLEDNLR VPSGVSYMLE NRQVTKRVFP ELFENYSILP VDEYPSQLFD MLASLSPRPL DYPEVVVLTP GIYNSAYFEH SFLAQRMGAE LVEGSDLLVG EDDCVYMRTI GGLERVDVIY RRVDDLFLDP EAFNPDSMLG VPGLMRAWRK GNVALANAPG AGVADDKVVY AFVPEIIRYY LDQDPIIPNV PTFRCMYPDE REHVLKHLDE LVIKPANESG GYGMLIGPHA SKRQRAEFAD LIKKDPRNYI AQPTLAISTT PTLCDGHLEP RHVDLRPFIL QGVRTDVTAG GLTRVALRKG SLVVNSSQGG GSKDTWIVDT ES
|
| |