Gene Information |
Locus tag | Tgr7_0049 |
Symbol | |
ID | 7316699 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. HL-EbGR7 |
Kingdom | Bacteria |
Replicon accession | NC_011901 |
Strand | + |
Start bp | 47995 |
End bp | 48990 |
Gene Length | 996 bp |
Protein Length | 331 aa |
Translation table | 11 |
GC content | 78% |
IMG OID | 643614939 |
Product | protein of unknown function DUF58 |
Protein accession | YP_002512140 |
Protein GI | 220933241 |
COG category | [R] General function prediction only |
COG ID | [COG1721] Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) |
TIGRFAM ID | |
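The coordinate and length fields above are consistent with one another, and a short sketch can check this. It assumes 1-based, inclusive coordinates (an assumption that matches the values listed here) and an unspliced CDS ending in a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acids encoded by an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(47995, 48990)
print(length, protein_length(length))  # 996 331
```

The result agrees with the Gene Length (996 bp) and Protein Length (331 aa) fields.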
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.799483 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCGCCT GGCTGCAAGC ACGCTGGCGT CGCCGCCCCG GGGCAGGCGC CGATCCGGCC GTCCGGCCCC TGCTGGCGGC GGACGAACTG CAGACCCTGT TGCGGCTCGC GGAACAGGCC CCGCCCCGCC CCCCCGGCGG CGAGTTGCCC GGCGGCGAAC GGCCCTCGCC CCTGCGCGGC CACGGCCTGG ACTTCCAGGA GCTGCGCCGC TACCAGGCCG GTGATGACGT GCGCGCCATG GACTGGCGCA CCACCGCCCG CACCGGCACC CCCCACATCC GCCTGCATCA CCTGGAGCCC CGCCCCTGCC TGTACCTGCT GGTGGACCTG GGGGCCAGCA TGCGCTTCGG CACCCGGGTG CGGCTCAAGG CGGCCCAGGC GGCGCGCATC GCCATTCACC AGGCCCTGGC CACCGTGGGC GAGGGCGGCA GCGTGGGCGC GGCGGTGCTG GGGGAGGATC ACCCCCGCCA GTGGCCGGTG CGCGGCGGTC GGGGCCATGC CCTGGCACTG GCCCGGGGGC TGGCCGCGCC CTGCCCGCCC CTGGAATCTG AGGCGACGGC GACAGGTGCC GCCCAGTTGG CCGCCCTGCT GGCTGGCCTG CCCCGGTTGC TGCCGGCCGG TGCCGGGATC CTGGTGCTCA GCGACCTGCG CTGGCTGGAC CTTGAGCAGG CGGCGGCCCT GGGGCGTCTG GCCGCCGGTG GACGGACCGT GACCGTGGTG CGCATCACCG ACACCGCCGA GCGCAGCCTG CCCCCCATGG GCCGGGTGCC GTTCCGGCTG CCCGGTTCGG ACCGGCCCCT GTGGCTGGAC ACTGCCCGGC CGGGGGTGCG CGCCGCCTTC GACGCCCACG CCGAGGCCCG CGCCCGGGAC TGCCGGCGCT GGCTGAGCGG CGCCGGCGTG GGGCTGCTGG AACTGGGCGC CGAGACCCCG GCGCGGATCT GGGTCCGGCA ACTGGCCGGA CCGGGGGTGC CACGGGCGCC TCGGCGGCAC CCGTGA
|
Protein sequence | MRAWLQARWR RRPGAGADPA VRPLLAADEL QTLLRLAEQA PPRPPGGELP GGERPSPLRG HGLDFQELRR YQAGDDVRAM DWRTTARTGT PHIRLHHLEP RPCLYLLVDL GASMRFGTRV RLKAAQAARI AIHQALATVG EGGSVGAAVL GEDHPRQWPV RGGRGHALAL ARGLAAPCPP LESEATATGA AQLAALLAGL PRLLPAGAGI LVLSDLRWLD LEQAAALGRL AAGGRTVTVV RITDTAERSL PPMGRVPFRL PGSDRPLWLD TARPGVRAAF DAHAEARARD CRRWLSGAGV GLLELGAETP ARIWVRQLAG PGVPRAPRRH P
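The sequences above are displayed in space-separated blocks of 10 characters, so the spaces need to be removed before any analysis. The sketch below shows one way to do that, then compute GC fraction and translate a coding sequence. It is a minimal illustration under stated assumptions: the helper names are hypothetical, and the codon table is built by hand from the standard amino acid assignments, which translation table 11 shares with the standard code for sense codons (alternative start codons are not handled).

```python
from itertools import product

# Codon -> amino acid map in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the display spacing (and any other whitespace) and uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence in this record.
prefix = clean("ATGCGCGCCT GGCTGCAAGC ACGCTGGCGT")
print(translate(prefix))  # MRAWLQARWR
```

The translated prefix matches the first block of the protein sequence listed in this record. Passing the full gene sequence through `clean` and `translate` in the same way would allow the whole protein sequence to be checked.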