Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tgr7_1013 |
Symbol | |
ID | 7316590 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. HL-EbGR7 |
Kingdom | Bacteria |
Replicon accession | NC_011901 |
Strand | + |
Start bp | 1095015 |
End bp | 1095953 |
Gene Length | 939 bp |
Protein Length | 312 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643615900 |
Product | protein of unknown function DUF403 |
Protein accession | YP_002513088 |
Protein GI | 220934189 |
COG category | [S] Function unknown |
COG ID | [COG2307] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0471288 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTTTCAC GCGTAGCCGA ACGCCTGTAC TGGATGGCCC GCTACATCGA GCGTGCCGAG AACACCGCGC GCATGGTGAT GGCCTTCTCC CACCTGGCCC TGGACATGCC CCGCTCGGTG CAGCTGTCCT GGAAGAGCAT GGTCTCCACC ACCGGCGGTG ATGCCGCCTT CGATCATCAC TACCAGCGGG ATGATGAGCG CAACTGCGTC AAGTTCCTGC TCGCCGACCT GGACAATCCC GGCTCCATCA TGAGTTCGCT CTCAGGGGCC CGGGAGAACG TGCGCACCAC CCGGGACCTG GTGCCCTCCG AGGCCTGGGA ATCGGTCAAC GAGCTGTACC TGTTCGGCAA GGCCAACGCG GAGGCCGGCG TGGGCAAGCG CGGCCGCTAT GCCTTCCTCT CGGAGATCAT CAGCCGTTGC CAGCAGATCA CCGGCCTGCT GGCGGGCACC ATGAGCCACG ACCATGCCTA TGATTTCATC CGCGCGGGGC GCAACCTGGA GCGGGCCGAC ATGAGTTCGC GCCTGCTGGA CGTGGCTGGC ATCGGCCTGC TCAGCAAGGA CGAGGACATG CAGCCCTTCG AGAACCTGCT GTGGATCAAC ATCCTGAAAT CCGTGAGCGG CTTCCAGATG TACCGCCAGC ACGTGCGCCG GCGCGTAAAC GGTGCCCTGG TGATCCAGTT CCTGCTCCAG GACAAGCAGT TCCCCCGCGC CATCGCTCAT GCCCTGGGCG AGGTGGAGAC CAGTCTCGCC AACCTGCCGC GCAACGAACT GCCGCTGCGC TTCATCGCCC AGGTCAAGCG ACACGTGCAG GACGCCAAGG TGGAGACCCT GCTCAAGCCC GAGGAACTGC ACCAGTTCCT GGACCAGATC CAGATGGAGA TGGCCGAGAC CCATGCCAGC ATCGCGGACA ACTGGTTCCG ACTGGACCGG GCGGCGTGA
|
Protein sequence | MLSRVAERLY WMARYIERAE NTARMVMAFS HLALDMPRSV QLSWKSMVST TGGDAAFDHH YQRDDERNCV KFLLADLDNP GSIMSSLSGA RENVRTTRDL VPSEAWESVN ELYLFGKANA EAGVGKRGRY AFLSEIISRC QQITGLLAGT MSHDHAYDFI RAGRNLERAD MSSRLLDVAG IGLLSKDEDM QPFENLLWIN ILKSVSGFQM YRQHVRRRVN GALVIQFLLQ DKQFPRAIAH ALGEVETSLA NLPRNELPLR FIAQVKRHVQ DAKVETLLKP EELHQFLDQI QMEMAETHAS IADNWFRLDR AA
|
| |