Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tgr7_2203 |
Symbol | |
ID | 7317450 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. HL-EbGR7 |
Kingdom | Bacteria |
Replicon accession | NC_011901 |
Strand | - |
Start bp | 2332579 |
End bp | 2333847 |
Gene Length | 1269 bp |
Protein Length | 422 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643617098 |
Product | aromatic hydrocarbon degradation membrane protein |
Protein accession | YP_002514270 |
Protein GI | 220935371 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG2067] Long-chain fatty acid transport protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.92236 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTATCAGT CCATCCGGCG CTCCGGCGTC TTCATCTGCC TGGCCTCCGG CCTCGCCGCC ACGGGCGTTG CCCAGGCCAC CAACGGCTAC CAGCTCATCG GCATCGGTGC CTACCAGAAG AGCCTCGGTG GCGCGGTGAC CGCCAACCCC GGCTCTGCCA TGACCGCCAT CACCAACCCC GCCGGCATGG CCCGCATCGG CAAGCGCGCG GACTTCTCCA TGGAGGCCTT CATGCCAGAG CGCTCTGTGG ATTTCACCGC CACGGGCGGC GAGACCGGCA AGAGTTCCGT GGACCTGTAC GGGATCCCGG CCATCGGCTG GACCGCACCG GTCAACGGGC GGGATGACCT GTGGTTCGGC GGCGGCATGT ACGGCACATC GGGCATGGGT GTGGACTATG CGCAAACCGA GGTCATGCCC GGTCTGTCAT TCGATGGTTA CAGCAACGTC ATGCTTTGGC AGATGGCACC GACCCTGGCC TGGCAGGTGG ATGATCGCCT CACCCTCGGC GCCTCGCTCA ACATCAATTA CCAGTCCGTG GCGTTCAAGC AGCGCTTCAT CAATGCACTG GGTGATGTGG AGAACAATTT TGACCTGTCA CGCTCTTCCA GTGCCTTTGG CATTGGGGCG TCCTTCGGTC TGCTGTTCGA TGTGAACGAC CGTCTCACTC TGGGCGCTGC CTACAAGAGC AAGCAGGTGT TCGGGGATCA TGCGTACAAC CTGGCCCAGG GCGACATCGA CATGAGCATG ATGGGCGGTG GACCGTTGCC CGCCGGCACC TACAAGCTGG GCCTGGATTT TCCCCAGCAG CTGGCTCTGG GTCTTGCCTA CCGCGCCATA CCGGCCGTGA CTGTCTCCGC TGACGTGAAG TGGATCAACT GGTCCGACAC CATGGACAAG CTGGCTGTGA CCGGTCCCGG CGGAATCTCC GTGCCCATGG ATCCGGGCTG GGACGACCAG ACCGTCTACG CCCTGGGCGT GGACTGGGCC GTGAACAACC GCCTCAACCT GCGGGCCGGT TTCAACTACG GCAAGTCACC CATCGGCAAC GAGGACGTCA GCCGCAACCT GATCCTGCCG GCCGTGGTGG AGACCCACTA CACACTGGGT GCCGGCTATA ACATGGACGG CAACTGGGAA CTGGCCTTCC ACTACATGTA CGTGCCCGAG AAGACCTTCC AGGCGCCGGC CACGGATCCC ATGCTGCCTG GCAGCAAGAT CTCCCTGTCC GAGCAGTCCT TCGGCGTGAA CCTGGGCTAT CGCTTCTGA
|
Protein sequence | MYQSIRRSGV FICLASGLAA TGVAQATNGY QLIGIGAYQK SLGGAVTANP GSAMTAITNP AGMARIGKRA DFSMEAFMPE RSVDFTATGG ETGKSSVDLY GIPAIGWTAP VNGRDDLWFG GGMYGTSGMG VDYAQTEVMP GLSFDGYSNV MLWQMAPTLA WQVDDRLTLG ASLNINYQSV AFKQRFINAL GDVENNFDLS RSSSAFGIGA SFGLLFDVND RLTLGAAYKS KQVFGDHAYN LAQGDIDMSM MGGGPLPAGT YKLGLDFPQQ LALGLAYRAI PAVTVSADVK WINWSDTMDK LAVTGPGGIS VPMDPGWDDQ TVYALGVDWA VNNRLNLRAG FNYGKSPIGN EDVSRNLILP AVVETHYTLG AGYNMDGNWE LAFHYMYVPE KTFQAPATDP MLPGSKISLS EQSFGVNLGY RF
|
| |