Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tgr7_3059 |
Symbol | |
ID | 7315988 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. HL-EbGR7 |
Kingdom | Bacteria |
Replicon accession | NC_011901 |
Strand | + |
Start bp | 3205221 |
End bp | 3206549 |
Gene Length | 1329 bp |
Protein Length | 442 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 643617957 |
Product | aromatic hydrocarbon degradation membrane protein |
Protein accession | YP_002515115 |
Protein GI | 220936216 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG2067] Long-chain fatty acid transport protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.292585 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGCATC GCCTGAACAA GACCACCCTG GCCTGGGCCG TGGGTCTGGC CCTGGGCGCC GCCCCCGGCG CCCTGCTGGC CGCCGGCTTC TACATCCAGG AGCAGAGCGT CAGCGGTCTG GGCCGCGCCT TCGCCGGTGA GGCGGCCATC GCCTCGGATG CCAGTACCAT CTATTTCAAT CCGGCGGGCA TGACCAGGCT CAAGGCGCCC GAGTTTCAGG CGGCGGTGCA TCTGCTGGTG CCGAAGTCGA CGGTGACCAA TAGCGATTCC AGGGCGATCA CCCCGGGTAC AGGTGGAAAT CCGCTGCCCT ATGAAGGCGG AAGCGGGGGC AACCCCTACG AGCCGAGCCC CGTTCCCAAC CTGTTTTACG CGCGCCCGCT GAACGAGCAG ACCTGGTTCG GCCTGGGGAT CACGGCGCCT TTCGGGCTGG CGAATGAGTA CGACCGGAAT TTCTTTGCCC GCTATGACTC CACCGAGACC TCACTCAAGG TCATCGACAT CGCACCGAGC ATCGCATACC AGTTGAACGA CCGGGTTTCC ATCGGCGGCG GCATCAATAT CCAGTATGCC GATGCGACCC TGAAAAACGC GCTTCCTGAT CCGACGCGCG TCGGGGGGCC TTCCGTGGAT ACCGATGGTG AGTTCGCACT GAGCGGTGAC AGTTGGGATT ACGGTTTCAA TATCGGGCTT CTCGTGGACC TGAGTGACGA CACCCGGATA GGCGTCCATT ATCGCCAGGG AATCACTCAC ACCCTCGAGG GTACTGCGAC AACCCGTCTG CCTTTCGGCC TCGGTGGCAC GTCGGTATCG CAGGGTGGCG AGGCAGACCT CAAGTTGCCC GACATGGTGT CAGTGGGCTT GTCGCACAGA TTCACGGATC GCTGGACCGG TCTGGCCCAG TACACCTGGT TCAACTGGTC CAACTTCGAT GAGATCGCCG TGAAGCTGGC ATCGGGTGAT CATCCGGATC CGGTCCGACA GAACTACAAG AACAGCTGGG CACTGGCCCT GGGGGCTGAA TACCAGCTGG ATGATCGCTG GACGCTGCGC GGAGGCATCC AGTACGACAC GACACCAACG CAGGATGGGT TCCGAACCAC CCGCACGCCC GATGGCGATC GCACCTGGTT CTCTGCGGGT GCCAGCTATG AACGGGATAG CCGTCTCGGC TTCGACCTTG CCTACACCTA CATTGATGTG GCCAAGGAGT CGCTGAATCT GGATCGTGAG TTCTACGATG GTTTGCCTTT TGCCTCGTCT GTCCAGTTGA GAGGCACCAC CGAAGGCCAC GTCCACATCC TTGCCGCGGC CCTGCGCTAC CGGTTCTGA
|
Protein sequence | MQHRLNKTTL AWAVGLALGA APGALLAAGF YIQEQSVSGL GRAFAGEAAI ASDASTIYFN PAGMTRLKAP EFQAAVHLLV PKSTVTNSDS RAITPGTGGN PLPYEGGSGG NPYEPSPVPN LFYARPLNEQ TWFGLGITAP FGLANEYDRN FFARYDSTET SLKVIDIAPS IAYQLNDRVS IGGGINIQYA DATLKNALPD PTRVGGPSVD TDGEFALSGD SWDYGFNIGL LVDLSDDTRI GVHYRQGITH TLEGTATTRL PFGLGGTSVS QGGEADLKLP DMVSVGLSHR FTDRWTGLAQ YTWFNWSNFD EIAVKLASGD HPDPVRQNYK NSWALALGAE YQLDDRWTLR GGIQYDTTPT QDGFRTTRTP DGDRTWFSAG ASYERDSRLG FDLAYTYIDV AKESLNLDRE FYDGLPFASS VQLRGTTEGH VHILAAALRY RF
|
| |