Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tneu_1076 |
Symbol | |
ID | 6166133 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermoproteus neutrophilus V24Sta |
Kingdom | Archaea |
Replicon accession | NC_010525 |
Strand | - |
Start bp | 961409 |
End bp | 962557 |
Gene Length | 1149 bp |
Protein Length | 382 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641668228 |
Product | major facilitator transporter |
Protein accession | YP_001794453 |
Protein GI | 171185534 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.197988 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.0589295 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTTAAAT GGGGAGGGGT AGCGCGGCCC GTGGTGCCCG AGCTAAGGCT CATCGCGGTG GCAGGCGTGG GCTGGCTCTT CGACGCCATG GACGTCCTCC TCCTCTCGTA CATACTGGTG GCGTCTGCGG CGGAGCTGGG GCTGGGGGTC TGGGAGAAGT CCGCCGTCGT GCTGGCCAAC AACCTCGGCA TGTTGATAGG GGCAACCGCC TTCGGCAGAC TGTCGGATAG GCTGGGGAGG AGGGCCGTCT TCACCGCCAC GCTCCTCCTC TACAGCCTCG CCACGTCCGC CACAGCCCTC GTAAAAAACG GGTGGGAGCT GGCGGCGGTC CGCCTCATCG CAGGGCTGGG CCTAGGCGGC GAGCTCCCCG TCGTCGCCTC GTACGTGTCT GAGCTCTCGC CGCCGGAGAG GAGGGGGAGA AACGTGGTGG TTCTAGAGAG CTTCTGGTCC CTAGGCGCGT TGGCGGCAGC CGCCGTGGCC TACTTCCTCT TCCCCCGCCT CGGCTGGAGA ACCTCCCTGC TCCTCCTCGG CCTCACCGCG TTATACGCCG CGGTGATAAG GGCCGCCCTC CCGGAGCACA AGCCAGCCCC CCGGGGGGCC GCCCCGGTTG AGGCTAGGCG GCTCTACCCC GTGTGGTACA TATGGCTGGC CCTGGCGTTT GGATACTACG GAGTCTTCCT CTGGCTACCC ACCATCCTCG TCAGAGAGAG GGGGCTTGCC GAGGTGCAGA CCTACCAGTT CATGCTAATT ACGACGGTTG CCCAGATCCC CGGCTACCTC ACCGCCGCCT ACCTAGTGGA GAGGATAGGG AGGAGGCCCA CCGCCGCTGT CTTCTTCCTC GGCTCCGCCG CCTCGGCGGC CGCCCTCATA TACAGCGCAG ACCTGCCCCA GCTCTACGCG TCGGCCATCG CGCTGAACTT CTTCAACCTA GGCGCCTGGG GGGTGGTATA CGCCTACACG CCCGAGCTCT TCCCAGAACA CGTCAGAGGC CTCGCCGTCG GGACCGCCGG CTCCGCCGCG CGGGTGGGGA TGATCCTCGG CCCCTGGCTC TACCCCGCGG CCGGCCTCTA CGCCCTAGCG GCGGTGCCCC TCCTCTGGCT CGCCGTCCCC GCCGCCGTGT ACGCCCTGCC GGAGACCAAG AGGCGCTAG
|
Protein sequence | MLKWGGVARP VVPELRLIAV AGVGWLFDAM DVLLLSYILV ASAAELGLGV WEKSAVVLAN NLGMLIGATA FGRLSDRLGR RAVFTATLLL YSLATSATAL VKNGWELAAV RLIAGLGLGG ELPVVASYVS ELSPPERRGR NVVVLESFWS LGALAAAAVA YFLFPRLGWR TSLLLLGLTA LYAAVIRAAL PEHKPAPRGA APVEARRLYP VWYIWLALAF GYYGVFLWLP TILVRERGLA EVQTYQFMLI TTVAQIPGYL TAAYLVERIG RRPTAAVFFL GSAASAAALI YSADLPQLYA SAIALNFFNL GAWGVVYAYT PELFPEHVRG LAVGTAGSAA RVGMILGPWL YPAAGLYALA AVPLLWLAVP AAVYALPETK RR
|
| |