Gene Information |
Locus tag | Tneu_0644 |
Symbol | |
ID | 6165220 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermoproteus neutrophilus V24Sta |
Kingdom | Archaea |
Replicon accession | NC_010525 |
Strand | + |
Start bp | 577226 |
End bp | 578314 |
Gene Length | 1089 bp |
Protein Length | 362 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 641667797 |
Product | major facilitator transporter |
Protein accession | YP_001794029 |
Protein GI | 171185110 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
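The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS length includes the terminal stop codon; both assumptions match the reported values but are not stated in the record itself. The function names are illustrative only.

```python
# Values copied from the Gene Information table above.
START_BP = 577226
END_BP = 578314


def gene_length(start_bp, end_bp):
    """Feature length, assuming 1-based inclusive start/end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len):
    """Residue count for an unspliced CDS: number of codons minus the stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # prints 1089
print(protein_length(length_bp))  # prints 362
```

Both results agree with the "Gene Length" and "Protein Length" fields in the table.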
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.592278 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 61 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGAACT TATTTCTCCT CGTGCTACTG TCTAGGGGAG TGTATGCGTT GATGTGGTTC TACATAGCGC CTCTCCTCCC AGCCATGTTG AGGGACTACG GCGTTGATCC AGCCTATGCG GGGCTCCTCC CCGCCGCCTT CGTCGTGGGG GCCGCCTTGA TGCAACTACC GGCGAGCTAC TTAGGGGCTA GGTACGGCCA CAACAAGGTA GCCGGCTTTG GCATGGTGCT GTTCGGCGTC TCTTCCGTGC TCATGGCCCT CGCCCCCAGC TGGGGCTGGG CTCTGGCCTT CAGAGCGCTG GGGGGAGTCG GCGCCGGCCT CTTCTTCTCC ACCGCGGGCG CCGTCTTGGT GGCTCTAAGT CCAAGCTCGG TAGGCTCCGC CCTGGGTTGG TACGGCGCCT CCTTTAACCT GGGCGGCTTC GTAGGCTTCT ACTGGGGGGC CGTCGCAAGC GTTCTTGGCT GGCGCCCCGC TCTTGCGCTT CCTGGGCTAC TCTCAGCGGC GTTGGGCGTG GCACTTGTTA GACACGGCGC TGTTAAAAGC AGACCATCTC TCGAATGGAG GGCGGCGGCC TACGGCTTGG CGTCTTTCCC CTTCTGGGGG GCGGTCTACG CCGCGAACAA CCTCGCCGCC ACGTGGCTCC ACCTCTACCG CGGCGTTGGG GAGTGGGCGG CGGGGGCAGT CTCCTCGGCC TCAATGCTGT CGGGGCTCCT GGGCGGGTTG ACGGGCAGGC TCTACGACGC CGGCGGCAGA GGCCGGTCGG CCCTACTTGC GGCGGCTTCT ACTGCGTCTA TCGCGTTTCT GGCCATGCCC TGGGCTCCGC TGGAGGCGGT TCCGTTGCTC GCCTTCCTCT ACGGCTTGTC CTTCTCGACG TATATGACCG GGGTCTATGC GGCGTCGTCT AGGGCTGCGC AGAACCCCGC TTCGGCTCTT GCGGTTATAA ACGTCACAAA CATGGCGCTT GGGCTCCACG TGAGCTACCT ATTCAGCTGG CTTATGGCGC AGTCCCCGGA CTACCCGTGG CTTTTCCTCG CCTCTCTCGC CCTGGCCTCG GCCGTTGCCA CATACGCCGT GGTGGTAAGG GCTATATAG
|
Protein sequence | MLNLFLLVLL SRGVYALMWF YIAPLLPAML RDYGVDPAYA GLLPAAFVVG AALMQLPASY LGARYGHNKV AGFGMVLFGV SSVLMALAPS WGWALAFRAL GGVGAGLFFS TAGAVLVALS PSSVGSALGW YGASFNLGGF VGFYWGAVAS VLGWRPALAL PGLLSAALGV ALVRHGAVKS RPSLEWRAAA YGLASFPFWG AVYAANNLAA TWLHLYRGVG EWAAGAVSSA SMLSGLLGGL TGRLYDAGGR GRSALLAAAS TASIAFLAMP WAPLEAVPLL AFLYGLSFST YMTGVYAASS RAAQNPASAL AVINVTNMAL GLHVSYLFSW LMAQSPDYPW LFLASLALAS AVATYAVVVR AI
|
| |
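The relationship between the gene sequence and the protein sequence above can be illustrated with a minimal sketch. It assumes the sequences are read with the 10-letter spacing removed, and it uses the amino-acid assignments of translation table 11, which are the same as those of the standard genetic code (table 11 differs in its permitted start codons, which does not matter here because this gene begins with ATG). The helper names `clean`, `translate`, and `gc_percent` are hypothetical, not part of any database tool.

```python
# Codon table built from the compact NCBI-style layout (base order T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the whitespace used to group the sequence into 10-letter blocks."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in a sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three 10-letter blocks of the gene sequence above.
print(translate("ATGCTGAACT TATTTCTCCT CGTGCTACTG"))  # prints MLNLFLLVLL
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the "Protein sequence" and "GC content" fields to be checked in the same way.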