Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_1683 |
Symbol | |
ID | 4600571 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 1631173 |
End bp | 1632420 |
Gene Length | 1248 bp |
Protein Length | 415 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639774456 |
Product | major facilitator transporter |
Protein accession | YP_921081 |
Protein GI | 119720586 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.249309 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGTCGC GCCTACTCCC CGTCTTGATG GCTGGCTGGG CTATCGGCGC GATGTACGCG GGAGTGGTCA GCGGGACCTT GACGCTGATA AGGGCGGAGC TCGCGATGGG AGCGGAGGAG GCCGGGAGGG TTCTGAGCAG CTGGCTCCTC GGGATGCTCC TCGGCGCCTC CCTGATCGGC TACCTCTCCG ACAGGGTTGG GCGTAGGGCG TCCCTCGTCG CCTCCTACGC ACTCATGGGT GTTTTCACCC CTATGAGCGC CGCGGCCCGG GGATGGCTGG ACCTATCGGT GTACAGGGTG CTCGCGGGGG CCGGGAACGC GGGCTACATG GTCACGGCGA GCGTGCTCCT AGCGGAGTAC GCGCCGACCT CTTCCAGGGG CAGGCAGGTC GCAGTGCTGG AGAGCGCGTG GGCGCTCGGG TGGCTGGCAT CCCTCGTGCT ATCGCGGCTA ATCGCGCCCA GCTACGGCTG GCGCGCCGTC TTCTTAGCGT CCAGCGCGGC GCTGGCACTG CTCCCTGTTC TCTACGTAGC CGTCCCGGAG TCCCTCAGGT TCCTCGTGTC TAAGGGTAGG CTCGCCGAAG CCGAGGCCCT CGCGGGGAGG CTCGGGGTAG AGGTGCCGCG GGCCCAGCCG GCCGCTAGGG CCGGGCTGAG GGACCTCGTC GGGAGGAGGT ACCTGAGGAG GAGCGTGATG CTGTGGATCC ACTGGTTCAT GCTCGTACTC GCGTACTGGG GGATCTTCCT CTGGCTCCCC GACATACTCT ACAAGCGCGG CATACCCTTC GTCAGGAGCC TCGACTACGC GATACTGATA ACGCTAGCCC AGATACCGGG CTACCTGTCG GGAGCCTACC TCGTAGAGGT CGCCGGCAGG AAGCCCGTCC TCGCCGCCTA CATGCTGGGC GCGGGCATCG CGAGCGCCGG GCTGTGGGCC GCGAAGACAG ACCTCGAGGC CCTCGCGTGG GGCGTGCTAG TCTCGTTCTT CAACCTCGGC GCCTGGGGCG TGACCTACGC CTACACTCCG GAGCTGTACC CAACCGAGCT CAGGGGAACC GGTAGCGGGT GGGCAAACGC ATTCGGGAGG ATAGGCGGGA TCCTCGGACC CTACGTGGCC GGCGTCATGA TACAGTCGTA CGGGAACCCC TCGGCCCCCT TCGCCGTCTT CGCGCTGGCG CACGTGGTCT CCGCCGCCGT GGTCGCCGCG CTGGGCGTCG AGACGAAGGG GAGAAGCCTC GAAGAGGTCT CGGCTTAA
|
Protein sequence | MKSRLLPVLM AGWAIGAMYA GVVSGTLTLI RAELAMGAEE AGRVLSSWLL GMLLGASLIG YLSDRVGRRA SLVASYALMG VFTPMSAAAR GWLDLSVYRV LAGAGNAGYM VTASVLLAEY APTSSRGRQV AVLESAWALG WLASLVLSRL IAPSYGWRAV FLASSAALAL LPVLYVAVPE SLRFLVSKGR LAEAEALAGR LGVEVPRAQP AARAGLRDLV GRRYLRRSVM LWIHWFMLVL AYWGIFLWLP DILYKRGIPF VRSLDYAILI TLAQIPGYLS GAYLVEVAGR KPVLAAYMLG AGIASAGLWA AKTDLEALAW GVLVSFFNLG AWGVTYAYTP ELYPTELRGT GSGWANAFGR IGGILGPYVA GVMIQSYGNP SAPFAVFALA HVVSAAVVAA LGVETKGRSL EEVSA
|
| |