Gene Tpen_1043 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_1043 
Symbol 
ID4600948 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp982898 
End bp984220 
Gene Length1323 bp 
Protein Length440 aa 
Translation table11 
GC content62% 
IMG OID639773821 
Productmajor facilitator transporter 
Protein accessionYP_920446 
Protein GI119719951 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCTACACG AGGCGCGCGC AGCGTGTAAA AAAGTTTTAA TTCCCGAGGG TTTTTCTCGT 
TTCCCCGTGG GTGGATGGCT GGGGAGGGTT AGGGAGTCCC TTGGAAGGGT TCCTAGGAAC
ATCAAGGTGC TAGCGCTAGG CTGGCTCGTC TGGTCGCCGG TACAGGCGAT GGCGGGGCCG
TACACGCAGC TCTACGTTAG CCGCCTGGGG GCATCTCCCG AGGACATCTC GCTGGTGCAG
TCAGCAACGC AAGTGGCGAA CGCGCTTTCA AGGATAGTCG GCGGCTTCCT CTCGGACAGG
TACGGTAGGA AGAGGGTTCT GTGGGTGGGG ACGTTCCTCG TAGCGCTCGC ATACCTCTTG
ATGAGCGTTG CAACCGACTG GCGGAGCTAC GCCATTGCCA GCGTGCTCAA CGGCTTCGCC
CTCTTTTACC AGCCTGCGCT CGAAGGGATA CAGGCGGACT CGGTTCCCTT GCACCTCAGG
GGGAGGATGA ACGCACTCCT ACACCTGGTG CCGGGCCTGG CATCCTCGCT GTCCCCGCTC
GGCGGAGCCG CATTGGTAAA TGCTTACGGG CTCGTCGGCG GCGTGAGGGT GATATTCTTC
CTCTCGTTCG TTACCGGCGT AGCCATCGCG GTTGCAAGGC TACTATGGAT AGAGGAGACC
CTCGAGCCGA GGGGCTCCGG GGTAAACATG CTCGAATCCT ACGTGGACGC CTTGAGGCAC
GTGTCCCGCG ATGTCTACAC GCTGATAACG CTGGACACCC TCTTCAACCT CGTGGGCGCG
ATGTCCTTCC TCTCGAACTA CTACATGTAC TACTACCTCG GGGTCGACAA GAGCGAGCTA
GCCATGCTGG CCTCCCTGGG AAGCCTCGTG AACCTAGGCC TGCTGATACC CGCCGGGAGG
GCTGTGGACA CCAGGGGGAG GAACTTCTCG ATAACCCTGG GCTTCCTGAT GGGCACGCTG
AGCCAGCTAT TCTTCGTCCT CTCCCCGCCA TCCTCCAGCT TCACGCTACC GGCACTCATC
GTCTCCACGC TTTTCGGAGC GGTGGGAGGA GCCTTCTACG GGCTCGCGTA CTCGTCTCTC
AGGGCGGACC TCGTAGCGAA AGAGTACAGG GGGAGGATCT ACGCCCTCTG GGGGCTCGCG
CCGGCGGCGA GCTGGAGCCT AGGCGCGTAC ATAGGGGGGT GGATGTACAG CAACCTGGGG
CCCCAAACCC CCTTCGTTGC CAGCTTCATG CTCAGAGTCC TGCTCACCCC GCTCGCCCTC
ACCCTCTTCG GGAAGCTCAC GAGGAAAGTC GACCTAGCGT TGCGAAACGT CGAGGGAAGC
TAA
 
Protein sequence
MLHEARAACK KVLIPEGFSR FPVGGWLGRV RESLGRVPRN IKVLALGWLV WSPVQAMAGP 
YTQLYVSRLG ASPEDISLVQ SATQVANALS RIVGGFLSDR YGRKRVLWVG TFLVALAYLL
MSVATDWRSY AIASVLNGFA LFYQPALEGI QADSVPLHLR GRMNALLHLV PGLASSLSPL
GGAALVNAYG LVGGVRVIFF LSFVTGVAIA VARLLWIEET LEPRGSGVNM LESYVDALRH
VSRDVYTLIT LDTLFNLVGA MSFLSNYYMY YYLGVDKSEL AMLASLGSLV NLGLLIPAGR
AVDTRGRNFS ITLGFLMGTL SQLFFVLSPP SSSFTLPALI VSTLFGAVGG AFYGLAYSSL
RADLVAKEYR GRIYALWGLA PAASWSLGAY IGGWMYSNLG PQTPFVASFM LRVLLTPLAL
TLFGKLTRKV DLALRNVEGS