Gene Information |
Locus tag | Tpen_1149 |
Symbol | |
ID | 4600955 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 1087814 |
End bp | 1088890 |
Gene Length | 1077 bp |
Protein Length | 358 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 639773925 |
Product | ABC transporter related |
Protein accession | YP_920550 |
Protein GI | 119720055 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3839] ABC-type sugar transport systems, ATPase components |
TIGRFAM ID | |
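The Start bp, End bp, Gene Length, and Protein Length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 1087814
end_bp = 1088890

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon, which is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1077 358
```

Under these assumptions the computed values agree with the Gene Length (1077 bp) and Protein Length (358 aa) rows.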
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGCGTGG AGCTTAGAAA CGTAACGAAG AGGTTCGGGG AAGTAGCCGC GGTGTACAAC GTCAACCTGC GCGTAGAGAG CGGAGAGTTC TTCGTGCTAC TCGGCCCCTC GGGGTCGGGT AAGTCTACGC TCCTACGCAT AATAGCGGGC CTCGAAGAGC CCGACGAGGG AGAGGTTATC ATCGGGGGCA GAGTCGTGAA CGACGTAGAC CCGAGCGAGA GGGACATAGC TTTCGTGTTC CAGAACTACG CCCTCTACCC GCACATGACG GTCTACGACA ACATAGCGTT CCCGCTCAGG ATGAGGAAGG TCCCGAAGGA CCAGATCCAC GCGCGCGTGC TCGAAGTCGC CTCCATGCTA GGGCTTACGA ACCACCTCTC CAAGTACCCC TACCAGCTCT CGGGCGGCGA GCAACAGAGA GTAGCCCTGG CGAGGGCTAT CGTGAGGAAG CCCCGGGTAT TCCTCCTCGA CGAGCCTCTC AGCAACCTGG ACGCCAAGCT CCGCGTAAAG CTAAGGTTCG AGTTGAGGAA GCTCCTCCAC GACGAGCTGA AAACAACCAC GATCTACGTC ACCCACGACC AGGTCGAAGC AATGACGATG GCGGATCGCG TAGGCGTTAT CAACAGAGGG CAACTGGTAC AGGTAGGCAC CCCCGACGAG CTCTTCGAGA AGCCATCAAA CACCTTCGTC GCGGGATTCA TAGGAACACC GCCCATGAAC TTCCTGCCAG CCAGGGTCGC TGAGAAGACG CTACGCCTCG GCAACCTCGC TATCCGCTCC GAGGAGCTAG AGGGGCTAGC CGAGGGCGAG GTAATCCTCG GCATACGCCC CCAGCACCTC GAGGTAGGCG AGGAGGGGCT CCCAGTGCGG GTCGTGGGCG TCGAGAGGCT CGGCACGGGC TCGATACTCC ACGGAGTCTT CGAAGGCTTC GAGGTCACCG CGTACTCCGA GAAAAGGGAG CACGCCCAGC CGGAGCTTAA CGCCGAGGTA CGCTTAAAGC CCGTGGGGCC CCTCTACCTC TTCGACTCGA GGAGCGAGGA GCTACTCAGG GTTGTGCGGA GCTACAGGGT CGAGTAG
|
Protein sequence | MSVELRNVTK RFGEVAAVYN VNLRVESGEF FVLLGPSGSG KSTLLRIIAG LEEPDEGEVI IGGRVVNDVD PSERDIAFVF QNYALYPHMT VYDNIAFPLR MRKVPKDQIH ARVLEVASML GLTNHLSKYP YQLSGGEQQR VALARAIVRK PRVFLLDEPL SNLDAKLRVK LRFELRKLLH DELKTTTIYV THDQVEAMTM ADRVGVINRG QLVQVGTPDE LFEKPSNTFV AGFIGTPPMN FLPARVAEKT LRLGNLAIRS EELEGLAEGE VILGIRPQHL EVGEEGLPVR VVGVERLGTG SILHGVFEGF EVTAYSEKRE HAQPELNAEV RLKPVGPLYL FDSRSEELLR VVRSYRVE
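The relationship between the Gene sequence and Protein sequence rows can be illustrated by translating the first three 10-base blocks. This is a minimal sketch, assuming the gene sequence is already given on its coding strand, that translation table 11 assigns the same amino acids to codons as the standard code, and that the initiating GTG codon is read as methionine; `translate` is a hypothetical helper, not a function from any database toolkit.

```python
from itertools import product

# Amino acid assignments in NCBI base order (T, C, A, G).
# Assumption: table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a coding-strand CDS, stopping at the first stop codon."""
    cds = cds.replace(" ", "").upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    # Assumption: the initiating codon (GTG here) is read as methionine.
    if protein:
        protein[0] = "M"
    return "".join(protein)


# First three 10-base blocks of the Gene sequence row.
first_30 = "GTGAGCGTGG AGCTTAGAAA CGTAACGAAG"
print(translate(first_30))  # MSVELRNVTK
```

The output matches the first 10-residue block of the Protein sequence row.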