Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_1451 |
Symbol | |
ID | 4601160 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 1401556 |
End bp | 1402890 |
Gene Length | 1335 bp |
Protein Length | 444 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 639774226 |
Product | binding-protein-dependent transport systems inner membrane component |
Protein accession | YP_920851 |
Protein GI | 119720356 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3833] ABC-type maltose transport systems, permease component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0294771 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGGGCGC AGCAGAAGGC TAGGAGGAGC GACTTCTTCA AAAGCGCGGT GCTGACGCTG CTCGCGCTCG TCGTTATGGG CGTACTGCTC TTCCCCGTTT ACTACATGTT CATGGTCTCG CTTAAACCCG TTGGAACCCT AGCCACTACG AGCCTCGAGG TAATTCCGAG CAAGGTTACG CTCGACAACT ACCTAGAGAT ACTCGTGGGT CACTACGAGG CGACGCTGGA CGTGAAGAGC TTCGCGCTCC GCGCCCAGAA CGCCACGATA TCGGATGCCC TGAACCGCTA CGAGGTCGAC CTGTATGACG GAGTCGTGGC CGGCGACTAC CCCGTGAAGT TTACTCTAAG CAACGCTAAG ATTCTGGAGA GGAGGGGAGG ACAGGAGAGA GGTGAGAGGG ACGCCACGAT AATAGTCGGC GGGGACTACC TGAAGGTGGG CGCGGACTCC GCGGAGACGA TAACGGCGGC TAGAAGGCTG ACGGTTGAAG CGAGGAAAAT CGTCGTGAAG GTCAGTGGTC CCGGCGGAGC GCCCCTCGAC CTCTCGAAGT TCAGAGAGGT GGCTCCGGGC GTCTACGAGG CGGAGAACGC CAGGCTGTAC CTGGAGGACG GGGGGAGGAT CTCGGCGGAG AAGTGCACCG TTGAGACTAC CAGTTTCAGC TACATAAGGC TGGCGAAGGT CGGGGGAGAG ATATGGGGCT ACATGAGTAG GAGCCTGATA ATCGCAAGCC TAACGGTGGT TCTAACCCTG CTCTTCGTGG TGCCGTCCGC CTACGCGTTT TCGAGGCTCA AGTTCTTCGG GAAGGGGCAC ATACTCTACT CTTACCTCAT GTTCACGCAG GTAGCGGGAG GACTGGGGAT AGCCGGGCTC GTAGCCCTGT ACGGCATGCT CGTTAGGCTC AACCTGGTGA ACAACATCTT CGTGCTACCG GTGATCTACG CGGCGGGGAG CGTCCCGTTC AACACGTGGC TCCTCAAGGG GTACCTGGAC TCCATAAGCC CGGATTTCGA CGAAGCCGCC CTCGTAGACG GGGCGAGCTA CGCGCAGATC ATAGGGCAGG TCCTAGTGCC GATGGCGCTA CCGGGTATAG CGACGGTCGC AATCTTCTCC TTCATCGGGG GGTGGACGGA GCTCATACTT GCGAACCTGC TGCTCAACCA GGAGAACCAC CCCTTGACCG TCTACATCTA CGTGTTGCTC ACCAACCTCA GGAACGTGTC CTGGAACCAG TTCGCGGCCG CCGCTCTGAT CTTCGCCCTC CCCGTCGTAG TGATGTTCCT GCTGGCCCAG AACTACGTCA GAAGCGGGTT GACGATGGGA GGACTAAAAG AGTAA
|
Protein sequence | MRAQQKARRS DFFKSAVLTL LALVVMGVLL FPVYYMFMVS LKPVGTLATT SLEVIPSKVT LDNYLEILVG HYEATLDVKS FALRAQNATI SDALNRYEVD LYDGVVAGDY PVKFTLSNAK ILERRGGQER GERDATIIVG GDYLKVGADS AETITAARRL TVEARKIVVK VSGPGGAPLD LSKFREVAPG VYEAENARLY LEDGGRISAE KCTVETTSFS YIRLAKVGGE IWGYMSRSLI IASLTVVLTL LFVVPSAYAF SRLKFFGKGH ILYSYLMFTQ VAGGLGIAGL VALYGMLVRL NLVNNIFVLP VIYAAGSVPF NTWLLKGYLD SISPDFDEAA LVDGASYAQI IGQVLVPMAL PGIATVAIFS FIGGWTELIL ANLLLNQENH PLTVYIYVLL TNLRNVSWNQ FAAAALIFAL PVVVMFLLAQ NYVRSGLTMG GLKE
|
| |