Gene Tpen_1334 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_1334 
Symbol 
ID4601309 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp1287476 
End bp1289266 
Gene Length1791 bp 
Protein Length596 aa 
Translation table11 
GC content53% 
IMG OID639774109 
Productmembrane protein-like 
Protein accessionYP_920734 
Protein GI119720239 
COG category[S] Function unknown 
COG ID[COG1470] Predicted membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.513509 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCCTTA AGGCAGGCGG TAAAGAAAGA ACTCTCCTCC ATACACGTTA CGCTCTCCTA 
GCGCTGATGA TCATACTGGT TATGACTCCC CGTCTATTAA CCGCGCAAAG CACTAGCGTG
AATGTACATG GAGTAGTCGT TGACCCGAGG GGGGCGCCGA TAAGCGATGT ATCAATACTG
ATCTTCGGCG AGGATAACAC TCTTGTCGCC AGAGTTAAAA CCAGCATGAC GGGTGACTTT
TGGGCTTTAC TGGCTCCAGG CACGTACAAA GCTTCCCTAA TTAAAGTGGG CTACGAGGCT
AAAACCATTT CCTTTAGCAT TTCCGGCGAC AGGCTACACG TAGAGCTGGG AGAGATCACC
CTGGATTACA GCCTCTCAGT CTCCGTAGAG CTCAGAGATG TAAGAGCCAG CTGTCTTTCC
ACGCTCCGCA TACCGGTGGT TTTGGCGGAG AAAGGTTCGA GAGAAGAAAC CGTCACCCTC
TCGGCGAGCG CACCTTCCGG GTGGACCGCT GGCTTCTACC TGGGAGACCT CGAGGTGAAA
AGCATAGTCT TAAGCCCTGG GCAGACCTTG AAGCTTGACC TAGTGCTGAA AGTACCTTAC
AACGCCTCCG GTCGGTACAA CATAACGGTA GACGTTCTTG GATACACTCT TCAGAGGAGA
GTGATCACAG TCGACATCGA GCATAGAGAC CCCCAGCTCG TAACATCTAC CTACCTAAGC
GTAAAGGCGT CTCCCGGCTC AACGGTGAGC CTCGACGGTT TAGAGATAAC GAACAAGCTT
CCCGATAGGA CTTCAGGGGT AGTCTTCCTG CTACTACCGA GCCGGTGGTC AGGGAGTATT
CTCGACTCAA CTAGCGGCAA TAACCTCTAC AAGCTCTCCT TGAACCCCGG AGAAAGCGTG
AAAGCTAAAG TCGTCCTGCA GATCCCCGAT ACCGCGGCAC CGGGTAACTA TACAGTAGGC
GTTGTGTTCA GAGGCGTGGA CCCGTACTTC GAGTCGAAGC TCTGGCTCAA CGTTACCGTC
GTAAAGGGAA AGCCCCTCGC GAAGCTAGAA ACCGAGACTC CTTTCCTCAA CACCTACGCC
GGCCGTAGTG CAAGCTTTCT CGTCACTACT AGGAATATCG GCGAAGGAGA CGGAGTCGTA
GAGCTAGTCG TGAAAGGACT GCCGCCCGGC TACAGCTGGC GTATAGAAGA CCTTAACGGG
AATGTTATCT CCAAGCTGTA CCTGAAGGCG GGAGAAAGCA GGCAATTAAA AGTTGTCGTA
AGCGTACCTC TCCTCGCCGA GCCCTCGGTC ATTCCTTTCA TACTCGAAGC CAACACGAAC
TATTCTCGTG TAAGCCTACC TCTCAACCTG GGAGTTATGG GAAGCTACAG CCTCGAGTAT
ACAACCCAGA ACTTCTACTT GGAGGTGACA TCGGGCTCTT CGGCCACCTT CCAATTAGGG
GTCAAAAACA CCGGCTACAG CTCATTAACA AACGTAAGAA TAGAGGTGTC TAACGTGCCG
AGGGGGCTCC GCGTAAACAT CTCTCCAGAA GTCGTTCTAT CGCTTAAAGC CCAGGAAAAC
GCCAACTTCA CGGTAACAGT TTACGCAGAG CCGGACGTAA GCGCTGGAGA CTACTACATA
ACCTTGAAGC CGCTGGCCGA CCAGCTCAGC GAGGATCAAT CCCTAGTTGC AACGAGGCAA
CTCCACGTAT ACGTAAAGAC GGGGGCCGGC GCGGTATACA TAGGGCTGGG AGCTCTACTC
GCACTAGTAG TGCTTTTAGT CGCGGTCTAC AGAAAGTTCG GCAGGAGGTG A
 
Protein sequence
MTLKAGGKER TLLHTRYALL ALMIILVMTP RLLTAQSTSV NVHGVVVDPR GAPISDVSIL 
IFGEDNTLVA RVKTSMTGDF WALLAPGTYK ASLIKVGYEA KTISFSISGD RLHVELGEIT
LDYSLSVSVE LRDVRASCLS TLRIPVVLAE KGSREETVTL SASAPSGWTA GFYLGDLEVK
SIVLSPGQTL KLDLVLKVPY NASGRYNITV DVLGYTLQRR VITVDIEHRD PQLVTSTYLS
VKASPGSTVS LDGLEITNKL PDRTSGVVFL LLPSRWSGSI LDSTSGNNLY KLSLNPGESV
KAKVVLQIPD TAAPGNYTVG VVFRGVDPYF ESKLWLNVTV VKGKPLAKLE TETPFLNTYA
GRSASFLVTT RNIGEGDGVV ELVVKGLPPG YSWRIEDLNG NVISKLYLKA GESRQLKVVV
SVPLLAEPSV IPFILEANTN YSRVSLPLNL GVMGSYSLEY TTQNFYLEVT SGSSATFQLG
VKNTGYSSLT NVRIEVSNVP RGLRVNISPE VVLSLKAQEN ANFTVTVYAE PDVSAGDYYI
TLKPLADQLS EDQSLVATRQ LHVYVKTGAG AVYIGLGALL ALVVLLVAVY RKFGRR