Gene Tpen_1105 details


Gene Information

Locus tag: Tpen_1105
Symbol:
ID: 4601099
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermofilum pendens Hrk 5
Kingdom: Archaea
Replicon accession: NC_008698
Strand:
Start bp: 1042466
End bp: 1044091
Gene Length: 1626 bp
Protein Length: 541 aa
Translation table: 11
GC content: 65%
IMG OID: 639773882
Product: ABC transporter related
Protein accession: YP_920507
Protein GI: 119720012
COG category: [R] General function prediction only
COG ID: [COG1123] ATPase components of various ABC-type transport systems, contain duplicated ATPase
TIGRFAM ID: [TIGR01166] cobalt transport protein ATP-binding subunit
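
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1042466
end_bp = 1044091

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and ends in a stop codon,
# which does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1626
print(protein_length)  # 541
```

Both results agree with the listed Gene Length (1626 bp) and Protein Length (541 aa).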


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGCGTAG TAGAAATCGA GGGGCTCAAG TGGAGGTACA GGGGGTCGCC GCACTACGCC 
TTGAACGGCG TAAACCTCGG CGTCGAGAAG GGCGAGTTCC TCGCGATCAC AGGGCTAAGC
GGGGCGGGCA AGACGACTCT CGTCCTCTCG ATCCTCGGCA TAATACCCCA GAGGCTCCCC
GGCGAGTTCA GCGGCAAGGT GAAGGTCCTC GGCCTCTCCA CGCTCTCGAC GGACGTGACC
ATCATCGCGC AGAGAGTAGG CGTGGTGTTC GAGGACCCGG AGATACAGTT CGTCATGGGC
ACGGTGGAGG ACGAGGTCGC TCTCTCCCTC GAAGCCATGG GGCTACCGCC GGAGGAAGTC
AGGGAGAGGA CGCTGTGGGC GCTGGAGCTC GTCGGGCTCG GCGCCGGCTT CCTCCAAAGG
AACCCCTCGC AGCTCTCGGG GGGCGAGAAA CAGCGGGTGG CGATAGCGTC CGCGGTCGCG
AAGGAGCCGG AGCTACTCAT ACTCGACGAA CCAACCTCGG ACCTGGACCC CGCCGGCAAG
GAGGAAGTCG TGTCGGCGAT CGAGAGCCTG CGCAGGCAGC TCGACGTCAC CATAGTAATG
GTTGAACAGG AGCCCGACAT CATATACAGG TTCGCGGACA GAGTGGTCGT GCTGGAGAAG
GGCAGGGTCG CGCTCGAAGG CACCCCGCGC GAACTCTACC ACAGAATGGA GGAGCTGAGG
AGGCTGTCCC TGCGCCCTCC GGAGCTCTAC GAGCTGTGCA GGGCCGCCGG CCTACGGGAA
CCGTCCCTGG AGGAGCTCGT AAGGCTCGCG GAGAAAGGGT TGCTCGACGG CTCCGTGTGC
GGTGAGCCGC GCGGGCGGCG AGGAGGGCTC GAGGAAGTGG TGAGGGTTCA AGGCGTTACC
CACGTCTACC CTGGAGGGAT CAGGGCGCTG GACAACGTGA CGCTCACCCT GTACTCGGGG
GAGCTCGTAG CGCTGATGGG CCCCAACGGG AGCGGGAAGA CCACGCTGGC AAAGGTGATC
GCGGGGCTCG TGCGCCCGAC GAGCGGGAGA GTCCTCGTAA GGGGGCGGGA CGTCTCCAGC
TACGGCAGGC TGGAGCTCTC GTCGATAGTG GGCTACGTCT ACCAGAACCC CCAGCACCAG
CTCTTCTGCC AGTCTGTCTA CGAGGAGGTA GCGTTCGGCC TAAGGTTGCG CGGAGCAGGA
GAGGGCGAGG TGAGGAAGGC GGTCGACGAG GCGCTGAGGC TCTTCAACCT GGAAGGCAAG
GCAGAGGAGC ACCCGTTCTT CCTGAGCAAG GGAGAGAAGA GGAGGCTCGC GCTTGCGAGT
GTATACGCGC TGAACCCTTC TGTGCTGATA GTGGACGAGC CCACCACGGG GCAGGACAGA
GCGTTCTCGG AGTACCTCTT CTCGACGCTG AGGAGGCTCG CGGAAGAGGG GAAAGCCGTA
GTCGCGATAA CGCACAGCGT CGACCTGGCC TCGGCGTACG CCGACAGGGT GGTCGTAATG
TGTGGGGGCA GAATCGTAGC TGACGGCGAG CCGGACAGCG TGCTGGCAGA CCCCGGTGTA
GCGGAGAAGG CGAGGATCAA GCGCCCACTG AGGTACGTCC TCTGCAGGCA GAGCCGGCAC
GGCTAG
 
Protein sequence
MRVVEIEGLK WRYRGSPHYA LNGVNLGVEK GEFLAITGLS GAGKTTLVLS ILGIIPQRLP 
GEFSGKVKVL GLSTLSTDVT IIAQRVGVVF EDPEIQFVMG TVEDEVALSL EAMGLPPEEV
RERTLWALEL VGLGAGFLQR NPSQLSGGEK QRVAIASAVA KEPELLILDE PTSDLDPAGK
EEVVSAIESL RRQLDVTIVM VEQEPDIIYR FADRVVVLEK GRVALEGTPR ELYHRMEELR
RLSLRPPELY ELCRAAGLRE PSLEELVRLA EKGLLDGSVC GEPRGRRGGL EEVVRVQGVT
HVYPGGIRAL DNVTLTLYSG ELVALMGPNG SGKTTLAKVI AGLVRPTSGR VLVRGRDVSS
YGRLELSSIV GYVYQNPQHQ LFCQSVYEEV AFGLRLRGAG EGEVRKAVDE ALRLFNLEGK
AEEHPFFLSK GEKRRLALAS VYALNPSVLI VDEPTTGQDR AFSEYLFSTL RRLAEEGKAV
VAITHSVDLA SAYADRVVVM CGGRIVADGE PDSVLADPGV AEKARIKRPL RYVLCRQSRH
G