Gene Tpen_0451 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_0451 
Symbol 
ID4601854 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp409958 
End bp411334 
Gene Length1377 bp 
Protein Length458 aa 
Translation table11 
GC content62% 
IMG OID639773218 
Producthypothetical protein 
Protein accessionYP_919863 
Protein GI119719368 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTGTGA ACAAGAAAGC CCTAGTCTTG GTAGCAGTTG CAGTGCTCAC AGCCCTGGCA 
GCCTACATCG CGCTTCTAGC AACGCCGCCC GCACAGCCAC CCGTGCAACC ACCTACACAG
ACGCCAGCCC AGGGGCCGGC GCCGGCCGAG AAGCCTAGGA ACGCCACCCT AACGCTCAAG
GTCTACGGCC CCGGCTCCCT GCTCGTCAAC GGCACGGGCT ACGTGAACGC CACGCTGACG
CTCAGGGCGC CCGCCGTCTT GGCTATCAAT GCCTCGCCCC AGCCAGGCTG GAGGCTGAAG
GCTTTACTCG TGAACGGGTC GCCCGTTTTA CCAGGCGTAG CGAGGGTATC GGGCGATACG
ACGGTTGAGG CGGTTTTCGC GTGGAGTGGC CCCGTGGTCA CGCTGAGGGT TTACGGCTCC
GGCTACCTGC TCGTCAACGG CTCGAGCTAC GGCAACGACA CGCTGTTGCT CAGGGGCGGG
GCACTGCTCT CGATCAACGC GTCCACGCCT AGAAGCTGGA GGCTGAAAGC ATTGCTGGTC
AACGGCTCCC CCACCTCGCC CGGCGATGTC AGAGTCTACG GCAACACGAC CATCGAGGCC
TTCTTCGAGC AGGTGAAGGT AAAGGTCAAG GTGGTGCCGG GCGAGCACCC CGTAACAATC
AACGGTTCCT GGGTGAACTC GACCACTGTG CTGGAGGTCC CGGCCTACTC TGTCCTCGTG
CTGGGCCCTG CGAGTGTGGA GCTCAACGAG ACCTGTGAGG CTGTGCACTA CTGGAACGCC
AGTGTGGCTG GGCGCTGGAC GCTCCTACAC GGCGACGCCT CGCTCGAAGT CGCGAACGAC
ACGGTACTGG TGGCGGGCTG GAGCCTCAAG TGCCACCCAC CACGCTCGAC GCTCGGAGGA
GTACTCTACG CCGGCAGAGA AGTCAAGGCC AGGATGGTGC TCACAGTGAA GGAGGCCCAG
TCGGGGTCCT GGAGGTACAA GGGCAACGGA GTCTGGGAAA TAGAGGCACC CGGCTTCCTC
ATAGTACTCC TAGAAACCCC GAAGAACTGG AGCAAAGTAG TAGTGAAGGG CAAGCCGCTC
GCGCGTAGCG GCATCATAGA GATCCTCGTG ATAGTCGAGA ACGGCCCCTC GATGTACCGC
TCGAAGGGCG CCGGCCTAAT ATTCGAGGAC ATATCCTACT TCGAGTTCGT ACTGCCGAGG
TGCATAATGC AGGGAACGTG CGACGCAACG GTAAACGCCT ACGGAAGCTT CGTGAACGAG
GGCTACCGCG AGCACTGGGG CCCGAGGCTG GAGCCACCAC CGGTAGAGCC CGGCTGGCTC
GAAATACAGG TCTACCCCGG GACGCACGTA GAGATACAGG TATTCGTCGA GCCCTAA
 
Protein sequence
MTVNKKALVL VAVAVLTALA AYIALLATPP AQPPVQPPTQ TPAQGPAPAE KPRNATLTLK 
VYGPGSLLVN GTGYVNATLT LRAPAVLAIN ASPQPGWRLK ALLVNGSPVL PGVARVSGDT
TVEAVFAWSG PVVTLRVYGS GYLLVNGSSY GNDTLLLRGG ALLSINASTP RSWRLKALLV
NGSPTSPGDV RVYGNTTIEA FFEQVKVKVK VVPGEHPVTI NGSWVNSTTV LEVPAYSVLV
LGPASVELNE TCEAVHYWNA SVAGRWTLLH GDASLEVAND TVLVAGWSLK CHPPRSTLGG
VLYAGREVKA RMVLTVKEAQ SGSWRYKGNG VWEIEAPGFL IVLLETPKNW SKVVVKGKPL
ARSGIIEILV IVENGPSMYR SKGAGLIFED ISYFEFVLPR CIMQGTCDAT VNAYGSFVNE
GYREHWGPRL EPPPVEPGWL EIQVYPGTHV EIQVFVEP