Gene Tpen_0503 details

Gene Information

Locus tag: Tpen_0503
Symbol:
ID: 4601337
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermofilum pendens Hrk 5
Kingdom: Archaea
Replicon accession: NC_008698
Strand:
Start bp: 458348
End bp: 459553
Gene Length: 1206 bp
Protein Length: 401 aa
Translation table: 11
GC content: 53%
IMG OID: 639773270
Product: beta-lactamase domain-containing protein
Protein accession: YP_919913
Protein GI: 119719418
COG category: [C] Energy production and conversion
COG ID: [COG0426] Uncharacterized flavoproteins
TIGRFAM ID:
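
The start, end, and length fields in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only; it assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 458348
end_bp = 459553

# Assumption: coordinates are 1-based and inclusive, hence the +1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and yields no residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1206 401
```

Both results agree with the Gene Length and Protein Length fields.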


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0527629
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCTAGAG TTTTTGTAGA CAAGGTAGTA GACGATTTGT ACGTACTCCG AGTTGACGAC 
GACAGGACAA GGTTTTTCGA GGCGCTGTGG GAAATAGAAG AAGGCATTAC GTACAACGCG
TACCTCCTGT TAACCGGCGA GGGGGCAGTG GTCTTCGATG GGTGGAAGAA GTGGTTTTCT
GAGCTATTCT TGGAGAAAAT TAGGGAAATA GTGGACTATG GGGATATCAG GTATGTAGTT
GTTCACCACG CTGAGCCGGA TCACTCGGGC AGCGTGCCGG ACGTTCTGCG GGTATCCGAG
AAGGCAGTCG CACTGGGACA CCCTATCGCC GGGAGGATGC TGTCCTCTTT TTACGGGGTT
ACCAGGTTCA GGCCCGTGAA GGACGGAGAG GAGTTAAAGG TGGGGGAGCG TAGCATTCAC
TTCGTGTACA CCCCGTGGCT TCACTGGCCC GAAACAATAA TGAGTTACCT TAGAGATGAC
GGCGTACTTC TGAGCTGCGA CGCTTTCGGG TCGTACTCCG TTCCAGCTCT CTACGATGAG
AGCGTTGAGA ATTTCGAAAA ACTGGCATGG TTCATCAAGA AGTACTATGT AACGGTGATC
GGGCATTACT CTCCCCACGT ATTGAAGGCC CTGGAAAAGC TCGGCAGTCT AGGGGTAAAG
CCGCGCATCA TTGCACCATC TCATGGTACG ATCTTTAGGA GAAACGCCGA GAGAATACTG
GACGAGTACC GACGTATAGC GCTCGGGGAG CCGGTTAAGG GTAAAGCAGT GGTCGTATAC
GTGTCAATGT ATGGCTTCGT GGAACGTCTG GTAAGCCTGG CTATAGAAGA GCTTTCGCAG
AGAGGCTTTG AGACAAGGGT TTACGCCTTC ACGGATACCT CCAGGCCTAG AATCTCCGAT
ATCCTGGGAG AACTCGCAGA TGCTGAGTTG CTAGTACTCG GCGGAGCAAC CTACGAGGCC
GGGGTTCAAC CGGTACTGGA CTACGTGGTT CGGTTAATAG CGGAAAAGCT CGAATATAGG
TCCGACAAGC TTGCCGTTCT GCTACTCTCA AGCTACGGTT GGGCCGGGAC GGCGGGCAAG
CTGGTAGCAG AGGCGCTTAA GGGGCATGGC TTCCAACGCG TAAGTAGCGT CGAAGTGCAA
GGCTACGCCC CGGAGAACGT GAAGGAAAAT CTGAAAAAGG CTCTGGACAG TCTTCTGTCT
CTATGA
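
The GC content field can be recomputed from the gene sequence. The following is a minimal sketch, not the database's own method; `gc_percent` is a hypothetical helper that ignores the spaces and line breaks used in the listing and assumes the input contains only A, C, G, and T.

```python
def gc_percent(dna):
    """Return the percentage of G and C bases in a DNA string."""
    # Remove the spaces and newlines that group the listing into blocks of 10.
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# Usage on the first 20 bases of the gene sequence only.
print(gc_percent("ATGCCTAGAG TTTTTGTAGA"))
```

Passing the full gene sequence to the function gives a value that can be compared with the rounded percentage in the record.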
 
Protein sequence
MPRVFVDKVV DDLYVLRVDD DRTRFFEALW EIEEGITYNA YLLLTGEGAV VFDGWKKWFS 
ELFLEKIREI VDYGDIRYVV VHHAEPDHSG SVPDVLRVSE KAVALGHPIA GRMLSSFYGV
TRFRPVKDGE ELKVGERSIH FVYTPWLHWP ETIMSYLRDD GVLLSCDAFG SYSVPALYDE
SVENFEKLAW FIKKYYVTVI GHYSPHVLKA LEKLGSLGVK PRIIAPSHGT IFRRNAERIL
DEYRRIALGE PVKGKAVVVY VSMYGFVERL VSLAIEELSQ RGFETRVYAF TDTSRPRISD
ILGELADAEL LVLGGATYEA GVQPVLDYVV RLIAEKLEYR SDKLAVLLLS SYGWAGTAGK
LVAEALKGHG FQRVSSVEVQ GYAPENVKEN LKKALDSLLS L
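
The protein sequence follows from the gene sequence under translation table 11. The sketch below is one way to check that relationship, under stated assumptions: the amino acid assignments of table 11 match the standard code (the two differ in the start codons they allow), the gene begins with ATG as it does here, and the input contains only A, C, G, and T. `CODON_TABLE` and `translate` are hypothetical names, not part of any database tool.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G; third base varying fastest), paired with
# the amino acid assignments shared by translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a coding sequence, stopping at the first stop codon."""
    # Remove the spaces and newlines used to group the listing.
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGCCTAGAG TTTTTGTAGA CAAGGTAGTA"))  # prints: MPRVFVDKVV
```

The output matches the first ten residues of the protein sequence. Translating the final six bases, CTATGA, gives a single L followed by the stop codon, which matches the last residue listed.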