Gene Tpen_0204 details


Gene Information

Locus tag: Tpen_0204
Symbol:
ID: 4602215
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermofilum pendens Hrk 5
Kingdom: Archaea
Replicon accession: NC_008698
Strand:
Start bp: 182677
End bp: 183984
Gene Length: 1308 bp
Protein Length: 435 aa
Translation table: 11
GC content: 54%
IMG OID: 639772958
Product: amine oxidase
Protein accession: YP_919617
Protein GI: 119719122
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG1232] Protoporphyrinogen oxidase
TIGRFAM ID:
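
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the gene record above.
START_BP = 182677
END_BP = 183984
GENE_LENGTH_BP = 1308
PROTEIN_LENGTH_AA = 435


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming a single terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both checks hold for this record: 183984 - 182677 + 1 = 1308 bp,
    # and 1308 / 3 - 1 = 435 aa.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```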


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.420312
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGGAGAG AGAGCATAGT AGTGCTAGGC GCTGGTCTCG CGGGGCTGTC GTTCGCCTAC 
GAGGCTACCA GGCTTGGTCA CTCCGTCACA CTGGTAGAAA AAGAGGCAGA GGCAGGGGGG
CTTCTGAGGA GCGAGGAGGT AAATGGCTAC GTGTTCGACA CGGGAGGCTC CCACGTGATT
TTCTCAAAGT ATCCCGAGAG AGTGAAGTTT TTGACGAGGA TTTTGAACGG GAATATTGTT
AGGAACAGGC GGGATGCACG CATATTCTAC AACGGTACGC TGGTGAAGTA CCCATTTGAG
AACGGTCTCT ACGCTCTCCC GGCAGAGAAG AGGTATAAAG CCTTGAAGGA CATTTTAGAA
AGGTACATAG AGTACAGGTG CGGGGGTTCG CGGAGCGTTG AAAACTTCGA GGAATGGCTT
TACGCAACGT TTGGGGAAAC CATAGCTAGC GAGTACTTGG TGCCGTACAA CAGGAAGCTT
TGGAAGAGGG ATTTGAAGCA AATATCCCTG GACTGGGTTG GCAACAGGGT TCCTCAGCCT
CCGCTGGACG ACGTGATTAG AAGCGCCGTT GGTATACCGA CGGAGGGCTA CTTGCACCAG
CTCTTCTTCT ACTACCCTCG AAGTGGCGGT ATTCAATCAC TCGCAGATGC CTTCAAAGCA
CGCGTGGAGG GATCGCCTCT ATCCTCTTTA GTGCTCGGAA AGGAGGCCGT AAAGGTGGAT
CCCTACGAGG GGGAGGTCGT GTTGTCGGAC GGCTCCGCGG TTAGGGGCGA TAGGATTGTC
TCCACGATCC CCTTGCCGGA GCTTTACAGG GCGCTAGGGC TACGCTTGAA CCTAGACTAC
AACTCCCTGG TGGTCGTAGG TGTAGGCGTT AAGGATGCCA GGCTACCCAG GGTGCACTGG
ATATACTTCC CGAACGAGGA TATCGTTTTC CACAGATTGG CGATCCTAAG CAACTACAGC
CCGTACATGT CGCCAAAGGG TTCTGCCACC TTAGTGGCGG AGATTACGCT TAGACCAGGA
GAGACCTTCG ACGAGGAAAA AGTCGTCAAC GCGACGCTGG ACGGATTAGA GGCGGCCGGC
TTGCTGAGAG GTAGAGGGAG CGTGGAGGTT GTACGCGCAT GGTACTGGAG ATATGCCTAC
ATAGTGTACG ATCACAATTA CTCGAGGAGA GTTCGCGAGG CCAAGGAGAG GCTCAGGAGG
CTAGGAATAT CGCTGATAGG CAGGTTCGGC CTGTGGGAGT ACATGAACAT GGACGACGTT
GTTTACCGCT CTATAAGCGA GGCCAGAAGG ATGGGCAGAG CGCGGTGA
 
Protein sequence
MRRESIVVLG AGLAGLSFAY EATRLGHSVT LVEKEAEAGG LLRSEEVNGY VFDTGGSHVI 
FSKYPERVKF LTRILNGNIV RNRRDARIFY NGTLVKYPFE NGLYALPAEK RYKALKDILE
RYIEYRCGGS RSVENFEEWL YATFGETIAS EYLVPYNRKL WKRDLKQISL DWVGNRVPQP
PLDDVIRSAV GIPTEGYLHQ LFFYYPRSGG IQSLADAFKA RVEGSPLSSL VLGKEAVKVD
PYEGEVVLSD GSAVRGDRIV STIPLPELYR ALGLRLNLDY NSLVVVGVGV KDARLPRVHW
IYFPNEDIVF HRLAILSNYS PYMSPKGSAT LVAEITLRPG ETFDEEKVVN ATLDGLEAAG
LLRGRGSVEV VRAWYWRYAY IVYDHNYSRR VREAKERLRR LGISLIGRFG LWEYMNMDDV
VYRSISEARR MGRAR
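
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. It assumes the codon-to-amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in its permitted start codons, which this sketch does not model). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and only the first line of the gene sequence is embedded here to keep the example short.

```python
from itertools import product

# Standard codon assignments, ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the listing; uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate complete codons in frame 1 and drop trailing stop codons."""
    s = clean(seq)
    usable = len(s) - len(s) % 3
    protein = "".join(CODON_TABLE[s[i:i + 3]] for i in range(0, usable, 3))
    return protein.rstrip("*")


if __name__ == "__main__":
    # First line of the gene sequence, copied from the record above.
    first_line = (
        "ATGCGGAGAG AGAGCATAGT AGTGCTAGGC GCTGGTCTCG CGGGGCTGTC GTTCGCCTAC"
    )
    print(translate(first_line))  # MRRESIVVLGAGLAGLSFAY
```

Pasting the full gene sequence into `translate` and `gc_fraction` would allow a comparison against the listed protein sequence and the reported GC content.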