Gene Tpen_0539 details


Gene Information

Locus tag: Tpen_0539
Symbol:
ID: 4601053
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermofilum pendens Hrk 5
Kingdom: Archaea
Replicon accession: NC_008698
Strand:
Start bp: 489640
End bp: 490974
Gene Length: 1335 bp
Protein Length: 444 aa
Translation table: 11
GC content: 56%
IMG OID: 639773310
Product: 4-aminobutyrate aminotransferase
Protein accession: YP_919948
Protein GI: 119719453
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0160] 4-aminobutyrate aminotransferase and related aminotransferases
TIGRFAM ID: [TIGR00700] 4-aminobutyrate aminotransferase, prokaryotic type
            [TIGR00707] acetylornithine and succinylornithine aminotransferases
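
The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end coordinates are 1-based and inclusive, and that the gene length includes the stop codon; both assumptions fit the numbers listed here but are not stated by the record itself. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(489640, 490974)
print(length)                     # 1335
print(protein_length_aa(length))  # 444
```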


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAACGGAC CGAAAATAAT CGTTGAGCCT CCGGGACCTA ACTCTAGGAA GATCGCCGAA 
AAGGACTCCG CACTGTTGAT GCAGAGCTTC GCGCGCTGGT ACCCCTTGGT CGCCAAGCGC
GCACACGGCG TGTGGGTCGA AGACGTCGAC GGAAACGTCT ACCTGGACTT CAACTCCGGG
ATAGGCGTGA CGAACACGGG GCACTGCCAC CCTAAAGTCG TCAAAGCGAT AAAAGAACAA
GCAGAGAGGC TACTTCACTA CTCTTTGACG GACTTCCTCT ACGAGGAGCC CGTCAAACTC
GCCGAGAAGC TTGTATCGAT AACTCCGGGA CGCTTCCCGA AGAAGGTGTT CTACACGAAC
AGCGGAACGG AGTCCATAGA GGCAGCAATA AAGACTGCAA GAGGGCATTT CAGGGGCACG
CGGCCCTACA TAATCGCATT TGCCGGCTCG TTCCACGGGC GAACGTACGG GTCCCTCTCC
CTCACGAGTA GCAAGCCAGT ACAGAGAAGA CACCTAGGCC CGCTACTACC CGGCGTGTTC
CACGCACCCT ATCCATACTG TTACAGGTGC CCCTTCAGGC AGAAGTACCC TGAATGCAAC
CTTTGGTGCG TCGACTTCAT CGAAGAGTGG ATGCTCAAGA AGTACGTACC CCCAGAGGAG
GTCGCCGCAT TCGTCGTAGA GCCGATAGCG GGGGAAGGCG GCTACATAGT ACCGCCGCCC
GAGTTCTTTA AGAGGCTACG CGAGCTAGCG GACAAGTACG GAATACTCCT GGTGGTAGAC
GAGGTCCAGA GCGGATTCGG GAGAACCGGG AAGTGGTTCG CGATAGAGCA CTTCGGAGTG
GAACCCGACA TAATAGCCGT AGCCAAGGGG ATAGCTTCGG GGCTCCCGCT GGGCGCGATA
ATAGGTAGGG CGGAAGTCAT GGACCTACCT CCGGGCTCCC ACGCCTCCAC CTTCGGAGGA
AACCCCGTCA GCTGCGCCGC AGCTCTCGCA ACGATCGAGG TAATAGAGGA GGAAAAACTC
CTGGACAACG CGACGAGAGT AGGTGAATAC GCGATGAAGA GGCTACGCGA GCTACAGGAG
GAAATACCCT ATATAGGAGA CGTGCGTGGG AAAGGGCTCA TGATAGGCGT AGAGCTCATC
GCGAGAGACG GTTCCCCGAA CCCAAAGCTC CTGCAGAAAA CGCTCGAGAT AGCTTTCAAG
AAAGGCCTGC TCGTGATAGG AGCCGGGGTG AGCACTATCC GGATAGCCCC GCCCCTGATA
ATAACCCAGC AAGAAATGGA GACCGGGCTA CGCATACTAG AAGAATCCTT GAGGGAAGCT
TTAAAAGAGG TCTAA
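
The GC content reported for this gene can be recomputed from the sequence above. This is a minimal sketch, assuming GC content means the fraction of G and C bases over the full gene sequence; the record does not say how its value was calculated or rounded. The function name `gc_content` is hypothetical.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a DNA sequence.

    Whitespace (the 10-base grouping and line breaks used in this
    record) is removed before counting.
    """
    dna = "".join(seq.split()).upper()
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


# Usage: paste the whole gene sequence block as one string.
# Only the first line is shown here for brevity.
first_line = "GTGAACGGAC CGAAAATAAT CGTTGAGCCT CCGGGACCTA ACTCTAGGAA GATCGCCGAA"
print(f"{gc_content(first_line):.0%}")
```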
 
Protein sequence
MNGPKIIVEP PGPNSRKIAE KDSALLMQSF ARWYPLVAKR AHGVWVEDVD GNVYLDFNSG 
IGVTNTGHCH PKVVKAIKEQ AERLLHYSLT DFLYEEPVKL AEKLVSITPG RFPKKVFYTN
SGTESIEAAI KTARGHFRGT RPYIIAFAGS FHGRTYGSLS LTSSKPVQRR HLGPLLPGVF
HAPYPYCYRC PFRQKYPECN LWCVDFIEEW MLKKYVPPEE VAAFVVEPIA GEGGYIVPPP
EFFKRLRELA DKYGILLVVD EVQSGFGRTG KWFAIEHFGV EPDIIAVAKG IASGLPLGAI
IGRAEVMDLP PGSHASTFGG NPVSCAAALA TIEVIEEEKL LDNATRVGEY AMKRLRELQE
EIPYIGDVRG KGLMIGVELI ARDGSPNPKL LQKTLEIAFK KGLLVIGAGV STIRIAPPLI
ITQQEMETGL RILEESLREA LKEV
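
The protein sequence corresponds to the gene sequence translated with translation table 11. Note that the gene starts with GTG, which is read as methionine when it serves as the start codon. The sketch below builds the codon table from the usual NCBI ordering (bases T, C, A, G) and treats ATG, GTG and TTG as start codons; that start set is a simplified assumption, not the full list of initiators in table 11. The name `translate_cds` is hypothetical.

```python
BASES = "TCAG"
# Amino acids for codons in TCAG order (first, second, third base).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Simplified set of start codons (an assumption for this sketch).
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace is ignored. Ambiguous bases (e.g. N) are not handled
    and raise KeyError.
    """
    dna = "".join(seq.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # alternative start codons encode Met
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate_cds("GTGAACGGAC CGAAAATAAT CGTTGAGCCT"))  # MNGPKIIVEP
```

Passing the full gene sequence block to `translate_cds` and comparing the result with the protein sequence (after removing its spaces and line breaks) gives a quick consistency check of the record.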