Gene Tpen_1094 details


Gene Information

Locus tag: Tpen_1094
Symbol:
ID: 4600961
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermofilum pendens Hrk 5
Kingdom: Archaea
Replicon accession: NC_008698
Strand:
Start bp: 1031494
End bp: 1033323
Gene Length: 1830 bp
Protein Length: 609 aa
Translation table: 11
GC content: 64%
IMG OID: 639773871
Product: glutamine amidotransferase, class-II
Protein accession: YP_920496
Protein GI: 119720001
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0449] Glucosamine 6-phosphate synthetase, contains amidotransferase and phosphosugar isomerase domains
TIGRFAM ID: [TIGR01135] glucosamine--fructose-6-phosphate aminotransferase (isomerizing)
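
The length fields above follow from the coordinates by simple arithmetic. The short Python sketch below checks that relationship. It assumes the Start bp and End bp coordinates are 1-based and inclusive, and that the last codon of the gene is a stop codon that contributes no residue; the variable names are illustrative only and are not part of the database record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1031494
end_bp = 1033323
gene_length_bp = 1830
protein_length_aa = 609

# Assumption: coordinates are 1-based and inclusive, so both ends count.
computed_gene_length = end_bp - start_bp + 1

# Assumption: one residue per 3 bp codon, minus the terminal stop codon.
computed_protein_length = computed_gene_length // 3 - 1

print(computed_gene_length == gene_length_bp)        # True
print(computed_protein_length == protein_length_aa)  # True
```

Under these assumptions the record is internally consistent: 1830 bp is 610 codons, of which 609 encode amino acids.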


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.162573
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGGGGCA TATTCGGGGC CGTGTCTAGG AGCGGCGGGA ACGTCGTCCC GCTCGTCGTG 
ACAGGCCTCG AAAGGCTGAA GTACAGGGGC ACCGACAACT CGGGAATAGC CGTAGCCCGA
GAAGGGCGCC TAGAGGTCTA CAAGGACACA GGGCCCATAG ACGTGGTAGC GCGCAAGCTA
GGCCTCGACA AGCTCCAGGG TAGCGTAGCC CTCGGGCACA CGCGCTACGC CACTCACGGT
AGGCCTACAG CCGAGAACGC CCACCCCCAC GTCGACTGCG GGAGACGCTT AGCGGTAGTC
GGCGACGGCT CTATCTCGAA CTACGAGGAG CTGAAGGACA AGGTGTTACT CAACGGGCAC
AGGCTAACGT CGAGGAGCGA CTTCGAAGTC GTGGCCCACG TCCTGGAGGA AGCGTTCAGG
GAGGGCCGCG CCCCCGAGGC TTTACCGGGA GTGCTCTCCG AGAAGCTGCA GGGCTTCTTC
GCCGTAGCCT TCCTGGACGC ATCCACGGGT AGCATCTACG CCGCGACGAC GGGGCCGCAA
CTCTTCCTCG GCGCGTCCCG GGAACTCTTC CTCGTATCTA CGAGCAAGTA CGCTATGCAC
GGCTTCGCGG AGCGCTACAG GGAGGTGAGG AGGGGCGAGG TAGTCCGGGT CTCCAGCGAG
GGAGTCGAGG TGTACACGGG CGCGGGGGTC GGGGGGCTCG GGGAACTACA ACCCTTGCAT
CTCGACCCCT CCCTTGTGGA GAAGAACGGC TACAAGCACC ACATGCTCCG GGAGATATAC
GAGGTGCCCG AGTCCCTTAT GAGGACCCTG AGCTCCGTGC AGAAGAAGTA CCTCCAGCTC
GCGGCAAGGC TCGTTACAGG CGCGGACAAC GTCTACATAA TCGCCAACGG TACGAGCCTG
CATGCGGGGA TGGTCGCTTC CTACTACTTC TCGGAGCTCG TAGGGGTAAA CCCCGTAGTG
GTCAGCGCGG CGGAGTTCCC GCTCTACTAC CTCGAAAACA TAGGCCCCGG CAGCCTGGTC
CTAGCGATAT CCCAGTCCGG CGAGACGGGG GACGTGCTCT CCTCCCTCTA CGAGGCGAAG
CTCCGCGGGG CGACGATACT GGGCATAACG AACTACGTGG GCTCGCGGCT CGCCCGCCTC
TCAAACCTCT ACCTCCCGAT AGCGGCCGGC CCCGAGCTAG CGGTTCCGGC GACGAAGACC
TTTACCTCTA CTCTGCTACT GCTGTACCTG GTGGCCCTCA GGGCGTCCAG GCAGGAGGGC
AGGATAGACG AGGACACGCT TAACTCGAAG CTAGCCGCGG TAGCCGAGGC GGCGCGGCAA
CTCGGGGAGT GGCTACCCAA GGTTGACTCG GAGGCCTCCA AGGCGGCAAA CGAGGTATCC
GAGTGCAGGG GAGGCTACGT GGTGTCGAGG GGGCTTACCT ACCCGCTCGC GCTCGAAGGG
GCCTTGAAGC TGAAAGAAGC CTCCTACTTC CACGCCGAGG GGGTAGAGGC GGGCGAGTTC
AAGCACGGTC CATTCGTACT CGTCGAGAAG GGCTTCGGCG TGGTATTCGT CGTACCTGTC
GAGAAAGTCT CCGCCGAGGC TACCTACCCG CTAGTCGGGA TGGCTCTGGA AGCCGGCGCC
AAGGTCGTAG CGGTAGGCTT CGCCGGAGAC CAGAGCCTGG AAGCCCTCTC GGAGAAAGGC
GCCGCCGTCG TAGCCGCGCC GCCCGCGGAG AGGCACCTCG CGCCGATAGT CCTGGCGGTT
CCCCTGCAGC TACTGGCCTA CAGGCTGGGC GAGAGGCTTT CAAGACCAAT CGATTCCCCG
CGCTACCTGA CGAAAGCCGT TACCCAGTGA
 
Protein sequence
MGGIFGAVSR SGGNVVPLVV TGLERLKYRG TDNSGIAVAR EGRLEVYKDT GPIDVVARKL 
GLDKLQGSVA LGHTRYATHG RPTAENAHPH VDCGRRLAVV GDGSISNYEE LKDKVLLNGH
RLTSRSDFEV VAHVLEEAFR EGRAPEALPG VLSEKLQGFF AVAFLDASTG SIYAATTGPQ
LFLGASRELF LVSTSKYAMH GFAERYREVR RGEVVRVSSE GVEVYTGAGV GGLGELQPLH
LDPSLVEKNG YKHHMLREIY EVPESLMRTL SSVQKKYLQL AARLVTGADN VYIIANGTSL
HAGMVASYYF SELVGVNPVV VSAAEFPLYY LENIGPGSLV LAISQSGETG DVLSSLYEAK
LRGATILGIT NYVGSRLARL SNLYLPIAAG PELAVPATKT FTSTLLLLYL VALRASRQEG
RIDEDTLNSK LAAVAEAARQ LGEWLPKVDS EASKAANEVS ECRGGYVVSR GLTYPLALEG
ALKLKEASYF HAEGVEAGEF KHGPFVLVEK GFGVVFVVPV EKVSAEATYP LVGMALEAGA
KVVAVGFAGD QSLEALSEKG AAVVAAPPAE RHLAPIVLAV PLQLLAYRLG ERLSRPIDSP
RYLTKAVTQ
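
The gene and protein sequences above are printed in blocks of ten characters, so the whitespace has to be stripped before they can be used. The Python sketch below shows one way to do that, to compute a GC fraction, and to translate the nucleotide sequence. It is a minimal illustration, not the database's own method: the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and the codon table is the standard one, which translation table 11 shares for amino acid assignments (table 11 differs in its permitted start codons, which does not matter here because this gene starts with ATG).

```python
from itertools import product

# Standard genetic code in TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases (0.0 for an empty sequence)."""
    seq = clean(seq)
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at a stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First display line of the gene sequence above.
first_line = "ATGGGGGGCA TATTCGGGGC CGTGTCTAGG AGCGGCGGGA ACGTCGTCCC GCTCGTCGTG"
print(translate(first_line))  # MGGIFGAVSRSGGNVVPLVV
```

The translated fragment matches the first twenty residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.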