Gene Tpen_0119 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_0119 
Symbol 
ID4600921 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp93423 
End bp94901 
Gene Length1479 bp 
Protein Length492 aa 
Translation table11 
GC content58% 
IMG OID639772873 
Productcarbamoyl-phosphate synthase L chain, ATP-binding 
Protein accessionYP_919532 
Protein GI119719037 
COG category[I] Lipid transport and metabolism 
COG ID[COG4770] Acetyl/propionyl-CoA carboxylase, alpha subunit 
TIGRFAM ID[TIGR00514] acetyl-CoA carboxylase, biotin carboxylase subunit 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.334605 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGCGAGA TAAGAAAGCT CCTGGTAGCC AATAGAGGTG AGATAGCCGT CAGGATCTTC 
AGGACAGCGA GGGATCTAGG CATAAAGACC GTGGCTGTTT ACAGCGATGC GGACAAATTG
TCTCTGCACA GGCTTCTCGC GGACGAGAGC TACTACCTCG GCCCACCCGA GCCCGCGAAG
AGCTACTTAA ACGCGGAGAG GATAGTTAAG ATAGCGGTAA GCGCCGGTGC AGACGCGGTT
CACCCGGGTT ACGGTTTCCT CTCGCAGAAC CCCTCCTTCG CCCGCATGGT GATAGAGGAG
GGACTTATAT GGGTCGGGCC GAAACCGGAG ACAATGAAGC TCGTCGGGGA CAAGCTCGGG
GCCAGGAAGT TCTTCTCGAG TAAAGGCATA CCGGTCGTGC CCGGGGCCTT CGAAGCCGTA
GACTTCAATA GCGCTCTGAG CATTGCGGAG GAGATAGGCT TCCCGGTTAT CGTTAAGCCT
GCGGGCGGCG GGGGAGGGAT TGGGATGTTC GTCGCCCACA CACCCGAAGA CCTGGAGAAA
AACCTGGAGA AAGCTAGGCA ACTAGCAGGC TCGGCGTTCG CGAGGACAGA GGTGTACGTC
GAGAAGTACT TCCCAAGGGC GAAGCACATA GAGGTGCAGA TCCTGGGCGA TAAGCGGGGA
AGGGTGGTAC ACCTGTTCGA AAGGGAGTGT AGCGTCCAGA GGAGATACCA GAAAGTGGTG
GAAGAAGCTC CCTCTCCCTC GCTGACCCAG GAGGAGAGAG AGAAGCTGCT GAGCGCCGCG
GTGAAGGCGG CTGAGGCGTG CGGGTACGAG AATGCCGGTA CCTTTGAGTT CCTCTTCGAC
GTTGAGAGCA GAAACTTCTA CTTCCTAGAG GTAAACTCGC GCATTCAAGT CGAGCACCCC
GTCACGGAGC TCGTAACGGG GCTAGACATC GTCAAGCTAC AGCTGACGGT CGCCGAGGGA
GGCGAGATAC CGTTTAGACA AGGCGAAGTG CAGCTTAGGG GGCACGCGAT AGAGGCTAGG
GTGTACGCCG AAGATCCCTC TTCGGGCTTC ATACCCTCCC CAGGAAGGAT CACCTACCTT
CGGGAACCCG CGGGGCCATG GGTGCGTGTA GATTCGGGAG TCTACGAGGG GTTCGAGGTT
CCCCCGTTCT ACGACCCCCT GCTAATGAAA GTGGTCTCCT GGGGTCAGGA CAGGGAGGAG
GCCAGGACGC GCCTCCTAAG GGCTTTAAAC GAGCTGAGAA TATCCGGTGT CAAACACAAT
AAGTACCTCA TAGTACGGGT ACTTGAAGAT GCTCGTTTCC GCGATGCATC CTACACAACC
CGCCTCTTGG AGGACCCGGC GTTTTACGAG AACCTCTTAA GCGAGGATCC GGCGCCCGAC
GCTGAGCGGC GTATTGTGCA GGCTCAAGAA GGCGGAGAAA AAGGGGAGCG TACAAGGATT
AACGCTTGGA GAGTACTGGC GCGTATACAT CCAGGCTAG
 
Protein sequence
MGEIRKLLVA NRGEIAVRIF RTARDLGIKT VAVYSDADKL SLHRLLADES YYLGPPEPAK 
SYLNAERIVK IAVSAGADAV HPGYGFLSQN PSFARMVIEE GLIWVGPKPE TMKLVGDKLG
ARKFFSSKGI PVVPGAFEAV DFNSALSIAE EIGFPVIVKP AGGGGGIGMF VAHTPEDLEK
NLEKARQLAG SAFARTEVYV EKYFPRAKHI EVQILGDKRG RVVHLFEREC SVQRRYQKVV
EEAPSPSLTQ EEREKLLSAA VKAAEACGYE NAGTFEFLFD VESRNFYFLE VNSRIQVEHP
VTELVTGLDI VKLQLTVAEG GEIPFRQGEV QLRGHAIEAR VYAEDPSSGF IPSPGRITYL
REPAGPWVRV DSGVYEGFEV PPFYDPLLMK VVSWGQDREE ARTRLLRALN ELRISGVKHN
KYLIVRVLED ARFRDASYTT RLLEDPAFYE NLLSEDPAPD AERRIVQAQE GGEKGERTRI
NAWRVLARIH PG