Gene Information |
Locus tag | Tpen_0119 |
Symbol | |
ID | 4600921 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 93423 |
End bp | 94901 |
Gene Length | 1479 bp |
Protein Length | 492 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 639772873 |
Product | carbamoyl-phosphate synthase L chain, ATP-binding |
Protein accession | YP_919532 |
Protein GI | 119719037 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG4770] Acetyl/propionyl-CoA carboxylase, alpha subunit |
TIGRFAM ID | [TIGR00514] acetyl-CoA carboxylase, biotin carboxylase subunit |
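The coordinate and length fields above agree with each other, which can be checked with a little arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates (an assumption, though it is the convention under which the listed numbers agree) and assuming the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive start/end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    length = gene_length_bp(93423, 94901)    # Start bp, End bp from the record
    print(length, protein_length_aa(length))  # 1479 492
```

Both values match the Gene Length and Protein Length fields of the record.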
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.334605 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGCGAGA TAAGAAAGCT CCTGGTAGCC AATAGAGGTG AGATAGCCGT CAGGATCTTC AGGACAGCGA GGGATCTAGG CATAAAGACC GTGGCTGTTT ACAGCGATGC GGACAAATTG TCTCTGCACA GGCTTCTCGC GGACGAGAGC TACTACCTCG GCCCACCCGA GCCCGCGAAG AGCTACTTAA ACGCGGAGAG GATAGTTAAG ATAGCGGTAA GCGCCGGTGC AGACGCGGTT CACCCGGGTT ACGGTTTCCT CTCGCAGAAC CCCTCCTTCG CCCGCATGGT GATAGAGGAG GGACTTATAT GGGTCGGGCC GAAACCGGAG ACAATGAAGC TCGTCGGGGA CAAGCTCGGG GCCAGGAAGT TCTTCTCGAG TAAAGGCATA CCGGTCGTGC CCGGGGCCTT CGAAGCCGTA GACTTCAATA GCGCTCTGAG CATTGCGGAG GAGATAGGCT TCCCGGTTAT CGTTAAGCCT GCGGGCGGCG GGGGAGGGAT TGGGATGTTC GTCGCCCACA CACCCGAAGA CCTGGAGAAA AACCTGGAGA AAGCTAGGCA ACTAGCAGGC TCGGCGTTCG CGAGGACAGA GGTGTACGTC GAGAAGTACT TCCCAAGGGC GAAGCACATA GAGGTGCAGA TCCTGGGCGA TAAGCGGGGA AGGGTGGTAC ACCTGTTCGA AAGGGAGTGT AGCGTCCAGA GGAGATACCA GAAAGTGGTG GAAGAAGCTC CCTCTCCCTC GCTGACCCAG GAGGAGAGAG AGAAGCTGCT GAGCGCCGCG GTGAAGGCGG CTGAGGCGTG CGGGTACGAG AATGCCGGTA CCTTTGAGTT CCTCTTCGAC GTTGAGAGCA GAAACTTCTA CTTCCTAGAG GTAAACTCGC GCATTCAAGT CGAGCACCCC GTCACGGAGC TCGTAACGGG GCTAGACATC GTCAAGCTAC AGCTGACGGT CGCCGAGGGA GGCGAGATAC CGTTTAGACA AGGCGAAGTG CAGCTTAGGG GGCACGCGAT AGAGGCTAGG GTGTACGCCG AAGATCCCTC TTCGGGCTTC ATACCCTCCC CAGGAAGGAT CACCTACCTT CGGGAACCCG CGGGGCCATG GGTGCGTGTA GATTCGGGAG TCTACGAGGG GTTCGAGGTT CCCCCGTTCT ACGACCCCCT GCTAATGAAA GTGGTCTCCT GGGGTCAGGA CAGGGAGGAG GCCAGGACGC GCCTCCTAAG GGCTTTAAAC GAGCTGAGAA TATCCGGTGT CAAACACAAT AAGTACCTCA TAGTACGGGT ACTTGAAGAT GCTCGTTTCC GCGATGCATC CTACACAACC CGCCTCTTGG AGGACCCGGC GTTTTACGAG AACCTCTTAA GCGAGGATCC GGCGCCCGAC GCTGAGCGGC GTATTGTGCA GGCTCAAGAA GGCGGAGAAA AAGGGGAGCG TACAAGGATT AACGCTTGGA GAGTACTGGC GCGTATACAT CCAGGCTAG
|
Protein sequence | MGEIRKLLVA NRGEIAVRIF RTARDLGIKT VAVYSDADKL SLHRLLADES YYLGPPEPAK SYLNAERIVK IAVSAGADAV HPGYGFLSQN PSFARMVIEE GLIWVGPKPE TMKLVGDKLG ARKFFSSKGI PVVPGAFEAV DFNSALSIAE EIGFPVIVKP AGGGGGIGMF VAHTPEDLEK NLEKARQLAG SAFARTEVYV EKYFPRAKHI EVQILGDKRG RVVHLFEREC SVQRRYQKVV EEAPSPSLTQ EEREKLLSAA VKAAEACGYE NAGTFEFLFD VESRNFYFLE VNSRIQVEHP VTELVTGLDI VKLQLTVAEG GEIPFRQGEV QLRGHAIEAR VYAEDPSSGF IPSPGRITYL REPAGPWVRV DSGVYEGFEV PPFYDPLLMK VVSWGQDREE ARTRLLRALN ELRISGVKHN KYLIVRVLED ARFRDASYTT RLLEDPAFYE NLLSEDPAPD AERRIVQAQE GGEKGERTRI NAWRVLARIH PG
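The two sequence fields can be cross-checked against each other and against the GC content field. The sketch below is illustrative only: the helper names (`clean`, `gc_content`, `translate`) are hypothetical, and it assumes the sequences are pasted as shown here, in space-separated blocks of ten. It uses the standard amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in which codons may act as start codons).

```python
from itertools import product

# Codon table in the conventional TCAG ordering; the amino-acid assignments
# are the same for translation table 11 and the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the block spacing used in the record and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate complete codons in frame 1; unknown codons become 'X'."""
    s = clean(seq)
    end = len(s) - len(s) % 3
    return "".join(CODON_TABLE.get(s[i:i + 3], "X") for i in range(0, end, 3))


if __name__ == "__main__":
    # First 21 bases of the gene sequence above.
    print(translate("ATGGGCGAGA TAAGAAAGCT C"))  # MGEIRKL
    # Last 9 bases: the final two residues followed by the stop codon.
    print(translate("CCAGGCTAG"))  # PG*
```

Passing the full gene sequence to `translate` and `gc_content` would allow a comparison with the Protein sequence and GC content fields; the result of that full comparison is not asserted here.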