Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_0937 |
Symbol | |
ID | 9155077 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | - |
Start bp | 958984 |
End bp | 962370 |
Gene Length | 3387 bp |
Protein Length | 1128 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | |
Product | pyruvate carboxylase |
Protein accession | YP_003645909 |
Protein GI | 296138666 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.218136 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTCGAGA AGGTCCTGGT GGCCAACCGG GGTGAGATCG CGATTCGCGG TTTTCGTGCC GCGGTGGAGT GCGGCGCGAA GACGGTGGCC GTCTACCCCT ATGAGGACCG CAACTCGCCG CATCGGCAGA AGGCGGATGA GGCGTATCAG ATCGGGGTGG AGGGGCATCC GGTGCGCGCG TATCTCAGCG TCGAGGAGAT CGTCCGGACC GCCAAGCGCT CGGGCGCCGA TGCCGTGTAT CCCGGTTATG GGTTCCTCTC GGAGAACCCT GAGCTGGCTG CTGCCTGTGC CGTCGAGGGG ATCACGTTCG TCGGGCCCCC GACGGAGGTG CTGGAGCTGA CCGGTAACAA GGCGAGCGCT ATAGCCGCGG CGCGAACGGC GGGGCTGCCG GTGTTGGACT CATCGGCGCC GTCGGCCGAT CCGGACACGC TGATCGCCGC CGCGCAGGGC ATGCAGTTCC CGGTGTTCGT CAAGGCGGTC GCCGGTGGTG GCGGGCGCGG TATGCGCCGG GTGAACACGA TCGAGGATCT GCGCGATGCG GTCGAGATCG CCTCGCGGGA GGCGGAATCG GCGTTCGGTG ATCCGACGGT GTTCCTCGAG GAGGCGGTGA TCGATCCGCG GCATATCGAG GTGCAGATCC TGGCCGATAC CGCGGGTAAC GTGATCCACC TGTTCGAGCG GGATTGTTCG TTGCAGCGGC GGCATCAGAA GATGATCGAG ATGGCGCCGG CGCCGAACAT CTCGGAGGCG TTGCGCGAGC GCCTGTGCGC CGATGCGGTG AACTTCGCCC GGTCGATCGG TTACAGCTGC GCGGGCACGG TCGAGTTCCT GGTCGATCCG CAGGGCCGGC ACGTGTTCAT CGAGATGAAT CCGCGGATCC AGGTGGAGCA CACGGTGACC GAGGAGGTGA CCGATGTGGA TCTGGTGGCC GCGCAGCTGC GCATCGCGGC TGGCGCGACA CTGGCGGAGC TGGGCCTGTC GCAGGATGCG ATCCGGCTCA ACGGCGCGGC GCTGCAATGC CGCATCACGA CCGAAGACCC GACCGACGGG TTCCGTCCGG ACACGGGCCG CGTCAGCGCC TACCGCACTC CGGGCGGCGC CGGTATCCGC CTGGACGGCG GTACGTCGGT GGGCGCGGAG ATCAGCCCGC ACTTCGATTC GATGCTGGTC AAGCTCACCA CTCGCGGTCG CGATCGGGAG ACGGCGATCG TGCGCGCACG CCGCGCCCTC GCCGAGTTCC GGATCCGCGG TGTAGCGACC AACATCCCCT ATCTCCAAGC GGTGCTGGCC GATCCGGATT TCGTCGCCGG TGTGGTGACC ACATCGTTCA TCGAGCAGCG GCCCGAGCTG CTGACCGCGC GTGGATCTTC GGACTCGGCC AGCAAGATCC TGCGCTACCT CGCCGATGTC ACGGTGAACA AGCCGCACGG TGATCGCCCG ACCAAGGTCT ACGCCGTCGA CAAACTTCCG GCCGTCGACC TCGCGGCGCC CGCACCGGAC GGTTCCAAGC AGCGGCTCTC CGCGCTCGGT CCGGCGGGGT TCGCGGCCGA TCTGCGAGCA CAAACGGCAC TCGCGGTTAC CGACACCACC TTCCGCGACG CGCACCAGTC GCTCCTCGCC ACCCGGGTGC GCACCTCCGC GCTGGCCGCC ATCGCCCCGT ACGAGGCAAG GCTGACCCCG CAGCTGCTCT CGGTGGAGGC ATGGGGCGGC GCCACCTACG ATGTGGCGCT GCGATTCCTG CACGAGGACC CCTGGGAGCG GCTCGCCACG CTCCGCGAGG AGATGCCGAA CATCTGCCTA CAGATGCTGC TGCGCGGGCG GAACACCGTG GGGTACACCC CGTACCCGCA GCAGGTGACC GACGCCTTCG TGGCGGAGGC AGCCGCGACC GGGGTGGACA TCTTCCGGAT CTTCGATGCA CTCAACGACG TCGAGCAGAT GCGGCCCGCG ATCGAAGCGG TCCTGGCCAC CGGCACGGCG GTCGCGGAGG GTGCCCTCTC GTACACCGGC GACCTGTCGA ACCCCGGCGA GACTCTGTAC ACGCTCGACT ACTACCTCGC CGTCGCCGAG CGGATCGTCA CCGCGGGCGC GCACATCCTG GCGATCAAGG ATATGGCCGG GCTGCTGCGC CCCGCCGCCG CAACCACGCT CGTGACGGCG CTGCGCAAGG AGTTCGACCT CCCCGTGCAC GTGCACACTC ACGACACCCC GGGCGGGCAG CTCGCCACCT ACCTGGCGGC GTGGACGGCC GGCGCCGATG CCGTCGACGG TGCCGCCGCG CCGCTGTCAG GTACCACCAG CCAGCCCTCG CTGTCGTCGA TCGTCGCCGC GACCGCGAAC ACCGAGCGCG ACACCGGGAT CGACCTCGAC GCGGTGTGCG CGCTGGAGCC CTACTGGGAG GCGGTACGCG GCGCGTACGC CCCGTTCGAA TCCGGTCTCC GGGCCCCCAC CGGACGGGTC TACCACCACG AAATCCCCGG CGGGCAACTG TCGAACCTAC GGCAGCAAGC CGGTGCGCTC GGCCTGGCGG AACGGTTCGA GGACATCGAG AACGCCTACG CCGCAGCCGA TCGCATGCTC GGCCGGCTGG TGAAAGTGAC ACCGTCATCG AAGGTCGTGG GTGATCTGGC GCTCGCGTTG GTGGGCACCG GAGTCTCCGC GGAGGAGTTC GCCGCCGACC CCGCCCGCTA CGACATCCCC GATTCGGTGA TCGGGTTCCT GCGCGGCGAG CTGGGCACCC CACCCGGAGG CTGGCCCGAG CCGCTGCGCA CCCGCGCCCT GCAGGGCCGC GCCGACGCCG CACCTGTCAG CGACGTCCCC GCCGACGAGG CCGCCGAGCT GACCGGAACA TCGCAGCAGC GCCGCGCAGC CCTGAGCCGC CTGCTGTTCC CCGGGCCGAT GCGCGAATTC ACCGAACACC AGACGGAATT CGGCGACGTC TCCAAACTCT CGACGAACCA GTTCCTCTAC GGCCTACGCC AAGGCGAAGA GCACCGCGTC CGACTCGCCA CCGGCAAAGA ACTGCTGATC ACACTCGACG GCGTCGGCGA GCCCGACGAA CACGGCAACC GGACACTGGT GTGCACACTC AACGGCCAGT TGCGCACCGT CGCCGTCCGC GATCGCGCAG TACAGGCCGA CGTGCCCGCC GCCGAACGCG CCGACAAGAC CAACCCCGGC CACGTGGCGG CCCCCTTCGC CGGGGTGGTC ACCCCCACGG TGCACGCGGG ATCCACCGTC ACCGTAGGCG ATCAGATCGC CACCATCGAA GCCATGAAGA TGGAAGCCGC GATCACCGCA CCCACCGCAG GCACCATCAC CAGAGTCGCG CTCGCCGGAC CCACCCAGGT CGACGGCGGT GACCTGGTCG CCGTGATCGA AGGCTGA
|
Protein sequence | MFEKVLVANR GEIAIRGFRA AVECGAKTVA VYPYEDRNSP HRQKADEAYQ IGVEGHPVRA YLSVEEIVRT AKRSGADAVY PGYGFLSENP ELAAACAVEG ITFVGPPTEV LELTGNKASA IAAARTAGLP VLDSSAPSAD PDTLIAAAQG MQFPVFVKAV AGGGGRGMRR VNTIEDLRDA VEIASREAES AFGDPTVFLE EAVIDPRHIE VQILADTAGN VIHLFERDCS LQRRHQKMIE MAPAPNISEA LRERLCADAV NFARSIGYSC AGTVEFLVDP QGRHVFIEMN PRIQVEHTVT EEVTDVDLVA AQLRIAAGAT LAELGLSQDA IRLNGAALQC RITTEDPTDG FRPDTGRVSA YRTPGGAGIR LDGGTSVGAE ISPHFDSMLV KLTTRGRDRE TAIVRARRAL AEFRIRGVAT NIPYLQAVLA DPDFVAGVVT TSFIEQRPEL LTARGSSDSA SKILRYLADV TVNKPHGDRP TKVYAVDKLP AVDLAAPAPD GSKQRLSALG PAGFAADLRA QTALAVTDTT FRDAHQSLLA TRVRTSALAA IAPYEARLTP QLLSVEAWGG ATYDVALRFL HEDPWERLAT LREEMPNICL QMLLRGRNTV GYTPYPQQVT DAFVAEAAAT GVDIFRIFDA LNDVEQMRPA IEAVLATGTA VAEGALSYTG DLSNPGETLY TLDYYLAVAE RIVTAGAHIL AIKDMAGLLR PAAATTLVTA LRKEFDLPVH VHTHDTPGGQ LATYLAAWTA GADAVDGAAA PLSGTTSQPS LSSIVAATAN TERDTGIDLD AVCALEPYWE AVRGAYAPFE SGLRAPTGRV YHHEIPGGQL SNLRQQAGAL GLAERFEDIE NAYAAADRML GRLVKVTPSS KVVGDLALAL VGTGVSAEEF AADPARYDIP DSVIGFLRGE LGTPPGGWPE PLRTRALQGR ADAAPVSDVP ADEAAELTGT SQQRRAALSR LLFPGPMREF TEHQTEFGDV SKLSTNQFLY GLRQGEEHRV RLATGKELLI TLDGVGEPDE HGNRTLVCTL NGQLRTVAVR DRAVQADVPA AERADKTNPG HVAAPFAGVV TPTVHAGSTV TVGDQIATIE AMKMEAAITA PTAGTITRVA LAGPTQVDGG DLVAVIEG
|
| |