Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_4202 |
Symbol | |
ID | 9158390 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | + |
Start bp | 4330731 |
End bp | 4332785 |
Gene Length | 2055 bp |
Protein Length | 684 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | |
Product | protein of unknown function DUF839 |
Protein accession | YP_003649109 |
Protein GI | 296141866 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCCTGC CCAACCTGCG TCTGCTCGTC GATCACGACG GCCAGTCCGC GCGCTCGCAC ACCACCTGCA AGTACAAGTG CGGCGAGCAG TGTTTCCGCC CCCACGACAA CACGTCCGAC AACGAGTACT TCGGCGATCT GACCCGCCGC ACCATTCTCA AGGGTGCGGG CATCGCGGCC GTGGGCGTGG GCGCCACCAC GGTGCTCGCC GCCTGTGGCG ACGCGAGCAC CGGCTCCGCC GGTTCGTCCT CAGCGGGTTC GGCAGCAGGT GCCGATATCA AGGCTCCCGG ACTGAATTTC ACCTCGGTGG CCGCGAACAG CGAAGACCGC GTCGTGATTC CGGAAGGTTA CGAGCAGGCC GTGGTGATCG CCTGGGGTGA TCCGGTGCTG CCGGGCGCGC CGAAGTTCGA CTTCGACAAC CAGACTCCGG AGGCGCAGGC CCAACAGTTC GGCTACAACA ACGATTTCAC CGATCTGATC GAGGTGGACG GTAAGCCGGA CACGTATCTG ATGGTGTGCA ACCAGGAGTA CACCACCGGA AAGCACATGT TCAAGGGCTA CGACGAGAAG AACCCCACCG AGAATCAGGT GCGGGTCGAG ATGGCCGCGC ACGGCCTCAC TGTCGTGGAG GTGAAGGGCG AGCCGGGCTC CGGCAAGCTG ACTCCTGTTC TGGGCGAATA CAATCGGCGC GTCACCGCAT CGACGGAGTT CGAGCTGCGC GGCCCCGCCG CCGGTTCGTG GTTCGTCAAG ACCGCCGCCG ATCCCACGGG CACCCGGGTA CTGGGCACCA TCGCGAACTG CTCCGGCGGC ATCACCCCCT GGGGCACCAT GCTGTCCGGC GAGGAGAACT ACACCAACTA CTTCTCCGGC GCCGACACCC CGCCGATCGA CGGCCTCAAG GAGGGTTGGA AGCGGTACGG ACTGGGCAAG GACTCGGACT ACCACAACTG GGGTAAATAC GAGGACCGGT TCAACCTGGC CAAGGAGATC AACGAGGCCA ACCGATTCGG CTACATCGTC GAGCTCAATC CCCACGATCC GCGCTCCACG CCGGTGAAGC ACTCGGCCAT GGGCCGGTTC AAGCACGAGG GCTCCAACAT CCATGTGACC TCGAACGGCA CTGTGGTGGC GTATTCGGGT GACGACTCGA AATTCGAGTA CATCTACAAG TTCGTCTCGT CGCGGAAGAT CCAGCAGGGC AAGGGGCCCG CCGCAATGGA GTCGAACATG CGGATCCTCG ACGAGGGCAC TCTGTACGTG GCCAAGTTCG AGGGCCCGAA GGTCGAGGAC GGCAAGGTTC CGGCGGGCGG GTACAAGGGA ACCGGAAAGT GGATCGCCTT GCTCACGGTG GACGCATCCG GCAAGGCGAC CTCGCACATC GACGGCATGC CGCCGGAGAA CGTCGCGGTG TACACCCGGG TGGCCGCCGA TAAGGCCGGC GCCACCAAGA TGGACCGCCC GGAGGACATC CAGCCGCACC CGAAGACCGG GAAGGTGTAC TGCGCGCTGA CCAACAACAG CGACCGCGGT ACCGAGGGCA AGGCCGGTAT CGACGCGGCC AACCCCCGGG TGAAGAACAA GAACGGCCAG ATCCTGGAGA TCACCGACGA TCACACCGGC ACCGAGTTCT CCTGGGACCT GCTGCTGGTG TGCGGCGATC CCGCCGCGGC CGACACCTAC TACGGCGGTT ACGACAAGAG CAAGGTGTCG CGCATCAGCT GCCCCGACAA CGTGGCCTTC GACTCGTTCG GCAACCTGTG GATCTCGACC GACGGCACCC AGGACACCTT CAAGAGCAAC GACGGCCTGT TCGGCGTGGT GCTGGAGGGC AAGGACCGGG GCCTGACGAA GCAGTTCCTC ACTGTGCCCT ACGGCGCCGA GACATGCGGT CCCGTCATCC GAGACAACCG CGTTCTTGTC GCAGTTCAGC ATCCGGGTGA GACGGACGAC GGAGGCGCCG GCAGCCCCAC CTCGCACTGG CCCGACGGCG GCACGAACGA GCCGCGACCT GCGGTGGTCG CCGTTTGGAA GACGAGCGGC AACATCGGCA CGTAG
|
Protein sequence | MRLPNLRLLV DHDGQSARSH TTCKYKCGEQ CFRPHDNTSD NEYFGDLTRR TILKGAGIAA VGVGATTVLA ACGDASTGSA GSSSAGSAAG ADIKAPGLNF TSVAANSEDR VVIPEGYEQA VVIAWGDPVL PGAPKFDFDN QTPEAQAQQF GYNNDFTDLI EVDGKPDTYL MVCNQEYTTG KHMFKGYDEK NPTENQVRVE MAAHGLTVVE VKGEPGSGKL TPVLGEYNRR VTASTEFELR GPAAGSWFVK TAADPTGTRV LGTIANCSGG ITPWGTMLSG EENYTNYFSG ADTPPIDGLK EGWKRYGLGK DSDYHNWGKY EDRFNLAKEI NEANRFGYIV ELNPHDPRST PVKHSAMGRF KHEGSNIHVT SNGTVVAYSG DDSKFEYIYK FVSSRKIQQG KGPAAMESNM RILDEGTLYV AKFEGPKVED GKVPAGGYKG TGKWIALLTV DASGKATSHI DGMPPENVAV YTRVAADKAG ATKMDRPEDI QPHPKTGKVY CALTNNSDRG TEGKAGIDAA NPRVKNKNGQ ILEITDDHTG TEFSWDLLLV CGDPAAADTY YGGYDKSKVS RISCPDNVAF DSFGNLWIST DGTQDTFKSN DGLFGVVLEG KDRGLTKQFL TVPYGAETCG PVIRDNRVLV AVQHPGETDD GGAGSPTSHW PDGGTNEPRP AVVAVWKTSG NIGT
|
| |