Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_0397 |
Symbol | |
ID | 9154532 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | + |
Start bp | 414625 |
End bp | 417195 |
Gene Length | 2571 bp |
Protein Length | 856 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | |
Product | protein of unknown function DUF214 |
Protein accession | YP_003645377 |
Protein GI | 296138134 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0721103 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCTCAC CCATGCGCAA GGTGGCCTTG CGCAATCTGA TGGCGCACAA GGGCCGGCTG TTCCTGACGG TCCTCTCGGT GCTCCTCGGC ACCTCGTTCA TCGCCGGATC GATGGTCTTC ACCGGAACGC TGTCGAAGGC TTTCGACGGG ATCTCGGACC GGATCGCGGT GGGGGTCGAC GCGCGAGTCG CGCCGAAGGA TGCGCAGGGG CAGGGCGGTT TCGGGCCACA GGGACCCGGT GTCCCGCTCA GCGCGCTCGA CCGGGTCCGC GCCGTGCCGG GTGTGCGGAT CGCGCAGCCC GCCGTCACCG GTTCCATCGC GCTCCGCGAC GGCAAGGGCG ATGTGGTCTC GCCCACCGGC GCGCCGCAGA CCGGCGGTGC CTACCTGCCG CCGGGGGAGA GCTTCGAGTC CGATGGCCTG AAGATCACCG AGGGCCGCGC ACCGTCGGGC CCGAACGAGA TGGTGCTCAA CGACTCCGCG GCCGACCGGC TCGATCTACC GGTGGGGTCC AAGACCACCG TGGTGACGCC GCGGTCGCCG GGGCCGATCG AGGCGACCAT CGTCGGCACG TACACCGCGT CGGTCGACAC GGGCGGGTAC ATCGGCGCCC TGTTCGCGAA GGACGAGGCG CAGAAGCTGT TCGGCGACGG CGCGACCACA CCGAACATCG ACGTGGCCGC GGCGCCGGGA ACCAGCCCCG AGCAACTGCG CGACCGTCTC GCGCAGGCGC TACCCGAACT CGACGTCAAG ACCGGTGACG AGGTGCGCAC CGAGTTCACC GACCAGGTGA ACCAGGGGCT CAGCTTCCTC AATTACTTCC TCGTCGCCTT CGGTCTGATC GGTTTGCTGG TAGGCGTTTT CATCATCTAC AACACGTTCT CGATGCTCGT CGCACAGCGC CTCAAGGAGC TCGCGCTGCT GCGCGCGATC GGCGCCAGCC GTCCACAGGT ACGCAACTCC GTGGTGCTCG AGGCGCTCGT CGTCGGCGTG CTCGGCAGTG CTGCCGGACT CGCGACCGGC ATCGGTCTGG CCTGGCTGTT GCAGGCCGTG GTGAAGGCCG CGGGTGCGGG CTTCCCCGAT ACGGGCATCG TGGTCGCGCC CTCGGTCGCG ATCACCGTGA TGCTCGTGGG CACCGTGGTC ACGGTGATCT CCGCGTTGAT TCCCGCGGTC CGGGCGTCGA AAGTCCCACC CGTGGCAGCG ATGCGAGCGC AGGACGGCGG ATCCGCCGCA TCGGTCCTGG TGCGCGGCGC GATCGGTGCG GCCCTTTTCC TCAGTGCGCT CGCGCTGCTG TTCGTGGCGA CCACCGAGTC CGGTTCCGGT GCAGCGATCA TGGTGGGCGT CGCCGGCTTC GGACTGGTGC TCGCCGTGGT GATCGGCGGC CCGGCTCTCG TCGGCCCCGT GCTCGGCGGC GTCGGCACGG TGCTCGCCAA ACCCTTCGGC CCGGCCGGCC GATTGGGCCG TACCAACGTG ATGCGTAATC CTCAGCGCAC CACCGCGACC GCGTTCGCCC TGGTGATCGG CGTGGCGCTG GTGGGTGTGA TCGGCACCCT GGGGGCGTCC ATGCAGAAGT CGATCGACTC GCAGGTCGAC ACGGGTATCA GGTCCGATCT GGTGCTGCAG GCGCCGCAGT TCGGGATGCC GCCCGCGGCG CTCACCGCGA TCAAGGACGT CCCCGGCATC GGCACCAAGA CAACGCTGTA CGCGGTGCCC ACGCGGATCG ACGGTGACCG GCAGAACCTG CTCGCCGTCG ACGGCGATGT GAACGCGGTG TTCAATCTGA CCCGGGTCGC CGGCACGCTG GACCTGAAAT CCGGTGGCCT CCTGATCGAC GAGGACAAGG CGCGGGCAGA GGGCTGGGAG GTCGGCTCGA AGGTCGCCCT GGGCTCGGCC ACCGGCCGGG AGCAGGCCGA GACGACGGTC ACCGGGATCT ATGCCAAAGC CGGTGGCCTG TCCGGGCCGG TGGTCACCGT CGGTGAGGTG AACACGCTGT ATCCGCCCGC GCAGGGGGCG ACCAGCCCGA TCGTCACGCC GCAATCGGTC TTCATCGGCG CCGCCGACGG CACCTCCGTC GAGGACCTCA AGGACCGGCT GCGCGAGGCG GTCAAACCGT TGCTGGTGGT CAATGTCGAC GATCAGCAAG ACCTCAAGGA CCAAGCAGGC CAGGCGATCA CCGCCCTGAT GGGCGTGCTC TACGGGCTGC TCGGCTTGGC GGTGATCATC GCGATCCTGG GCATCATCAA CACACTGGCG CTGTCGGTGG TGGAGCGGCG CCGCGAGATC GGCATGCTCC GCGCGATCGG CATGATCCGC TCCCAGGTGC GCAAGTCGAT CTATCTCGAA TCGATGCTGA TCGCGCTGTT CGGCGCCGTA CTCGGCCTGG TGCTCGGCGT GCTACTGGGC ACGTCGCTGG TGTACGCGCT GCGTGATGAG GGGCTCGGCT CCGTGGTCGT CCCGTGGTCG ACGGTGTTGG TGATGCTGGT CGCCTCGGCC TTCGTCGGCG TGGGTGCGGC GATCCTTCCG GCCATCCGTG CCTCCCGCAC GCCGCCACTC GCCGCGATCG CGGAGGGCTA G
|
Protein sequence | MASPMRKVAL RNLMAHKGRL FLTVLSVLLG TSFIAGSMVF TGTLSKAFDG ISDRIAVGVD ARVAPKDAQG QGGFGPQGPG VPLSALDRVR AVPGVRIAQP AVTGSIALRD GKGDVVSPTG APQTGGAYLP PGESFESDGL KITEGRAPSG PNEMVLNDSA ADRLDLPVGS KTTVVTPRSP GPIEATIVGT YTASVDTGGY IGALFAKDEA QKLFGDGATT PNIDVAAAPG TSPEQLRDRL AQALPELDVK TGDEVRTEFT DQVNQGLSFL NYFLVAFGLI GLLVGVFIIY NTFSMLVAQR LKELALLRAI GASRPQVRNS VVLEALVVGV LGSAAGLATG IGLAWLLQAV VKAAGAGFPD TGIVVAPSVA ITVMLVGTVV TVISALIPAV RASKVPPVAA MRAQDGGSAA SVLVRGAIGA ALFLSALALL FVATTESGSG AAIMVGVAGF GLVLAVVIGG PALVGPVLGG VGTVLAKPFG PAGRLGRTNV MRNPQRTTAT AFALVIGVAL VGVIGTLGAS MQKSIDSQVD TGIRSDLVLQ APQFGMPPAA LTAIKDVPGI GTKTTLYAVP TRIDGDRQNL LAVDGDVNAV FNLTRVAGTL DLKSGGLLID EDKARAEGWE VGSKVALGSA TGREQAETTV TGIYAKAGGL SGPVVTVGEV NTLYPPAQGA TSPIVTPQSV FIGAADGTSV EDLKDRLREA VKPLLVVNVD DQQDLKDQAG QAITALMGVL YGLLGLAVII AILGIINTLA LSVVERRREI GMLRAIGMIR SQVRKSIYLE SMLIALFGAV LGLVLGVLLG TSLVYALRDE GLGSVVVPWS TVLVMLVASA FVGVGAAILP AIRASRTPPL AAIAEG
|
| |