Gene Information |
Locus tag | Tpau_4142 |
Symbol | |
ID | 9158330 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | - |
Start bp | 4272034 |
End bp | 4273842 |
Gene Length | 1809 bp |
Protein Length | 602 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | |
Product | choline/carnitine/betaine transporter |
Protein accession | YP_003649050 |
Protein GI | 296141807 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
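The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names `gene_length` and `protein_length` are hypothetical and not part of the record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Tpau_4142 record above.
length_bp = gene_length(4272034, 4273842)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

With the values from this record, the result agrees with the listed Gene Length (1809 bp) and Protein Length (602 aa).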
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTACTG TTACTAACGA GACAGGAAAC CATCAGCGGA CCGACTGGGT CGTGTTCGGT GTAGCGGCCG TCAGCGTTCT GGCGTTCGTC ATATGGGGTT TCCTTGATTC GGACGGGTTG AAGCAGACGA CCTCCGATGT CCTTGACTGG ATCATTACCG ATTTGGGCTG GCTGTTCCTG ATCTCAGCCA CGCTCTTCGT TCTTTTCGCG ATCTTCCTGG CCGTTTCCCG CTTCGGTCGC ATCCCGCTCG GCCGGGACGG CGAGAAGCCC GAGTACAAGA CGGTCTCCTG GATCGCCATG ATGTTCAGCG CCGGAATGGG CATCGGCCTG ATGTTCTTCG GCGCCGCGGA GCCGATCTAC CACTTCGTCG GTGCGCCTCC GGGCACCTCC AGCCACGATG TGGCGGTGGC GATGGCCACC ACGATGTTCC ACTGGGGCTT CCACCCGTGG GCCATCTACG CCGTTGTCGG CCTGGCGATC GCCTACAGCA CCTTCCGGTG CGGCCGCAGC CAGCTGATCA GCTCGGTTTT CGCGCCCATC TTCAACCGCA CCGGTGGTCA GGGCGCGGGT GGGCGGATCA TCGACATCCT GGCCATCTTC GCCACCCTGT TCGGTACCAC CGCCTCGCTG GGCCTCGGCG CCGCGCAGGT GGGCGCCGGT CTGGAGCGGC TGGGCTGGGT GGGTGACGGT TCGAGCAAGC TCCTGCTGGT GGCGATCATC GCCCTCCTGA CGCTCGCCTT CGTGGCCTCC GCGGTCTCCG GTATCGCCAA GGGCATCCAG TGGCTCTCCA ACACCAACAT GGTGCTGGCA CTGGTACTCG CGGTCTTCGT CTTCGTGGTG GGGCCCACGG TGTTCATCCT CAACCTGCTG CCCACCACCA TGGGCGCCTA TGCGGCGGAC TTCATGGACA TGTCGGCCCG GTCGGCGGCC AACGAACCCG AGGCCGGTGC CTGGCTCGCG AAGTGGACCA TCTTCTATTG GGCCTGGTGG GTCAGTTGGA CCCCGTTCGT GGGCCTGTTC CTGGCGAAGA TCTCGAAGGG CCGCACCATC CGCGAGTTCG TGATCGGCGT CATGGCAGTG CCCACCCTGG TGTCGTTGGT GTGGTTCGTC ATCTTCGGCG GGACCGCGAT CAACCAGGAG CAGAGCGGGC TCGGGGTGAG TTCGGCCGAG AACGAGGAGA AGATGCTCTT CGACGTGCTC GGGAACCTGC CGTGGCCCAC GATCACGGCC TTCCTGGTGG TGTTGCTGGT GGGGATCTTC TTCGTCTCCG GTGCTGATTC GGCATCGATC GTGATGGGAA CGCTGTCGCA GAAGGGCGAA GAGGAACCGA ATCGCCTGAT CACCATCTTC TGGGGTGTGC TCACCGGCGG CGTGGCGGCG CTGTTGCTCT GGGTGAGCGG TAACAACGCG CTGGAGGGCA TCAAACAGAT GGCCATCATC GCCGCGGCAC CGTTCCTGGT GGTGATGCTC GGCATGTGCG TGGCGCTGAT GATGGACCTC TGGCACGATC CGCTGATCGT CGCCGAGCGG CAGCGCCGTG ACGATCTCGG TCTTCGTGTC CGGGTGCACG CGAACACCCT TGCCGTGACG GATGATTCCA CCGACGTGCT CCCCTCCGAG GACGTCCCGG TGTACGTGGA CGGCGAGGTC CCGGAGGACC TCTACCATCC CGCGCACACC GGCGAGATGG TCGCCGTCGA GGTCTTCGAG GCCGATGCCG AGAACAACGA GGCGTCGACC GCGCGCAAGA CGGTCGAGGC GAGCGGCGAT GTCCGTGTGG TTATCACCAA GAACGACCAG 
AAGCCGTAG
|
Protein sequence | MATVTNETGN HQRTDWVVFG VAAVSVLAFV IWGFLDSDGL KQTTSDVLDW IITDLGWLFL ISATLFVLFA IFLAVSRFGR IPLGRDGEKP EYKTVSWIAM MFSAGMGIGL MFFGAAEPIY HFVGAPPGTS SHDVAVAMAT TMFHWGFHPW AIYAVVGLAI AYSTFRCGRS QLISSVFAPI FNRTGGQGAG GRIIDILAIF ATLFGTTASL GLGAAQVGAG LERLGWVGDG SSKLLLVAII ALLTLAFVAS AVSGIAKGIQ WLSNTNMVLA LVLAVFVFVV GPTVFILNLL PTTMGAYAAD FMDMSARSAA NEPEAGAWLA KWTIFYWAWW VSWTPFVGLF LAKISKGRTI REFVIGVMAV PTLVSLVWFV IFGGTAINQE QSGLGVSSAE NEEKMLFDVL GNLPWPTITA FLVVLLVGIF FVSGADSASI VMGTLSQKGE EEPNRLITIF WGVLTGGVAA LLLWVSGNNA LEGIKQMAII AAAPFLVVML GMCVALMMDL WHDPLIVAER QRRDDLGLRV RVHANTLAVT DDSTDVLPSE DVPVYVDGEV PEDLYHPAHT GEMVAVEVFE ADAENNEAST ARKTVEASGD VRVVITKNDQ KP
|
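The relationship between the two sequences above can be illustrated with a minimal translation sketch. The names `CODON_TABLE` and `translate` are hypothetical helpers written for this example. The sketch assumes the gene sequence is given 5' to 3' on the coding strand (it starts with ATG even though the gene lies on the minus strand of the replicon), and it uses the standard codon assignments, which translation table 11 shares for amino acids and stop codons. Table 11 also permits alternative start codons; those are not handled here because this gene starts with ATG.

```python
from itertools import product

# Codon assignments in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 9 bases of the gene sequence in the record.
n_terminus = translate("ATGGCTACTG TTACTAACGA GACAGGAAAC")
c_terminus = translate("AAGCCGTAG")
print(n_terminus, c_terminus)
```

The first ten residues and the last two residues produced this way match the start (MATVTNETGN) and end (KP) of the listed protein sequence. The final codon, TAG, is a stop codon and contributes no residue.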