Gene Tpau_4142 details

Gene Information

Locus tag: Tpau_4142
Symbol:
ID: 9158330
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Tsukamurella paurometabola DSM 20162
Kingdom: Bacteria
Replicon accession: NC_014158
Strand:
Start bp: 4272034
End bp: 4273842
Gene Length: 1809 bp
Protein Length: 602 aa
Translation table: 11
GC content: 64%
IMG OID:
Product: choline/carnitine/betaine transporter
Protein accession: YP_003649050
Protein GI: 296141807
COG category:
COG ID:
TIGRFAM ID:
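
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon. The function names are hypothetical and not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a 1-based, inclusive coordinate span (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of residues encoded by a CDS whose length includes the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return gene_length_bp // 3 - 1


gene_length = span_length(4272034, 4273842)
print(gene_length)                           # 1809
print(expected_protein_length(gene_length))  # 602
```

Under these assumptions the span gives 1809 bp and 603 codons, one of which is the stop codon, matching the listed protein length of 602 aa.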


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCTACTG TTACTAACGA GACAGGAAAC CATCAGCGGA CCGACTGGGT CGTGTTCGGT 
GTAGCGGCCG TCAGCGTTCT GGCGTTCGTC ATATGGGGTT TCCTTGATTC GGACGGGTTG
AAGCAGACGA CCTCCGATGT CCTTGACTGG ATCATTACCG ATTTGGGCTG GCTGTTCCTG
ATCTCAGCCA CGCTCTTCGT TCTTTTCGCG ATCTTCCTGG CCGTTTCCCG CTTCGGTCGC
ATCCCGCTCG GCCGGGACGG CGAGAAGCCC GAGTACAAGA CGGTCTCCTG GATCGCCATG
ATGTTCAGCG CCGGAATGGG CATCGGCCTG ATGTTCTTCG GCGCCGCGGA GCCGATCTAC
CACTTCGTCG GTGCGCCTCC GGGCACCTCC AGCCACGATG TGGCGGTGGC GATGGCCACC
ACGATGTTCC ACTGGGGCTT CCACCCGTGG GCCATCTACG CCGTTGTCGG CCTGGCGATC
GCCTACAGCA CCTTCCGGTG CGGCCGCAGC CAGCTGATCA GCTCGGTTTT CGCGCCCATC
TTCAACCGCA CCGGTGGTCA GGGCGCGGGT GGGCGGATCA TCGACATCCT GGCCATCTTC
GCCACCCTGT TCGGTACCAC CGCCTCGCTG GGCCTCGGCG CCGCGCAGGT GGGCGCCGGT
CTGGAGCGGC TGGGCTGGGT GGGTGACGGT TCGAGCAAGC TCCTGCTGGT GGCGATCATC
GCCCTCCTGA CGCTCGCCTT CGTGGCCTCC GCGGTCTCCG GTATCGCCAA GGGCATCCAG
TGGCTCTCCA ACACCAACAT GGTGCTGGCA CTGGTACTCG CGGTCTTCGT CTTCGTGGTG
GGGCCCACGG TGTTCATCCT CAACCTGCTG CCCACCACCA TGGGCGCCTA TGCGGCGGAC
TTCATGGACA TGTCGGCCCG GTCGGCGGCC AACGAACCCG AGGCCGGTGC CTGGCTCGCG
AAGTGGACCA TCTTCTATTG GGCCTGGTGG GTCAGTTGGA CCCCGTTCGT GGGCCTGTTC
CTGGCGAAGA TCTCGAAGGG CCGCACCATC CGCGAGTTCG TGATCGGCGT CATGGCAGTG
CCCACCCTGG TGTCGTTGGT GTGGTTCGTC ATCTTCGGCG GGACCGCGAT CAACCAGGAG
CAGAGCGGGC TCGGGGTGAG TTCGGCCGAG AACGAGGAGA AGATGCTCTT CGACGTGCTC
GGGAACCTGC CGTGGCCCAC GATCACGGCC TTCCTGGTGG TGTTGCTGGT GGGGATCTTC
TTCGTCTCCG GTGCTGATTC GGCATCGATC GTGATGGGAA CGCTGTCGCA GAAGGGCGAA
GAGGAACCGA ATCGCCTGAT CACCATCTTC TGGGGTGTGC TCACCGGCGG CGTGGCGGCG
CTGTTGCTCT GGGTGAGCGG TAACAACGCG CTGGAGGGCA TCAAACAGAT GGCCATCATC
GCCGCGGCAC CGTTCCTGGT GGTGATGCTC GGCATGTGCG TGGCGCTGAT GATGGACCTC
TGGCACGATC CGCTGATCGT CGCCGAGCGG CAGCGCCGTG ACGATCTCGG TCTTCGTGTC
CGGGTGCACG CGAACACCCT TGCCGTGACG GATGATTCCA CCGACGTGCT CCCCTCCGAG
GACGTCCCGG TGTACGTGGA CGGCGAGGTC CCGGAGGACC TCTACCATCC CGCGCACACC
GGCGAGATGG TCGCCGTCGA GGTCTTCGAG GCCGATGCCG AGAACAACGA GGCGTCGACC
GCGCGCAAGA CGGTCGAGGC GAGCGGCGAT GTCCGTGTGG TTATCACCAA GAACGACCAG
AAGCCGTAG
 
Protein sequence
MATVTNETGN HQRTDWVVFG VAAVSVLAFV IWGFLDSDGL KQTTSDVLDW IITDLGWLFL 
ISATLFVLFA IFLAVSRFGR IPLGRDGEKP EYKTVSWIAM MFSAGMGIGL MFFGAAEPIY
HFVGAPPGTS SHDVAVAMAT TMFHWGFHPW AIYAVVGLAI AYSTFRCGRS QLISSVFAPI
FNRTGGQGAG GRIIDILAIF ATLFGTTASL GLGAAQVGAG LERLGWVGDG SSKLLLVAII
ALLTLAFVAS AVSGIAKGIQ WLSNTNMVLA LVLAVFVFVV GPTVFILNLL PTTMGAYAAD
FMDMSARSAA NEPEAGAWLA KWTIFYWAWW VSWTPFVGLF LAKISKGRTI REFVIGVMAV
PTLVSLVWFV IFGGTAINQE QSGLGVSSAE NEEKMLFDVL GNLPWPTITA FLVVLLVGIF
FVSGADSASI VMGTLSQKGE EEPNRLITIF WGVLTGGVAA LLLWVSGNNA LEGIKQMAII
AAAPFLVVML GMCVALMMDL WHDPLIVAER QRRDDLGLRV RVHANTLAVT DDSTDVLPSE
DVPVYVDGEV PEDLYHPAHT GEMVAVEVFE ADAENNEAST ARKTVEASGD VRVVITKNDQ
KP
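
Both sequences above are printed in blocks of 10 characters, so the whitespace has to be removed before any analysis. The following is a minimal sketch of how the gene sequence could be cleaned, its GC content computed, and its translation compared with the listed protein. It assumes translation table 11, whose amino-acid assignments match the standard genetic code; alternative start codons are not handled, which does not matter here because the gene begins with ATG. All function names are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for 10-character grouping."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in an already cleaned DNA string."""
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate a cleaned CDS codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence listed above.
first_line = "ATGGCTACTG TTACTAACGA GACAGGAAAC CATCAGCGGA CCGACTGGGT CGTGTTCGGT"
last_line = "AAGCCGTAG"

print(translate(clean_sequence(first_line)))  # MATVTNETGNHQRTDWVVFG
print(translate(clean_sequence(last_line)))   # KP
```

Passing the full gene sequence through `clean_sequence` and then `gc_percent` or `translate` would allow the listed GC content and protein sequence to be checked in the same way.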