Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_4072 |
Symbol | |
ID | 9158259 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | - |
Start bp | 4198370 |
End bp | 4199722 |
Gene Length | 1353 bp |
Protein Length | 450 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | |
Product | permease for cytosine/purines uracil thiamine allantoin |
Protein accession | YP_003648981 |
Protein GI | 296141738 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTGCGCT ACTACCGGCG CCTCGACGCG CACCTGCACG CTGGAGCCGA CGACGGTACC CCCGTGACCG GCGACCTCAG TGGTCGCCGC ATCTTCGCGA TCTGGCTCGC GGCGAACCTC GTGGTGACCA CGATGCTCAC CGGCACACTG TTCGTCCCCT CGGTGTCGCT GAGCAGTGCG CTGTGGTTGA TCCTGGCCGG CACGGTGGTC GGCGCCCTGG TGCTGGTGAC CGTCGGTAAC GTGGGCACCC GCACCGGGCT TCCCACGATG GCGCTCACCC GCGCCTCGTT CGGGGTGCGT GGCAGCCGGA TTCCCGTGGC CGCCAACATC GCCGTGCTCA TGGGCTGGAG CTGGGTGCAG GCCATGCTGG CCGGTGTCAC GGTGAATTAC GTGGTGGCGC AGCAGACCGG GTACTCGAAC CCGACACTGT TCTCCGTGCT GTGCGAGGTG ATCGTGGTCT CTCTCGCGAT CTTCGGGCAC ACCGGTATCG CCAAAGTGGA ACCTATTCTG GCTGTGGTGA TCCTGGGCAT CATGGCCTAT CTGTTCATCG ATATGTTCGG TGACGGATTC GCCCGCCTCG ACACCCTGCC GCGCGACCCC GAAGGACTCA CCGGGCTCGG CATTTTCGAC ATCGTCGTGG CCACCGCCAT CTCCTGGACC GTGCTCTCCG CCGATATCAA CCGTTTCGCC GCCACCTCTC GCGGCGGCGT CGTGGGATCC GGGCTCGGTT ATGTGGCGTC CACGGTGGCC GCGATGGCGT TGGGCGCCAC CGCGATCGCG TGGCTGCTCG CGGGTGGCAA GGAAGCGCCG CCGTTCGATC CGACGGTGAT CGTCGCCGAA TTCGGCGTGC CACTGGCGAT CGTGGTGTTC TTCTCCGTGA TGGCCACCAA CACCATGGCC GTCTACGGGA TGGTCACCTC GCTGGTGAAC GCGGAGCCCG GACGCATCCG GTTCCTGCCC GCCGCCCTCG GACTCGGCGT GATCTCGATC GCCGGATCGG CATGGCTGGC GCTGCTGGAC AAGTTCACCG CGTTCCTGAC CGCCATCGGA ATCGTCTTCA TCCCGGTTTT CGCCGTCATC ATCGTGGACT TCTACGTGCT CCGGCGTGGC CGCTACGCCG CGTCGCTCGC GGAAGCGGGT TCGCGGGATT ACTGGTACCG CGGCGGCTGG AATCCGGTGG CGGTCATCGT ATGGCTGGTG GGTGTCGGAT TCTCGTCGGC GATCACTTAC CTGTGGGTGA GCCCCGTGGG AGCCACCGTG CCCACCTTCC TGGTGAGCGC GGGGCTGTAC TGGGCCGCGA GCGTCGCCGT CCGGAACCGC GATCACCGCG ATACCGATAG CGTGGCCGTG TGA
|
Protein sequence | MLRYYRRLDA HLHAGADDGT PVTGDLSGRR IFAIWLAANL VVTTMLTGTL FVPSVSLSSA LWLILAGTVV GALVLVTVGN VGTRTGLPTM ALTRASFGVR GSRIPVAANI AVLMGWSWVQ AMLAGVTVNY VVAQQTGYSN PTLFSVLCEV IVVSLAIFGH TGIAKVEPIL AVVILGIMAY LFIDMFGDGF ARLDTLPRDP EGLTGLGIFD IVVATAISWT VLSADINRFA ATSRGGVVGS GLGYVASTVA AMALGATAIA WLLAGGKEAP PFDPTVIVAE FGVPLAIVVF FSVMATNTMA VYGMVTSLVN AEPGRIRFLP AALGLGVISI AGSAWLALLD KFTAFLTAIG IVFIPVFAVI IVDFYVLRRG RYAASLAEAG SRDYWYRGGW NPVAVIVWLV GVGFSSAITY LWVSPVGATV PTFLVSAGLY WAASVAVRNR DHRDTDSVAV
|
| |