Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_4197 |
Symbol | |
ID | 9158385 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | - |
Start bp | 4324112 |
End bp | 4326115 |
Gene Length | 2004 bp |
Protein Length | 667 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | |
Product | oligopeptide transporter, OPT family |
Protein accession | YP_003649105 |
Protein GI | 296141862 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCAGTC CGGCTTCCGC CGCGCCGCGA TCCGGCGTAC ACGAGCTGAC AGTCCGCGGC ATCGCTCTCG GTGCCGTCAT CACGCTCGTC TTCACCGCCG CCAACGCCTA CCTGGGGCTG AAGGTGGGCC TGACCTTCGC CACATCGATT CCGGCGGCGG TCATCTCGAT GGCATTGCTG CGCTACTTCG CGAACCACTC GATCATCGAG AACAACATCG TGCAGACGAT CGCGTCGGCC GCGGGAACGC TTTCGGCCAT CGTGTTCATC CTGCCGGGCC TGGTGATGAT CGGCTGGTGG CAGGGCTTCC CGTACTGGAC CACCACCCTG GTCTGCCTCG TCGGCGGCGT GCTCGGCGTC ATGTACTCGA TCCCGCTGCG GCGAGCCCTG GTGACCGGGT CCGACCTCCC GTTCCCCGAA GGCGTGGCCG CCGCCGAGGT GCTCAAGGTG GGCGATACCG CCGAGGGAGC CCAGGAGAAC AAGACCGGCC TGCGCCTGAT CCTGCTCGGC TCGCTCGCCT CCGCGGGCTA CGCCCTGCTC GGCAAGATGA AGGTGGTCGC CGAGTCCATC TCGATCCCGG TGAAGATCGG TAGTGGTGGC ACCATCGTGG TCCCCGGCCT GTCCTTCGCG CTGATCGGCG TGGGCCACCT CGTGGGCGTC ACCGTGGGCA TCGCGATGAT CGTGGGCCTG GTCATCTCCT ACTTCGTGCT CCTGCCCATC TGGACCTCCG GCGAGCTCGG CGGCGGCGAG GCCTTCTCGG ACGTCGTCAA CGGCATCTTC AAGAACGACA TCCGGCTGAT CGGCGCCGGC GCCATCGCGG TGGCCGCCGT GTGGACCCTG GTGAAGATCC TCGGGCCGGT GCTGCGCGGC GTCGCCGAGT CGATCGCCTC GGCGCGCAAA CGGCGCGACG GCGAACTCGT CGACATCACC GAGCGCGATA TCCCGTTCCC CTACGTGGCC GGGATCGTCC TGGTCTCCAT GGTCCCGATC GGCGTGCTGC TGTGGCTGTT CACCACCGAC ACCCCACTCG ACGGCAAAGC CGGCGGCATC ATCACACTCA GCGTGCTGTT CGTGCTCGTC CTGGGCTTGC TGGTCGCCTC GGTGTGCGGC TACATGGCCG GTCTGATCGG CGCCTCCAAC AGCCCCATCT CGGGCGTCGG CATCCTTGTG GTGCTCGCCG CGGCGCTGCT CATCCGCGCG GTCTACGGCC CGTCGAGCGG TGACGAGACC ATCGCGCTCG TCGCGTACAC GCTGTTCACT GGCGCCATCG TGTTCGGCAT CGCCACCATC TCCAACGACA ACCTGCAGGA CCTCAAGACC GGCCAGTTGG TGGGCGCCAC CCCGTGGAAG CAGCAGGTGG CCCTCGTGAT CGGTGTGGCC TTCGGTTCGG CGATCATCCC GCCGGTGCTC GGAGTGCTGC AGAAGGGCTT CGGCTTCGCG GGCGCCCCGG GCGCGGGTGA CAATGCGCTC GCCGCCCCGC AGGCCTCGCT GCTCGCGAAG CTCTCCGAGG GTGTTTTCGG CGGCGACCTC GACTGGGGAC TCATCGGCCT CGGCGCCCTG ATCGGCGTTG TGGTCATCGT GATCGACGAG ACGCTCGCCC GCTCCGGCAA GTTCCGGCTG CCCCCGCTCG CGGTGGGCAT GGGGATGTAC CTGCCGATGA GCGTGACCTT GATGATCCCG ATCGGCGCCG CGATCGGGTA CTACTACAAC CGCTGGGCCG ACCGCTCCGA CAACGCCGAG GGCCGCAAGC GGCTCGGCAC GCTCATGGCG ACCGGTCTCA TCGTGGGTGA GTCGCTGTTC GGCGTGCTCT ACGCTGGCAT CGTCGTGCTG GCGGACCGCG TTCCCGATCT GCCGATCATC GGTGGCAAGG AAGAGCCGCT GGCGCTGCCC TTCATCGGCG AGGGCTACCT GCACTGGGGC GAGGCGCTCG GCGCGATCCT GTTCGCCGCC ATCGTCTACG CGCTGTACAC CCGGACGAAG AAGGTCGCCG CCGCCGAGGC CTGA
|
Protein sequence | MSSPASAAPR SGVHELTVRG IALGAVITLV FTAANAYLGL KVGLTFATSI PAAVISMALL RYFANHSIIE NNIVQTIASA AGTLSAIVFI LPGLVMIGWW QGFPYWTTTL VCLVGGVLGV MYSIPLRRAL VTGSDLPFPE GVAAAEVLKV GDTAEGAQEN KTGLRLILLG SLASAGYALL GKMKVVAESI SIPVKIGSGG TIVVPGLSFA LIGVGHLVGV TVGIAMIVGL VISYFVLLPI WTSGELGGGE AFSDVVNGIF KNDIRLIGAG AIAVAAVWTL VKILGPVLRG VAESIASARK RRDGELVDIT ERDIPFPYVA GIVLVSMVPI GVLLWLFTTD TPLDGKAGGI ITLSVLFVLV LGLLVASVCG YMAGLIGASN SPISGVGILV VLAAALLIRA VYGPSSGDET IALVAYTLFT GAIVFGIATI SNDNLQDLKT GQLVGATPWK QQVALVIGVA FGSAIIPPVL GVLQKGFGFA GAPGAGDNAL AAPQASLLAK LSEGVFGGDL DWGLIGLGAL IGVVVIVIDE TLARSGKFRL PPLAVGMGMY LPMSVTLMIP IGAAIGYYYN RWADRSDNAE GRKRLGTLMA TGLIVGESLF GVLYAGIVVL ADRVPDLPII GGKEEPLALP FIGEGYLHWG EALGAILFAA IVYALYTRTK KVAAAEA
|
| |