Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_3953 |
Symbol | |
ID | 9158134 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | + |
Start bp | 4073245 |
End bp | 4074702 |
Gene Length | 1458 bp |
Protein Length | 485 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | |
Product | amino acid permease-associated region |
Protein accession | YP_003648864 |
Protein GI | 296141621 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCAACA GCCCGCAATC CGATCCGCCG AGGGCGGTCG ACGACGCGCC GTCCGGCGAG CACCTGCTGC AGCGTGGCCT CGGCATCGGA TCGATCGTGT TCATGGTGGT CGCGGCGGCG GCACCGCTCG CCGTCGTGAC GGCGAGCGTC CCGATCATCA TCTCGGTGAG CGGGAGCGCG GCAGCGCCAC AATTCTTCCT GATCGCGATG TGTGTACTCG CCCTATTCTC CGTCGGGTTC ACAGCGATGA GCAGGTACGT GCCGAACGCG GGCGCGTTCT ACTCGTACAT CCAGCGCGGA CTGGGCCGTC ACGCGGGAGT CGGGGCGGCA ACGCTGGCGC TGGGCTCGTA CGGGGTGATG CAGATCGGGC TGTATACCTA CATGGGGGTC GCGACGTCGA AGCTGTTGTC CACGATGGAT ATCGGGCTCC CGTGGTGGAC GTGCTCGCTC GCGTGGGTAT CGATCGTCGG CGTCCTCGGC TACCGCGACA TCGAGCTCAG CTCGAAAGTG CTCGGCGTGC TACTGGTCGC GGAGACGCTG GCCGTTGTGG TCATCGAGGT CGGCGTGATC AGTGCGGGCG GGGCCGGCGG ACTGGACCTG GTTCCGCTGT CTCCCAGTGA GTTCGCCACC GGCGCACCGA GCCTCGGGCT GATGTTCGGG TTCTTCAGCT TCATCGGATT CGAGGCCACG GCCGTGTTCC GCAACGAGGC CCGCGATCCG GAGCGGACGA TTCCCCGGTC GACCTATGCC GCGGTGGTCT TCATCGGCCT GTTCTACGCC TTCGCGGCGT GGGCGGTGGT GGAGGGACTC GGCGTCGGGC ACGCGGTCGA GGCGGCGAAG GCCGATCCCG GCAACCTAGT GCAGAACCTG GCCAGCGAGT ACGTTTCACC GATCCTCACC GACGTCATCC AGGTGCTGCT CGTGACCAGC TTCTTCGCGT GCGTGCTCTC GTTCCACAAC GTGATCACCC GCTACCAGTT CACGCTGGCG ACGAAAGGAC TACTGCCGCA GTCCCTCGCC GAGATCAGCC CGCGCCACCG CACCCCGTCG AAGTCGTCGC TGACGTTCAC CGTGGTCTCT CTGGTGGCGG TGGCGGCCGT CGCGGCGGTG GGCTGGGATC CCGTGGCCCA GACGTACATG TGGTCGTCGG GGGCGTCGAC GCTCGGCCTG ATCGCGCTGA TGGCGATGAC CAGCCTGGCG GTGATCATCT TCTTCCGGAC GCGGGTCCGC AACCAGGGCC GGTGGCGGAC GCTGGTCGCG CCGGGGCTGT CGTTCCTTGG GCTCACCGCG ATCCTGCTCC TGGTGATCGC CAACTTTCCG GTGCTCGTGG GCACCACCAC CACCGCGGTG GTGCTCGGCG TGATCATCCT CGTGACCTTC CTCGCCGGCG TCGTTGCCGC CGAGCGACTG CGCCGCACCC GCCCTGAGCA CTATCAGGCT CTGCTACACG AGGACTGA
|
Protein sequence | MINSPQSDPP RAVDDAPSGE HLLQRGLGIG SIVFMVVAAA APLAVVTASV PIIISVSGSA AAPQFFLIAM CVLALFSVGF TAMSRYVPNA GAFYSYIQRG LGRHAGVGAA TLALGSYGVM QIGLYTYMGV ATSKLLSTMD IGLPWWTCSL AWVSIVGVLG YRDIELSSKV LGVLLVAETL AVVVIEVGVI SAGGAGGLDL VPLSPSEFAT GAPSLGLMFG FFSFIGFEAT AVFRNEARDP ERTIPRSTYA AVVFIGLFYA FAAWAVVEGL GVGHAVEAAK ADPGNLVQNL ASEYVSPILT DVIQVLLVTS FFACVLSFHN VITRYQFTLA TKGLLPQSLA EISPRHRTPS KSSLTFTVVS LVAVAAVAAV GWDPVAQTYM WSSGASTLGL IALMAMTSLA VIIFFRTRVR NQGRWRTLVA PGLSFLGLTA ILLLVIANFP VLVGTTTTAV VLGVIILVTF LAGVVAAERL RRTRPEHYQA LLHED
|
| |