Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_0565 |
Symbol | |
ID | 9154701 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | + |
Start bp | 593236 |
End bp | 594381 |
Gene Length | 1146 bp |
Protein Length | 381 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003645544 |
Protein GI | 296138301 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.420906 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGGATC CCGGCAATCG GAAGCGAGCC CCGCGTAGGT CCAACGCCGG TGCCGGGCAG TGGATGCTGG GTGCGGTGAT CGTGCTGGCC GTGGCCGGTA CCGGGATGTG GATCTTCTCC GACGACAAGC GCTGGGGTCT TCTCGCCCTG ATACTGCTGG TCTGGGGTCT GGTGATCGCC GCATTCCTGA TCGCTCGCAA CGCCCGCGAC ATCCGCTCGG CGGAGTCGAA GGAGAGCGGT CTCAAGGCGG TCTACGAGCT CCAGCTCGAG CGCGAGATCT CCGCCCGGCG CGAATACGAA CTGCAGCTCG AGCACGACCT GCGCTCCGAG CTGCGGCACG AGTCCAATGA AGAGCTCACA GCACTCAAGG CAGAGGTGCT CGCGTTGCGC GCCAACCTCG AAGAGCTGTT GGGTCGCGAC CTGGGCCCGT CGTACACCGA GCTGTACGCC GCCGCCGAGC AGCGCGCGAT CGACGCCCAG CGCGCCAAGT CCGTGTTCGA AGACGACGAC CGATTCCGCG CGCAGCAGGA TTTCGCCGGC GTGCCGCCCC CGGTGCCCGA CGTCGACAGC TACTACGGCG CCGCTTCCGC GGTGAACGAC GTCCCGGACT TCCCGGCGGC CGCGGAGCAG CCCGTCGACG CGCCCTCCGC GCCGAACCGG CCTCAGGAGG ACACGCCCAC CGCGGTCTTC GAGGCCGTCG TCGACCCGCA GTCGCCGGCC GCTGCGCCGC CGGTCGATGC CGAGCCGGTC GACGTCGAGC ACATCGACCA CATCGACGGC GACGAGCCCT TCGTCCCGCT GGCCCCGCAG TCCGGCTACC GCCCGCCCTT CCCTCCGCGC CGGTTCACCC CGGATGCCGC GCAGCAGCAG CCGCCGCAGT TCGCTCAGTA CGCCCCGCCG GAGCGCGCTC CGCAGCAGCC CGCGCCCGCG GACCAGGTGC ACAGCATCCC CACCGTGCCG CCCTTCCCCG CCGCCGAACC CACGCCCGCG GCGCCGCTGC AGGAGACGCG GCCGGAGGAC GACGACGTGC ACGAGCAGGA CAGCACGCGC GGCCAGCACT CCTCGGGCAG CACCGTGGCC GAGCTGATCG CCCGGATGAA CGCCGACACC GAGCGCAGCG GCGGCCGCCG GCGCAAGCCG GAGTGA
|
Protein sequence | MMDPGNRKRA PRRSNAGAGQ WMLGAVIVLA VAGTGMWIFS DDKRWGLLAL ILLVWGLVIA AFLIARNARD IRSAESKESG LKAVYELQLE REISARREYE LQLEHDLRSE LRHESNEELT ALKAEVLALR ANLEELLGRD LGPSYTELYA AAEQRAIDAQ RAKSVFEDDD RFRAQQDFAG VPPPVPDVDS YYGAASAVND VPDFPAAAEQ PVDAPSAPNR PQEDTPTAVF EAVVDPQSPA AAPPVDAEPV DVEHIDHIDG DEPFVPLAPQ SGYRPPFPPR RFTPDAAQQQ PPQFAQYAPP ERAPQQPAPA DQVHSIPTVP PFPAAEPTPA APLQETRPED DDVHEQDSTR GQHSSGSTVA ELIARMNADT ERSGGRRRKP E
|
| |