Gene Information |
Locus tag | Tpau_3794 |
Symbol | |
ID | 9157974 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | + |
Start bp | 3915113 |
End bp | 3916228 |
Gene Length | 1116 bp |
Protein Length | 371 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | |
Product | Mammalian cell entry related domain protein |
Protein accession | YP_003648711 |
Protein GI | 296141468 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
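The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon that is not counted in the protein length; the function names are illustrative and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(3915113, 3916228)
print(length)                     # 1116
print(protein_length_aa(length))  # 371
```

Under these assumptions the values agree with the Gene Length (1116 bp) and Protein Length (371 aa) fields of the record.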
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.659057 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCTTCCCG AGCGCGACGG TGAGCTGCTG ATGCGGCCCG GAAGGATCGG CCTGCTCACG GTGGTCACGA TGCTCGCGTT GACGGCGGTC CCGTTCACCG CCGGAACGGC GCAGGCGGCC TCGTCCAGCA CGCTGTGCGC CTACCTCGCG GACGGTATCG GCCTGTACCC CGACAACGAT GTCACGCAAA TGGGCGTGAA GATCGGCTCG GTGCGCACGG TCGAGCCGCA GGACGACGGG GTCAAGGTCA CCTTCGAGGT GTCCGGTCGC ACGCTTCCGG CATCCGTCAA GGCAGTCACC CGCTCCACCT CGATCCTGGC CGACCGCACG CTCGAACTCG TCGGGAACTA CGCATCCGGC CCCACGCTCG GACCGGAGAC GTGCATCGGC CGCGCCAACA CGGCCACCCC GAAGTCGATC TCGGAGACCA CGGAATCGGC TACCCGGCTG GTGGATTCGG TCACCGAGGG CGGCGCGTCC GCGGATCTCG GCAAGCTGCT GACCACGCTC GATACCCAGG TCGGCCCCAG CGTCCCGCCG CAGGCCGCGC GGAGCCTCAC CAACCTGTCC ACGTTGCTCA GTGATCCCGC AGGCTTCCTC GGCGACGTGA CCACGGTGGT CGGCAACGTC CGGCCGCTGA TGCAGAGCTT CGACGGCCAG TGGGGCGAGA TGCTGCTCAC CCTGCAGCAC GCGGGCAACG TGATGGAGCA GTACGGCCGT GTCACCTTCC CCGCGGTATC GGAAATCTTC CAGACGCTAC CGGTCTTCTT CCTCGTGGGC GACGACATGA TGCGTCGGTA CGGCGACATC CTGCGCCCGG GCGGCACCGT CGCCGCCGAT GCGGTGAAGC TCGCCGCCAC CTCGGTCCGC TCGAACGAGG CGCTCGCGAA GATGCTCCCG GTGTTCGGCG CGCAGATCGC GCCCTACCTG CCGCGCGGCG ATGGCGCCGA GACCACCGTC GCGGGCGTGC CGGTCACCAC GGTGGCCACA CGGGATCCGG AGGCGCTGTG CGCCCGGGTC AATGCCGAGG TGCCCGGTGC GTGCGCGGTG ACCGGCGGCT CGGCCCAGGT GGCCACCGTG AACCTGCTCC AACTCCCTCT CGGGGGAGGC CGATGA
|
Protein sequence | MLPERDGELL MRPGRIGLLT VVTMLALTAV PFTAGTAQAA SSSTLCAYLA DGIGLYPDND VTQMGVKIGS VRTVEPQDDG VKVTFEVSGR TLPASVKAVT RSTSILADRT LELVGNYASG PTLGPETCIG RANTATPKSI SETTESATRL VDSVTEGGAS ADLGKLLTTL DTQVGPSVPP QAARSLTNLS TLLSDPAGFL GDVTTVVGNV RPLMQSFDGQ WGEMLLTLQH AGNVMEQYGR VTFPAVSEIF QTLPVFFLVG DDMMRRYGDI LRPGGTVAAD AVKLAATSVR SNEALAKMLP VFGAQIAPYL PRGDGAETTV AGVPVTTVAT RDPEALCARV NAEVPGACAV TGGSAQVATV NLLQLPLGGG R
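The gene sequence begins with GTG while the protein sequence begins with M. This is consistent with translation table 11, where GTG can act as an initiator codon and is then read as methionine. The sketch below illustrates this on the first 30 bases of the gene sequence. The function name, the restricted START_CODONS set (a simplified subset chosen for illustration, not the full initiator list of table 11), and the decision to stop at the first stop codon are assumptions made for this example, not part of the record.

```python
from itertools import product

# Standard codon-to-amino-acid mapping in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Simplified, assumed subset of initiator codons used for this illustration.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS; whitespace between 10-base blocks is ignored."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiator codon is read as methionine
        if aa == "*":
            break  # stop codon ends translation
        protein.append(aa)
    return "".join(protein)


prefix = "GTGCTTCCCG AGCGCGACGG TGAGCTGCTG"  # first 30 bases of the gene sequence
print(translate_cds(prefix))  # MLPERDGELL
```

The output matches the first ten residues of the protein sequence in the record. For real work, an established library translation routine with explicit table selection would be a safer choice than this hand-rolled table.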