Gene Information
Locus tag | Tpau_3796 |
Symbol | |
ID | 9157976 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | + |
Start bp | 3917241 |
End bp | 3918314 |
Gene Length | 1074 bp |
Protein Length | 357 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | |
Product | Mammalian cell entry related domain protein |
Protein accession | YP_003648713 |
Protein GI | 296141470 |
COG category | |
COG ID | |
TIGRFAM ID | |
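The coordinate and length fields above can be cross-checked against one another. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes a single terminal stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 3917241
end_bp = 3918314
protein_length_aa = 357

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon that is not counted as an amino acid.
codon_count = gene_length_bp // 3
expected_protein_length = codon_count - 1

print(gene_length_bp)           # 1074
print(expected_protein_length)  # 357
```

Under these assumptions the computed values agree with the listed gene length of 1074 bp and protein length of 357 aa.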
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGACGCGCT CGGGTATCGC CTCCGCACTG GGACTGGTGG CACTGGTCGT GGTATCGGTG CTGCTGCTGG GACAGCAAGG TGTGCAATGG CTCTTGCCGT CCGATCAGCG CACCGCATAC ATCCAGCTGA CCGACGCCAA CGGGCTGATG CAGGGGTCTC GCGTCCTCGA CCGCGGTGTG GAGGTGGGCA CTGTGCGGGC GCTGCGAGTC GACGCCGAGC GGGTCGAGGT CGAGGTCGGT TACGACCGGG ACACGCGGAT CAGGTCCGGG GCGAAGGTCA GGGTGCAGAA TCTATCGGGC CTGGGCGAGA CGTATCTCAC GCTGTCCCAG GGCGACGGTG CGCCGTTGCC CGACGGTGCT CGGCTGACCG GGGTCGTCGA CACTTCCGAC TCCACCATCG GCGCGCTGTC GAGCTCGCTG TCACGGCTGA TGAGCCAGCT CGATCCGGGC ACGGTGCAGC GGTTGCTCAC GCGCGCCGAT GCCGCCCTGC CCGCGCAACA GCAGACGGTG CCCACGATCG ATGCCGGCGC GGTACTGACC GCAACGTCGA TCCTGAGGCA ATTGCCCGAT ATCGGCGACA TCCTCAAGTC CTTCAACTCG ATCACCCCGC GCGCGGAGCA GATCGGCCCA CTGCTGCTCT CGCTCGACGC ACCGCTGCGC ACGTTCGGCA AAGGCTACGG CGAGATGGCG CCGTTCGCGG TGTCGTACAT CATGGAGGGC GACTACCCGA ACGTCATCAA GAACGGAATG CTGGAGCTGT TCAAGGTGAT CCAGAAGTTC GAGGACGATG CGGCGCCCAG CGTGAAGTAC CTCTCGGAGC TCGTCCTACC GTCGGCGCGG GCGATCGCCG AACCACTGTC CACGATCAAC GCCGGCGCGC TCCTCGACAC CGCACTGGGG TCAGTGCGCG GCCGTGACGG CGTCACGCTG CGCCTGGCTC CGCGGGACGC GGCACCCGGA CCGGCCCCGT CCGCGACGCC CGTGCCGTCT GGGAATACCT CCGCGGCATC GTCGTCCGGT CCGCGGCCGG CGGAACGGAC CGCCACCTCG TCCTCGCCCA CCAAACCTCG CTGA
Protein sequence | MTRSGIASAL GLVALVVVSV LLLGQQGVQW LLPSDQRTAY IQLTDANGLM QGSRVLDRGV EVGTVRALRV DAERVEVEVG YDRDTRIRSG AKVRVQNLSG LGETYLTLSQ GDGAPLPDGA RLTGVVDTSD STIGALSSSL SRLMSQLDPG TVQRLLTRAD AALPAQQQTV PTIDAGAVLT ATSILRQLPD IGDILKSFNS ITPRAEQIGP LLLSLDAPLR TFGKGYGEMA PFAVSYIMEG DYPNVIKNGM LELFKVIQKF EDDAAPSVKY LSELVLPSAR AIAEPLSTIN AGALLDTALG SVRGRDGVTL RLAPRDAAPG PAPSATPVPS GNTSAASSSG PRPAERTATS SSPTKPR
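The sequences in this record are displayed in space-separated blocks of ten characters, so they need to be joined before any analysis. The sketch below, with hypothetical helper names `clean_sequence` and `gc_content`, shows one way to strip the spacing and compute a GC percentage comparable to the GC content field. Only the first two blocks of the gene sequence are used here for brevity, so the result describes that fragment and not the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two ten-base blocks of the gene sequence shown above.
fragment = clean_sequence("ATGACGCGCT CGGGTATCGC")
print(len(fragment))         # 20
print(gc_content(fragment))  # 65.0
```

Passing the full gene sequence through the same two functions gives a figure that can be compared with the rounded 68% listed in the record.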