Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_3792 |
Symbol | |
ID | 9157972 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | + |
Start bp | 3913156 |
End bp | 3914154 |
Gene Length | 999 bp |
Protein Length | 332 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | |
Product | Mammalian cell entry related domain protein |
Protein accession | YP_003648709 |
Protein GI | 296141466 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.170276 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCGCCA AAAACACTCG CGCCGCGATC TGGCTCGCGG TCGCCCTCGT GATCACCCTT CTGGTCGCGG CGGGCATCTA CGCCGCCCTC CGCAACCCCG TCGACGGCGA CACCCGCGGC TACGAGGCGC ACCTCTCGGA CGCCTCCGGG CTCCGGGTGG GCAGCGATGT GCGCCTGCGC GGCGTGCTCA TCGGCAAGGT CACGTCGGTG GACGTGCGCG GCACGGACAC AGGTAACGAG GCGGTGGTCG GATTCACCGC GCAGAACCAG CATCCGGTCA CGTCCACGAC GCAGGTCGCG GTCAAGTACC GCAACCTCAC CGGTGAGCGG TTCATCGAGG TGCAGCCCGG CACCTCGGGC GGCGGCATCC CCACGACCTC GATCCCCATG GCGCGGACCA CGCCGTCGTT CGACATCACC ACGCTGTTCA ACGGGCTCGC GCCGGTGCTC CGCACGCTCA GTCCCGACGA TGTGAACGAA CTGACGCGCA ACCTGCTGGG ACTGCTCCAG GGCGACGACA CCACGGCCGC CGAGACTTTC AGCGCCATCG ACAAGATCAC CGCGAACCTC GCCGACCGGC AGACGGTGAT CAAGACGCTG ATCGACAACG TGACCCGCGT GGCGAACATG ATCAACGACT ACTCGCCCCA GGTGGTCGAA TTCGTCACCA ACTTCGACCT GCTGTTGACC AAGGTGCTGG AGAATCTGGA CGAGTTCCGC CGCACCGCCA CCTACGGGCC CGGCTTCGGC GCCGCCACCA ACCGCACGCT CACGGCACTC GGCCTGTCCA AGGAGCTCGA CGTGGAACGG CTGTTCACCA CCGCCTTCCG AGATCCGCAG GCGGCGGTGC AAGCGATGAG CAAGCTCCCC GGCCTGTTCA CCGGCGTCGC CGCACTGCTC GGCACTCCCC CCACCTCGTG CTCCAAGGGC GAGGCGAAGC TTCCGAAGTC GACCACGGTG CTGCTCGGCG GCACGAATGT GACGGTGTGC CTGAAATGA
|
Protein sequence | MRAKNTRAAI WLAVALVITL LVAAGIYAAL RNPVDGDTRG YEAHLSDASG LRVGSDVRLR GVLIGKVTSV DVRGTDTGNE AVVGFTAQNQ HPVTSTTQVA VKYRNLTGER FIEVQPGTSG GGIPTTSIPM ARTTPSFDIT TLFNGLAPVL RTLSPDDVNE LTRNLLGLLQ GDDTTAAETF SAIDKITANL ADRQTVIKTL IDNVTRVANM INDYSPQVVE FVTNFDLLLT KVLENLDEFR RTATYGPGFG AATNRTLTAL GLSKELDVER LFTTAFRDPQ AAVQAMSKLP GLFTGVAALL GTPPTSCSKG EAKLPKSTTV LLGGTNVTVC LK
|
| |