Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_3795 |
Symbol | |
ID | 9157975 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | + |
Start bp | 3916228 |
End bp | 3917244 |
Gene Length | 1017 bp |
Protein Length | 338 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | |
Product | Mammalian cell entry related domain protein |
Protein accession | YP_003648712 |
Protein GI | 296141469 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGCGTA CGGTGCGCTG GCTACCTCGT CCCGCGTCGG CGGCGCTGGC CGTGATCGGC GCGGTCACGC TTGCGGGGTG CACGGTGAGC CCGGCCAACA TGGATCTACG CAATCCGCTC GCCGACGGCG ACCAGTACGA ACTCACGATG GAGTTCGCCG ATGTGCTCAA CCTGCCCGTC GGCGCGCGTC TCGCGCTCGG CGGGCTCACA GTGGGCCGCG TCAAAGCGGT GTCACTGTCG GAGACCGCGG CCACTGTCAC GGCGCGCGTG GACAAGACCG TCACCCTCCC CACCGACATC CATGCCTCGG TGCTGCAGGA CACCCTGCTC GGCGAGGCGT ACGTCCGGCT GGAGGCGCCC GAACAGCCGT CACCGTCGTC CACCTCGCTG CGGGCGGGCG CGACGCTGCC GCTCGCGCAG ACCAGTCCGC CGCGGTCGGT GGAGAGCACG CTCACCATCT TGGCCGATTA CTTCGGTAGC GGCTCCGTCC AGGACATCAG CCGCACCATC ACCAAGCTCA ACACCTCGCT CCCCAAGGAG CGCCCACAGT TCGATCAGCT CTCGACACAG CTCAGCCGCG ATGTCACAGG GATCGGTGCG TCGACCGCCG AGGTGAACCG CCTGCTCGAT TCGGTCTCCG ATATGGCGGA CCGAACGGCC AAGCAGGAAG CCAACTTCAC ATACACGCTC AACCCGTACA ACATGAAGTA CTGGAAGAAC CAGGGCAAGC TGATGTCGAA CATCGGTGTG CTGCTCCCCG CCGTCGGCGG CATGCTGCAG CAGGGCTACT GGCTGATCCC CCTGATGAAC TCCTCATCGG ACCTGTTCGA GTTGCTCACC GCCGATATGG TCTCGCTGGG CAGCGCGATC CACGGCGGTT CGATCCTGGT GAACAAGAAC CTCGCCCCAT TCCTCGCCAA GCCCACGGTG AAGGTGGAGT CAGTGACCGG GCCCGGCGGC CAGGACCTCT CGCCCGCTGC GGTGAAGCTG CTGCGGATGA TCGGACAGGC ACGATGA
|
Protein sequence | MTRTVRWLPR PASAALAVIG AVTLAGCTVS PANMDLRNPL ADGDQYELTM EFADVLNLPV GARLALGGLT VGRVKAVSLS ETAATVTARV DKTVTLPTDI HASVLQDTLL GEAYVRLEAP EQPSPSSTSL RAGATLPLAQ TSPPRSVEST LTILADYFGS GSVQDISRTI TKLNTSLPKE RPQFDQLSTQ LSRDVTGIGA STAEVNRLLD SVSDMADRTA KQEANFTYTL NPYNMKYWKN QGKLMSNIGV LLPAVGGMLQ QGYWLIPLMN SSSDLFELLT ADMVSLGSAI HGGSILVNKN LAPFLAKPTV KVESVTGPGG QDLSPAAVKL LRMIGQAR
|
| |