Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_3791 |
Symbol | |
ID | 9157971 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | + |
Start bp | 3912117 |
End bp | 3913151 |
Gene Length | 1035 bp |
Protein Length | 344 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | |
Product | Mammalian cell entry related domain protein |
Protein accession | YP_003648708 |
Protein GI | 296141465 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.929447 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTACGAAC TGCTCGGGGG TACCCCCCAG CGTCAACAAC GGATCTTTCT CACGATCGGC GCCGCCGCCG TCGCGGTGGT CGTGATCGGG GTTCTCGTGG CCGTCCTGTC GTCCCGCCTG CCGGACTCGG ACGACCGGCT CGCGCTGACC GTCGAATCCG CGGACGTCGG ACCGGGGGTG AAGGCGGGAA CCCGGGTCAC ACTGCGGGGC GTCCCGGTCG GCGAGGTGAA ATCCGTCGAG ATCCCCCGGC CCGGCCAGGC TCGCGTCGCG ATCCGGCTCG AACAGGGCGG GATCCGCGGT CTCACCGACG CACTCGCGGT CCGGTACCAG CCCGGCAACT ACTTCGGCGT CACCGAGATG GCACTGCTCC CCCGCACCGG CGGCGCCGCG TTGCGCTCCG GTGGGGTGCT GCGTCCCTCG TCCAGCAACG ACACCATCTC CGATCTGCTC ACGCGCAGTT CGGTTTCGGT GAGCGGCGTG ATGACGCCGA AATTGATCGA CGTCACCACG AAGGCCACCA AGTACACCCA GGCCATCACG CCTCTCCTCG AGGTGGCGTT CGCCGTCGAG CGGCAGAACG CGCAGTCGAC CACGGTGCCG ATCGGCACCA CCCTGCGCAC CGTGCGCAGC ACCGTCGACG GTGTGCCCGG CGTGATCGAG GCAGCGTTCG AGGGATGGTT CGTGCCTCAG CTGCGCGGCG GCGAGAAGGC CGGAACACTG GATTTCGTGG TGAATTCCTA CGATAAGGTC GACGCCACCA CCACCCTGCT CGGTACCGGC TTCTTCACCC CGCTGGCCGA CATGTTCGGT TCGCACGAGG CCGATTTCAA ACCCGGCACC GTTCTGCTCC AGGTGGTCAT GGACGCCACC ACCAAGGTGA TGCGCACGGT GAGCATCCCG CGACAGGTCG TGCCCGTCAT CTCGGGTCTC AACTCGGCCT TCGTCACCGT CCCGGAGGGC ACCGCGCTGC GTATGAACAT CGTCGCCGAC CGCTTCCCGG CCGCCGCCTC CGTCCTCCCG AAGGGAGGTC GCTGA
|
Protein sequence | MYELLGGTPQ RQQRIFLTIG AAAVAVVVIG VLVAVLSSRL PDSDDRLALT VESADVGPGV KAGTRVTLRG VPVGEVKSVE IPRPGQARVA IRLEQGGIRG LTDALAVRYQ PGNYFGVTEM ALLPRTGGAA LRSGGVLRPS SSNDTISDLL TRSSVSVSGV MTPKLIDVTT KATKYTQAIT PLLEVAFAVE RQNAQSTTVP IGTTLRTVRS TVDGVPGVIE AAFEGWFVPQ LRGGEKAGTL DFVVNSYDKV DATTTLLGTG FFTPLADMFG SHEADFKPGT VLLQVVMDAT TKVMRTVSIP RQVVPVISGL NSAFVTVPEG TALRMNIVAD RFPAAASVLP KGGR
|
| |