Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_4174 |
Symbol | |
ID | 9158362 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | + |
Start bp | 4296837 |
End bp | 4297940 |
Gene Length | 1104 bp |
Protein Length | 367 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | |
Product | Mammalian cell entry related domain protein |
Protein accession | YP_003649082 |
Protein GI | 296141839 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCAATCCG TGACGACCGG CGATCCCAAC GATCCCAACG ACCCGGGCAA CCAGGACCTC GCCCGGCGCG CCGTCGCCGG GGCACTGCGG CGCCGGCCGC CGCGCCGCGT GTGGCGCCGC ACCTGGTTCC GGTTCAGCTC CGATTCCGAC CGCCAATTGG GCTGGGGCAT CGCCTCGGCG ATGCTCGCGG CGGTGATCAT CATCGCGGCC GGGTACCTCT CGCTGGCCCC TCCGGGCAGC ACCTCGTACT CGGTCGATAT GGCCGAGACC GGCCAGTTGC GCAAGGGCGA CGATGTCCGC GTGGCGGGTG TCCCCGTGGG ATCGGTCAGC GGTGTAGCGC TGCGCCGCAA CGATGTCCGG GTCACCATCC GGGTGGATAA GAGCGCCTTC ATCGGCAATC AGTCCACCGC GGCGGTGAAG ATGCTGACGG CGGTCGGAGG CTACTACCTC GATATCGACT CCCTGGGAGC CACATCCCTC GGCGACGGGT CGATTCCGGC GAACCGGGTG CGCCTGCCCT ACACCCTGAC CGAGACCTTC CAGACCGCCG GTCCCAAACT CGGTGCGGTC GACGGTCAGC CGCTGCGCGA ATCCCTGGTT CAGATCCAGC AGGCGACCTC CGGGCAGCCG GGGCAACTGC GGCACGCGAT CACCACGCTG TCCGGAATGG TCGATGCCCT GGGCCGGCAG AAGGACCAGA TCGGCAGGTT CATCACGGTG GTCTCCGAAT ACACCACGGC GGTGAACGAG AACGGTGACC GGCTCACGGC GATCATGCAG GATATGAGCC TGTTCCTCTC GACCGCATCG CTCAACGTGG CCGGATACAA GGCGTTCATG CGGGCGCTCG AGCTGACGCT GCTGCGCATC AAACCCCTCG CCGATCTCTA CCTGCGCGAT ATCGACGGCT TCGAGAGGCA GCTCCGGGTG ATCTCCGGGC AGATGCAGGA ACTGCTCCAG AAATTCGAGC CGATGATCGA CGAGGGTAAG AAGGCCCTGG CCCGGGTCAA CGAGGCCATT CAGCCCGACG GCAGCGTCAA GATCGGTGGG AAGACGGTGC TCTCGTCCGC GTTCTGCATT CCGATTCAGG GGGTGAATTG CTGA
|
Protein sequence | MQSVTTGDPN DPNDPGNQDL ARRAVAGALR RRPPRRVWRR TWFRFSSDSD RQLGWGIASA MLAAVIIIAA GYLSLAPPGS TSYSVDMAET GQLRKGDDVR VAGVPVGSVS GVALRRNDVR VTIRVDKSAF IGNQSTAAVK MLTAVGGYYL DIDSLGATSL GDGSIPANRV RLPYTLTETF QTAGPKLGAV DGQPLRESLV QIQQATSGQP GQLRHAITTL SGMVDALGRQ KDQIGRFITV VSEYTTAVNE NGDRLTAIMQ DMSLFLSTAS LNVAGYKAFM RALELTLLRI KPLADLYLRD IDGFERQLRV ISGQMQELLQ KFEPMIDEGK KALARVNEAI QPDGSVKIGG KTVLSSAFCI PIQGVNC
|
| |