Gene Information |
Locus tag | Tpau_2080 |
Symbol | |
ID | 9156235 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | + |
Start bp | 2170709 |
End bp | 2171692 |
Gene Length | 984 bp |
Protein Length | 327 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | |
Product | von Willebrand factor type A |
Protein accession | YP_003647031 |
Protein GI | 296139788 |
COG category | |
COG ID | |
TIGRFAM ID | |
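The coordinate and length fields in the table above can be cross-checked against each other. The following is a minimal sketch, not part of the original record; it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes a single terminal stop codon. The function names are hypothetical and chosen only for illustration.

```python
# Values copied from the Gene Information table.
start_bp = 2170709
end_bp = 2171692
gene_length_bp = 984
protein_length_aa = 327


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons should hold if the record is internally consistent.
coords_consistent = span_length(start_bp, end_bp) == gene_length_bp
protein_consistent = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_consistent, protein_consistent)
```

Under these assumptions the listed values agree: 2171692 - 2170709 + 1 = 984 bp, and 984 / 3 - 1 = 327 aa.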
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.829092 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGGGTGGAT TGACCTCGCT CGCGAACCCG ATCTGGCTGC TCGGCATCCT GCTCGTCGCC GCACTGCTCG CCGGTTACGT CTACAACGAA CGCCGGAGAC AGAAGCGGAC ACTGAAGTTC GCGAACACGT CGGTGCTCGA CTCGGTGGCG CCGCCCGGCA CGAACCGGTG GAAGCACGTC CCGATCGCAC TGCTGGCGAT CGGCCTGGTG CTGCTGATGG TCGCGCTGTC GGGGCCGCAG GCTATGCGGA AGGTGCCGCG TAATCGCGCC ACCGTCGTGC TCGCGATCGA CGTGTCGTTG TCGATGGAGG CCCGCGATGT GGAGCCCGAC CGCCTCACCG CGGCCAAGGA GGCGGCCAAG AAGTTCGTGA CCGAGCTGCC CAACGGTGTG AACCTCGGCA TCGTCTCGTT CGCGGGCACG GCTTCTCTCC TGGTCTCGCC GACCCCCGAC CGGACGCTCG CGCTCAACGC CGTCGACAAG CTCGAACTGG CGCAGCGCAC CGCGACCGGC GAGGGGATCT ACACCTCGAT CCAGTCGATC AAGAACATCC GGGACGTGCT GGGAGGCGAG GACAACGCCC CACCCGCGCG GATCATCCTG GAATCGGACG GTAAGCAGAC CGTGCCCACC GACCTCGACG ACCCGCGGGG CGGATTCACC GCGGCGCGCA AGGCGAAGGA GGAGGGGATA CCCATCTCCA CCATCTCGTT CGGTACCACC AGCGGCAGCG TGAACATCGG CGGTCAGAAC ATCCCGGTCC CGGTGGACGA CGCCTCACTG AAGCGGATCG CCGAATTGTC GGGCGGCCAG TTCTTCGCCG CCTCGTCGCT CAACGACCTC AATGAGGCGT ACGGCAGCCT GCGCGACGAG ATCGGCTGGG AGATGCAGAA GGGCGACAAC TCGCGGATCT GGATGCTGTG GGGAACCCTG ATCATCCTCG TGGGCGCGGC CGCCGCGGTG GGCATGAACC GACGCCTGCC GTGA
Protein sequence | MGGLTSLANP IWLLGILLVA ALLAGYVYNE RRRQKRTLKF ANTSVLDSVA PPGTNRWKHV PIALLAIGLV LLMVALSGPQ AMRKVPRNRA TVVLAIDVSL SMEARDVEPD RLTAAKEAAK KFVTELPNGV NLGIVSFAGT ASLLVSPTPD RTLALNAVDK LELAQRTATG EGIYTSIQSI KNIRDVLGGE DNAPPARIIL ESDGKQTVPT DLDDPRGGFT AARKAKEEGI PISTISFGTT SGSVNIGGQN IPVPVDDASL KRIAELSGGQ FFAASSLNDL NEAYGSLRDE IGWEMQKGDN SRIWMLWGTL IILVGAAAAV GMNRRLP
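The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The sketch below is illustrative only: the helper names are hypothetical, and the example string is just the first three blocks of the gene sequence, so its GC fraction need not match the 67% reported for the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove the block-separating whitespace and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for base in seq if base in "GC") / len(seq)


# First three blocks of the gene sequence, copied from the record.
prefix = clean_sequence("ATGGGTGGAT TGACCTCGCT CGCGAACCCG")

print(len(prefix))              # number of bases after joining the blocks
print(prefix.startswith("ATG"))  # whether the fragment begins with a start codon
print(round(gc_fraction(prefix), 3))
```

Applying the same two helpers to the full gene sequence line would let the Gene Length and GC content fields be verified directly.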