Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_3801 |
Symbol | |
ID | 5318099 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | + |
Start bp | 250914 |
End bp | 251918 |
Gene Length | 1005 bp |
Protein Length | 334 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640775614 |
Product | von Willebrand factor type A |
Protein accession | YP_001312547 |
Protein GI | 150375951 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG1240] Mg-chelatase subunit ChlD |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.142641 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.822741 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTACGTCC TTGATCATCC CTGGCTCCTG CTGCTTCTGC CGGCACCCCT GTTCGTCTGG TGGCTGCTGC CGCCCTATCG GGAACAGACG CCGGCAGTGC GCATTCCCTT TTTCGAGGAC ATCACCCGGG CCGCTGGCAT CGGCCCGACG GAAGGCTCGG TCGTGCCGCG TGCCAACCTC CTGCAGAAGA TCATCGCGCC GATCTGCTGG CTCCTGGTGT TGACGGCGCT TGCCCGGCCG CAATTCGTAG AGCCGCCGAT CGAGAAGACC GAGCCCCAGC GCGATCTGAT GCTCGCGCTC GACCTCTCGC AATCGATGGA CACGCGCGAC TTCAGCGACC CGCAGGGCAA TCTTCAGGCG CGGGTAGATG CGGTGAAGAC CGTGGTGGCA GACTTTGTCG ATCGCCGTCC GTATGATCGC CTCGGTCTCG TCGCTTTCGG TGACGCCCCC TATCCGCTCG TTCCCTTCAC CATGGATCAT GCCACCGTCC GGTCCATGCT GACCGGCGCC TTACCGGGCA TGGCCGGCCC AAAAACGGCT CTCGGCGATG CGCTGGGGCT TTCGATAAAA CTGTTCCAGC AGAGCCAGGC TCCTGACAAG GTGCTGGTCG TTCTGACCGA CGGCAACGAC ACCGCCAGCA AGATGCCGCC GGACAAGGCC GCCGAGATTG CGAGCCAGAA CCACATCCGT ATTCATACGG TCGGCATCGG CAATCCCGAC GCCCAGGGAG AGGAAAAGCT CGATACCGAG ACGCTGCAAA AGATCGCCAC GGCTACCGGA GGACGCTATT TCTTCGGTCA GGACCAGCAA GCGCTCGCCG AGATATACAC GCTGCTCGAC AGCATCACAC CGGCGAACCA GAAAACGCTG AGCTGGCGCC CGCGCATCGA GCTGTTCCAC TACCCACTCG GCGCTGCCGT CCTCCTCGTA CTCGGCTATC ATGCCCTAAT GTGGCTTCTC TCGGTTAGTG CCGCCCGCAG GCGAAACAAC GAGGCTGAAG CATGA
|
Protein sequence | MYVLDHPWLL LLLPAPLFVW WLLPPYREQT PAVRIPFFED ITRAAGIGPT EGSVVPRANL LQKIIAPICW LLVLTALARP QFVEPPIEKT EPQRDLMLAL DLSQSMDTRD FSDPQGNLQA RVDAVKTVVA DFVDRRPYDR LGLVAFGDAP YPLVPFTMDH ATVRSMLTGA LPGMAGPKTA LGDALGLSIK LFQQSQAPDK VLVVLTDGND TASKMPPDKA AEIASQNHIR IHTVGIGNPD AQGEEKLDTE TLQKIATATG GRYFFGQDQQ ALAEIYTLLD SITPANQKTL SWRPRIELFH YPLGAAVLLV LGYHALMWLL SVSAARRRNN EAEA
|
| |