Gene Information |
Locus tag | Sare_2216 |
Symbol | |
ID | 5703897 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 2551758 |
End bp | 2553041 |
Gene Length | 1284 bp |
Protein Length | 427 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641271696 |
Product | von Willebrand factor type A |
Protein accession | YP_001537067 |
Protein GI | 159037814 |
COG category | |
COG ID | |
TIGRFAM ID | |
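The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The record does not state these conventions, and the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 2551758
end_bp = 2553041

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that encodes no amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(f"Gene length: {gene_length} bp")
print(f"Protein length: {protein_length} aa")
```

Under these assumptions, the computed values agree with the Gene Length and Protein Length fields of the record.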
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.359753 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.122606 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCAGGA CGAGACGATC GGCGGCAGTC CTCGTCGGAC TGCTGGCGAT GAGCGTGATG ACGGGCCCGG CACCGGCCCT CGCGGACGGC GAGGCCCCGG TCGAACCGCC GAAGGTCGAG TTGGTCCTCG ACGTCAGCGG TTCGATGCGG GCCACGGACA TCGACGGGCG AAGCCGGATC TCGGTGGCCC AGCAGGCGTT CAACGAGGTG GTAGACGCGC TGCCCGACGA GACTCAGCTC GGCATCCGGG TCCTCGGTGC CACCTACCCG GGTGAGAACA AGGAGCGGGG CTGCCAGGAC ACCCAGCAGA TCGTACCGGT GGGGCCGGTC GACCGGGTGC AGGCCAAAGC CGCGGTCGCG ACCCTCCGTC CGACGGGATT CACCCCGGTC GGGCTGGCCC TGCGCTCGGC TGCCCAGGAT CTCGGCACCG GTAGCACCGC CCGGCGGATC GTGCTGATAA CCGACGGTGA GGACACCTGC GCCCCTCCCG ACCCCTGCGA GGTGGCCCGG GAACTGGCCG CGCAGGGCAC GAAACTGGTC GTGGACACCC TTGGCCTGGC GCCGGACGAG AAGGTACGCC GCCAGCTGCT CTGCATCGCC GCAGCCACCG GTGGCACGTA CACCGCGGCG CAGAGCGCGG ATGAACTGAC CGGCCGGATC AAGCAACTGG TCGACCGGGC ACGGGACACG CACACCGCCA CGCCGGCAGT GGTCGCCGGC ACCCCGGCCT GCGCCGACGC GCCGCTGCTT GGCGCCGGGG TCTACAGCGA CCGTGAGAAG TTCTCGGAGC ACCGTTGGTA CCGGGTACCG GTGCACCCGG GGCAGGAACT GCGCGCCTCG GTCAGCGTGG CGTTGGACCG GCCGGTCAAC CCCGACCATG CGGTGCTGCT GCGGGCCGTG GCCACCGACG GTCGGGAGTT GGTGCGTGGC GTGGATGCCG GCAGCGGCCG GACCGACGTG GTCTCCGCCG GCCTGCGGTG GTCGGCGAGT GAGGAGCCGG AGGACGGGCC GTCCCCGACC CCGTCGGCCA CGACCGGTGC CGAAGCCACC ATCGTCTGCC TCGTGGTGAG CAATGCCTTC GCGCCTCAGC CGGGAACCCA GACGTCACCC GGGCTGCCGG TCGAGCTGAC CGTGGACGTG GTCGCGTCCT CGCCTGCCCC GGCGGCTCCG GATCTGGGTC GTGGCTGGGT GCTGCTCGTC CTGCTGACCG TGGTCGGCCT GCTGGCCGGA CTGGCGTCCG GGATGCTCAG CCGGTGGTGG CTGGCGACCT GGAGGGAGAA GTGA
|
Protein sequence | MIRTRRSAAV LVGLLAMSVM TGPAPALADG EAPVEPPKVE LVLDVSGSMR ATDIDGRSRI SVAQQAFNEV VDALPDETQL GIRVLGATYP GENKERGCQD TQQIVPVGPV DRVQAKAAVA TLRPTGFTPV GLALRSAAQD LGTGSTARRI VLITDGEDTC APPDPCEVAR ELAAQGTKLV VDTLGLAPDE KVRRQLLCIA AATGGTYTAA QSADELTGRI KQLVDRARDT HTATPAVVAG TPACADAPLL GAGVYSDREK FSEHRWYRVP VHPGQELRAS VSVALDRPVN PDHAVLLRAV ATDGRELVRG VDAGSGRTDV VSAGLRWSAS EEPEDGPSPT PSATTGAEAT IVCLVVSNAF APQPGTQTSP GLPVELTVDV VASSPAPAAP DLGRGWVLLV LLTVVGLLAG LASGMLSRWW LATWREK
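The sequences above are displayed in space-separated blocks of 10 characters, so the spaces have to be removed before any analysis. The following is a minimal sketch of that cleanup together with a GC fraction calculation. The function names `clean_sequence` and `gc_fraction` are hypothetical helpers, not part of any database tool, and the sample uses only the first three blocks of the gene sequence.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the whitespace separating display blocks and uppercase the result."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a DNA string."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First three blocks of the gene sequence, as displayed in the record.
sample = "ATGATCAGGA CGAGACGATC GGCGGCAGTC"

print(len(clean_sequence(sample)))
print(round(gc_fraction(sample), 2))
```

Applied to the full gene sequence, the result can be compared with the GC content field of the record.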