Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_3975 |
Symbol | |
ID | 5705252 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 4515671 |
End bp | 4517422 |
Gene Length | 1752 bp |
Protein Length | 583 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641273400 |
Product | von Willebrand factor type A |
Protein accession | YP_001538756 |
Protein GI | 159039503 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.734666 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0036434 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGTCTCCAG GCCGCCATCG CATCCGAACG AACGTCCGCG CTGCCGGAGC CGCGGCAGCG GCCGGCGTGC TCGCCGTCGC TGCTGGTGGC TACTTCGGCT ACCGCCAGCT CGCCTCACCG GGCTGCTCAG GCCAGATCGA GTTGGCTGTC GCGGTCGCGT CCGAGCTGGC GCCGGCGGTC GACACCACGG CGACCGAGTG GGAGAACGAG GGCGCAGTGG TCGGCGGCAC CTGCATCGAG GTCAACGTCA CGGCCTCCGA CCCGGTCGAG GTAGCCGCCA CCGTCGCGGC CAAGCATGGT GCCGTCCTGG CCGGGGTGGG CCAGGCCAGC GGCACCGCGA TCAGCCCGGA CGTCTGGGTG CCCGACTCGT CCGCGTGGCT GCTGCGGCTC AAGACCGGGG GCGCGACCGC GTTCGACCCG GGTAACAGGG CGTCCATCGC CTACAGCCCG GTGGTCGTGG GGGTGCCGGA GCCGATCGCC ACCCAGCTTG GCTGGCCAGA GAGCAAGCTC ACCTGGTCAG GGCTGGTCGG CCAGGTCAAC AACTCCAAGC CGATCAAGGC CGGCACCGTG AATCCGACCC GGGATGCCGC CGGTCTCTCC GGGCTTCTCG CGCTGAGCGC TGCCGCCGGG GCCGGGGAGA ACGGCCAGGC AGCCACCGTC GGCGCGTTGC GTGCGCTGTC GACCAACAGT GCGAATCTGC GTCAGGAACT GCTCGCGAAG TTCCCCACCT CCCCGGATTC CACATCGGTG GCCCGTGGTC TCGGCGCGGC GGCGTTGTCC GAGGAGGATG TGCTCTCGTA CAACGCCAGG AAGCCGGCGG TGCCGTTGGT GCCGCTCTAC CTGGAGCCGG CGGCGATGCC GTTGGACTAT CCGTACGCGG TGCTGCCCGG GATCGAGCCA GCCAAGGCGT CCGCGGCGCA GATGCTGTTT GAGGTGCTCG CCACAGCCAG TTTCAAGGAC CGGTTGGCGC CGCTGTCGCT GCGTGCGCCG GATGGCACCT GGGGCGCTGG TTTCGGCGCG CCCCAAGGGG CGCCGAGTCC GGAGGTCGGT GGGGCATCGA CGGAGCCCGG CAGTGGCGAC GCCGCGGGTG CCGTGGATCC GGTGGCGGTT GACCGGGCGG TCGCCAGCTG GTCGATTGCC ACCCAGTCTG GCCGGATGCT CTGTGTCATC GATGTCTCCG GCTCGATGAA GGGTTCGGTG GCGGGCGCCG GCGGTGCCAG CCGTCAGCAG GTCACCCTGG ATGCCGCGCG GCGAGGGCTC AGCCTGTTCG ACGACAGCTG GCAGATCGGA CTGTGGGAGT TCTCGACGAA TCTCGGCAGC GGACGGGACT ACCGGCGACT GGTCGAGATC GGCCCGTTGA GCAACCAGCG AAGCAGGCTT GAGCAGGCGT TGACCCAGAT CCAGCCCACT CGGGGTGACA CCGGCCTGTT TGACACGGTG CTCGCCGCCT ACGAGGCGGT TCAGGAGGAA TGGGATCCAG GCCAGGTCAA CTCCATCGTG CTCTTCACCG ACGGTAAGAA CGACGACGAC AACGGCATCA GCCAGCAGCA ACTGCTCGCT GAACTGGAGC GGATCAAGGA CGCGGAGCGG CCGGTGCAGG TGGTGCTGAT CGGGATCGGC GCGGATGTCA GCAAGGCAGA GTTGGAGTCG ATCACCAAGG TCACCGGTGG TGGTTCCTTC GTCACGGAGG ATCCAACCAA GATCGGGGAC ATCTTCCTCA AGGCCATCGC GCTGCGGAAG CCGGGTGCCT GA
|
Protein sequence | MSPGRHRIRT NVRAAGAAAA AGVLAVAAGG YFGYRQLASP GCSGQIELAV AVASELAPAV DTTATEWENE GAVVGGTCIE VNVTASDPVE VAATVAAKHG AVLAGVGQAS GTAISPDVWV PDSSAWLLRL KTGGATAFDP GNRASIAYSP VVVGVPEPIA TQLGWPESKL TWSGLVGQVN NSKPIKAGTV NPTRDAAGLS GLLALSAAAG AGENGQAATV GALRALSTNS ANLRQELLAK FPTSPDSTSV ARGLGAAALS EEDVLSYNAR KPAVPLVPLY LEPAAMPLDY PYAVLPGIEP AKASAAQMLF EVLATASFKD RLAPLSLRAP DGTWGAGFGA PQGAPSPEVG GASTEPGSGD AAGAVDPVAV DRAVASWSIA TQSGRMLCVI DVSGSMKGSV AGAGGASRQQ VTLDAARRGL SLFDDSWQIG LWEFSTNLGS GRDYRRLVEI GPLSNQRSRL EQALTQIQPT RGDTGLFDTV LAAYEAVQEE WDPGQVNSIV LFTDGKNDDD NGISQQQLLA ELERIKDAER PVQVVLIGIG ADVSKAELES ITKVTGGGSF VTEDPTKIGD IFLKAIALRK PGA
|
| |