Gene Information |
Locus tag | Sare_3110 |
Symbol | |
ID | 5706550 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 3534486 |
End bp | 3536072 |
Gene Length | 1587 bp |
Protein Length | 528 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641272542 |
Product | fibronectin type III domain-containing protein |
Protein accession | YP_001537909 |
Protein GI | 159038656 |
COG category | |
COG ID | |
TIGRFAM ID | |
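
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers written for this illustration, not part of any database tool.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


# Values taken from the record above (Sare_3110).
start_bp, end_bp = 3534486, 3536072
length_bp = gene_length(start_bp, end_bp)

print(length_bp)                  # 1587, matching "Gene Length"
print(protein_length(length_bp))  # 528, matching "Protein Length"
```

The same check applies to any unspliced, non-pseudo CDS; a spliced gene would need the exon lengths summed instead.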
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0314465 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | GTGTCATCTC CCTCCCCGCT GGCCGCTGTC CAGCATCGGG CCCTCGCCCT GCGCCAGGCT GGTGACCTGG CCGGCGCGCG GCTGCTACTC ACCGACACCG TCGAGTCGGC CAGTCCACCG TACGGCCCGG ATCACCCTGA AATGCTCGGT GCTGCGCACC TGCTGGCGCG GCTGCACCGG GAGGCCGGTG ACCCGAGCTC CGCCCGGCGG GTGTTGGAGG AGGCGCTGGC AGCCGGCGAG CGCTGCCGGC CGGACGACGA TCCCCTGATG CTGGCGCTCG CCTTCGAGCT GGCGACACTG GCGGATGAAC TCGGTAACCG ACACGAGGCC CGCCGCAACT TCCGCCGGCT CGTCGCCGCC GGTCCGGACG TGCTTGGGAC CGACCATCCG GCGGTCCGCG AGGCACGGGC ATACCTCGAC GATGCCGGTC CCAGCCCGGC CGATGGCCCG TCCGTGCCGG GGCCCGTGCT CCCGGGATAC CGGGCTTTCG GGTCGGTCAC GGCACCTGTC GCGGACGAAC GGCTGGCGGA AGAGCCGGTC CGACCGGGTA CGTCACCAGC TCCGGATGGT TCCCCCGACG ACAGCGGCCA CCTGGTGCCC CACTGGCCGC CGGTCGGGTC GGCCTCCGGC AGCAGTGAAG CCCGCCAACA GGGCTGGCCG CAGGGTCTGG GATCTCCGCT CGGCTCGATT GGCGCTGGCC CCGCACCCGG GGCCTCGGGC CCGGACCCGG TGCTTGAACC GGGCCCGGCT CGGTGCGGCG ACGGGCGGGT CGCAGCCGAC GCCGCGCCGT CGGATGCCGC GCCGTTGACG GACGCCCACA CTGACCCGGC GTCGCGGCCT AGTCCGACCG GCCGGGGCAC AGCGACCAAC GGGCTGCCTG AGGGCGCTCC GCCACATCTC GGCGAGCCCG CCCCCCGACC CGGGTCCGCC CCCCGACCCG GGTCCGCCCC CCGACCCGGG TCCGTCCGCC GACCTGGGCC TGGTCCCCGC CCAGGGCCCC GCCCGCGACC CGGACCTCCC CCGCGGCTCG ATCCGCACGC CACGTCCGCT CCCCACCCGG AGTACGACCC GACGCACGGG TATGCGGGCG GACCGTCGAT CGTCCGTCCT GCCAGCGAGA CGTCGTCCGC GTACCCGGGG CGGAGGCCGG AGCGACACGG CAGGAACCGA ACGGCGGTGG TCGCGCTGGT CGCCGCTACG GTGGTGGCCG CGGCGGCGGT CGCCGGAGCG GGAGCGGTGG TACTCCTCCG GAACGCGCCG ACCGCTCCGC TGGTCTCGTC CATGCCGGAA ACGTCGACGG CCGGCCCGCC GAGCGAGGCG CCGCCGACGG ACCTCAGGTT GCGGGACGAA TCGACCTCGA TCACTCTCAC CTGGACTGAC CCGTCCGGCG GTACCGTTCC GTTCGTGGTG GCCGCCGGTC GGGCAGGGCA ACAACTGTCG CCGCAGGACA GTGTGGACCC GGGGCGGACC AGCTACACGA TCAACGGGTT GAGTTCCCGA CTGGACTACT GTTTCACCGT GCTGGCGGTG TATTCGACGG ACTCGTTCGC CACGTCCGGG CAGGTGTGCA CCGATCGCGA GGGATAG
Protein sequence | MSSPSPLAAV QHRALALRQA GDLAGARLLL TDTVESASPP YGPDHPEMLG AAHLLARLHR EAGDPSSARR VLEEALAAGE RCRPDDDPLM LALAFELATL ADELGNRHEA RRNFRRLVAA GPDVLGTDHP AVREARAYLD DAGPSPADGP SVPGPVLPGY RAFGSVTAPV ADERLAEEPV RPGTSPAPDG SPDDSGHLVP HWPPVGSASG SSEARQQGWP QGLGSPLGSI GAGPAPGASG PDPVLEPGPA RCGDGRVAAD AAPSDAAPLT DAHTDPASRP SPTGRGTATN GLPEGAPPHL GEPAPRPGSA PRPGSAPRPG SVRRPGPGPR PGPRPRPGPP PRLDPHATSA PHPEYDPTHG YAGGPSIVRP ASETSSAYPG RRPERHGRNR TAVVALVAAT VVAAAAVAGA GAVVLLRNAP TAPLVSSMPE TSTAGPPSEA PPTDLRLRDE STSITLTWTD PSGGTVPFVV AAGRAGQQLS PQDSVDPGRT SYTINGLSSR LDYCFTVLAV YSTDSFATSG QVCTDREG