Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_0933 |
Symbol | |
ID | 5708044 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 1044008 |
End bp | 1045780 |
Gene Length | 1773 bp |
Protein Length | 590 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641270451 |
Product | hypothetical protein |
Protein accession | YP_001535839 |
Protein GI | 159036586 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCACCGT CGACCGCAGT AGCGACTCGA AAACGGGAGT GGGGCATGGT GAAGGCGAAG GAACTCACTG GCGACGAGAG TTCCTGGAAC TGGAAGCAAA TCAAGGCGGC CGTCAACGGT GGATCCGAGA TTCCCGCCGG TGACAGCAGC GCCCACGAGG AAGCCCGCGG GGTGAGCAAC CCGGAAAGCT TTTTCCGGTT GGCGCAGAGC TTTGCTCGGG TCCAGGCGAC ACTGGCGCTG TGTCGCGACG TCACCCAACT CCATTCTCAC GCCATCGCCG GCGACGACCG TCCCTGGCAG GGTGGCGCAG CAGATGCGTT TAACCGTTCT ATGAAGTGGA CCGTGGGTGT GCTCGACTCG CACATCGATC AGATCACCGC GGCGGCCGAC AAGGATCGGC CGGAGCAGGG TTCGATCATT GAGAGTCTGG TGGCTGCGGG CAACCGACTC GCCTACTCCC GCGCCGTGCT CGACGCCATC GACAGCCACT ATGCCAGCGA GGCGAGGCGG CTGGGTGTCG AGCCCATGGA CAACGGCCTG ATTCCAGTTT CCAAGCGCCC TGACATCGTT GCGATGATGG ACCGTGACAT GCGCGCGGAG TTCGAAAGGC TGAACGAGCA ATACTCGTTG ACCGTCAACG ACCTGCGCCC GCCGCCTCCG GAGGAGCCGC CAACTACCGA TCCTTTGGCC CCGCCAGGGG AGGGGCCGGT CACTGACCCA CCGCCACTCC CGGATCGGGA TCCGTCGACT CGGCCCGACA TGCCCGCGGA CTGGATCCCG AACGGTGCAA GCCCGGACTC CTTTCCGACT ACCGAGCCGC CAGGAGGGCT GTCCGATTCA TCGTCGATCG GATCCCCTGA GCCTGTCAGT GGCACCGGCC TTGAGGGGCC GTTTGTTCGA CCGTTCGACA GCCCCAATGG CGCTGATCCA CTGCTCGACG AGCCCGCCAC ACCCTTTTCC GAACCCACCC TCGGCCCTGG TGACACTCCC GCAATGGACT TCACGCCGGC GACGACCTTG GCCGGCGCTG ACGCCCTCAC CAGTCCCACC GCGTTTCCCG GCGGCACCGT TTCGCCAGCG TCATCGACTG GCATCGGGTC CTTCCCGGGT GGTGCTGGCT CACCGGCGGC GGCACCCATG AGCCACAGCG CCCCGCTGCC CGCCGTTCCC CTGTCCACTC CCGCTTTCAA CGCCAGAGAA AGCCCAGCCG CTCGAGCTCC CAAGCTTGGT GGTGTTGGTG GTGGTGCTCC TGTGCCGTTT GTGCCTGGTG GTGCGCCGGG TTCCGGGTCG GGTGGGGGTG CTGGTTCCCG CAGGCCGAGG GTCAGTGGTG TTGGTGGTGG TGCTCCTGTG CCGTTTGTGC CTGGTGGTGC GCCGGGTTCC GGGTCGGGTG GGGGTGCTGG TTCCCGCAGG CCGAGGGTCA GTGGTGTTGG TGGTGGTGCT CCTACGCCCT TCGTCCCGGG CGGCGGTCCG GCCTCCGGCG GTAACGGTGG CAGCTCGCTC CGACGGACCC GACCTTTCGT TGGCGGACCT GGAGGCATCC GAAGCAGTTC TACGGGTTCG GCGCCCGCAC CGGGTGCCGG CAAGGTCGAG GGCACCGACA GGCTGCGGCC AGGCAGCGGG GGTACGGGAG TTCCCTTCGC AGGTGCCCCC GCCACGACCG CGACAGGAAA GGAGGCCTCG CGGGAGCGCA AGACGTGGCT TGTCGAAGAG GACGACGTGT GGGGCGCCGA CCCTGACTCC CCTGCTGGCC CGATCGGGCG CCCAGGCGCA TGA
|
Protein sequence | MAPSTAVATR KREWGMVKAK ELTGDESSWN WKQIKAAVNG GSEIPAGDSS AHEEARGVSN PESFFRLAQS FARVQATLAL CRDVTQLHSH AIAGDDRPWQ GGAADAFNRS MKWTVGVLDS HIDQITAAAD KDRPEQGSII ESLVAAGNRL AYSRAVLDAI DSHYASEARR LGVEPMDNGL IPVSKRPDIV AMMDRDMRAE FERLNEQYSL TVNDLRPPPP EEPPTTDPLA PPGEGPVTDP PPLPDRDPST RPDMPADWIP NGASPDSFPT TEPPGGLSDS SSIGSPEPVS GTGLEGPFVR PFDSPNGADP LLDEPATPFS EPTLGPGDTP AMDFTPATTL AGADALTSPT AFPGGTVSPA SSTGIGSFPG GAGSPAAAPM SHSAPLPAVP LSTPAFNARE SPAARAPKLG GVGGGAPVPF VPGGAPGSGS GGGAGSRRPR VSGVGGGAPV PFVPGGAPGS GSGGGAGSRR PRVSGVGGGA PTPFVPGGGP ASGGNGGSSL RRTRPFVGGP GGIRSSSTGS APAPGAGKVE GTDRLRPGSG GTGVPFAGAP ATTATGKEAS RERKTWLVEE DDVWGADPDS PAGPIGRPGA
|
| |