Gene Information |
Locus tag | Sare_4840 |
Symbol | |
ID | 5707745 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 5490262 |
End bp | 5492988 |
Gene Length | 2727 bp |
Protein Length | 908 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641274236 |
Product | hypothetical protein |
Protein accession | YP_001539581 |
Protein GI | 159040328 |
COG category | [S] Function unknown |
COG ID | [COG5373] Predicted membrane protein |
TIGRFAM ID | |
|
|
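The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the stated gene length) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and chosen only for illustration.

```python
# Values copied from the Gene Information table above.
START_BP = 5490262
END_BP = 5492988
PROTEIN_LENGTH_AA = 908


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons in a CDS, excluding one stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                 # 2727, matching "Gene Length"
print(coding_codons(length_bp))  # 908, matching "Protein Length"
```

Under these assumptions, 2727 bp corresponds to 909 codons, of which the last is the stop codon, leaving the 908 residues listed in the record.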
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.191443 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.00180703 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCATCCG TGATCCGAGA CCTGCTGTCC GGCAGGAAGG ACGGCCCTGG TGACATCGTG CAGCACACCG TGGGTAACGC GTTGGTCCTG CACGCCGAGG AGACGATCAG CGCCGAAGCG CAGTCGCTGG CCCTGTCGGT CGTCAAGGAC GCTGACAACG ACGTGGTCGT CCTCGATCTC AGCGACGGCA TACCGATCAG CTCCTGGGAG TCCATGGCCG GCGTACTACC GCGTCGTCGA CGCGGTATCC GGCTCATGGC GTGCGGTCCG CACGCGAACA CCGCGGCGAT GGCCGGACAG TGGCTGTCGG AGCGGTTGCA TCGCACGGTC ATCGCCCCCG ACGGTGACCT GGTCCGCGGC TCGGCCGGCG GCTTGTTCGT GCACTCGATA CCGGGCAGCG GCTGGGTCCG GTTCCGGCCC GGCAGGCCAC CGACCTGGGA TGCCAAGCGC TACCCGACTC CGCTGTGGGA TCGGGCCGCC ACCGACAACC GGGCGTCCAG CTCGACCGGG GAGATCGAGC CACTGCCCGG TGGGGTGTGG ATCCGCGACG TGCGCGAGCC GGACGTCGTC GCGGAGCACC GCCGACGGCT CATAGCCGAC GTGCCGTGCC ATCCCGAGAC GATGACCGTG TTGCTGGGCT GCCCTGGCAC GGCCCCACTC TGCCTCGATG ACGTGGTCCG CTTCTGGCGG AACCTGGACG AGGACAGCCG AGCCCGCACC CGTTTCGTCC AGTACGGTGA CCTGCGCCTA CCGAAGGGTG AGACCTTCGG TCAGGCCCTC GCCGACCTGC TCGGCACCAC GATCGTCTGC TACACCGGGG TTCCGGTCGG GGGGCCGCGA CGATTCGAGG TCCGTACCGT ACGTTCCGAC GGCGCGCTCG GGTGGCCGCC GTTCGCGCTG GAGCTGCGAT ACGCGCCTCG CGCGCACCCC AACTCGAAGG CACACCGGCC CGTCGTGCTG AGTCACCGAC CGCCCTTGGC TGAGGCCGAG GAGGTCGCGC CGCGGGTCTA CTGGTACGCC CCGGACGCGG TGATCGAGGT GATCCAGTCC GGGCTGTGGG TCCGAAGCGC CGAGGAGCCG GTGAACGCGG AACGGGTCCG CGGTATGCCA CTCGACCCCG AGGGCAGTGT GCTCATCTAT GACGACACCA TCGGCGAGTC GGCTGAGCGA ATGTGGGCAC TCGCCGAGGA TCTCGCGGCC CGGATCGACC CGAGCGTGGG CGCGGACAGC TCACTGTTCT CGGCATCCGC ACTGGTCCCA GGCCAGTCCG CGGCGGGGCT CGCCCAGGCG TCCCTCGGGG AGGATTCGCC GCGTCCGACG GTTACCGGGC AGCTGTCCAC GTCAAGGCCG CCGGCCCCGG AGGACCTGCC CGCCCCCACC GTGGACACGA TCGTGATCAG GCCCGTGGTC GAGTCGGTCG CGGAGGCCGT TCCCCCGGTG GCCACCGCAA CGGCCGCGAC GGTGGTGCGT CCAGACGTCA CGCCGGAGAC GCGGCCGGCC GGGGAACCGG CACCACTGCA GGCCGAGGCA CCCGCACCGC AACCGGCCGA CGAACCAACA CCACAGCCAG CCGAGGCACC CGCACCACAG CGGGCCGAGG AATCGGCACC ACAACCGGCC GATGAACCGG CGCCGCGGCA GGCTGAAGCA CCCGCACGGC CAGCCGGGAG CCCGACACCG CCACCGGTCC TGGATCCAAC GGCGCCGGTC GCGGAGTCCT TCGTCAAGAC CGTGCAGGAC CCGATCGCCA CCGTCGAACC GGTCGAGGGC ACCGTTGCCG AGCCGACCCG GCTGCACGCG 
GTCGCCGGCG ACGAAGCACC GGTCCAGAAA CAGCGACGGA TCCGGCCGGC CACAGCGTCC GCACCGGTGA CCGACACGGA CGCGCCGGAC GTGCGAATGC AGCCGGTGCC CCCGGCGTCG GCGTCGGCGC TGCTGCCGGG CCGCCCACTC GACGAGGAGC GGGCCTGGCT GCGCCGCACG CTGAGCCGGG AGTTCGACAG CATGGCCAGC TCGGTCGCGC GGATCATCTC CGAGCACCCC GGACTGCAGG CCTCCGGCGC GGTCGGTCGA GACGACGTCC TCGCCGACTC GGTCGCGGTC CGTCTCTACC TGTCGCGGCG GGGCACGGGA GTCGACGTCG GGTTGCGATC TGGCAGCAAC GGCCCGCACG TGCCGTTCGC CCGGTGCGCG GTCTCGGGCC TGTCCCGGCT ACCGTCGTAC CGCGGCGCCA CGATCTACCG CACCTCGCCC ACCGAGCAGG AATGGCAGCA CTACCGCACC CGTAGGCTGG TCACGAACTG GGCGTTCGTC AGCACGTTGA CCGGACCGTG CGAGAGTCAG GACGGCGACA CCGACGTCCT GGTGTGGTCG ATGACCGCGC GCCGAACGGC GCTGCTGGAA TCCGACGGTG CCGAGCGGGT CGAGGACCGG GCGCTTTTTC TTCCCGGCAC CCACTTCAAG ATCTTGGAAC TACAGGAACC GTCCGGCGGC GACCGTGGCG CGATCCTCAT GCGGGAGATC GGCGCGAAGG AAATCGACGG GGACGGCCGG GTTGATCCCG ACCGGGCGCC GCTCGACGAC CTGGCCATCG TGTCGCTACG TCGCAGTCTC GAACGCTGGT CGACCGCTGA ACCCACCCGA CGGGTCGGCG CGGGCTCGAC GGGTCGGTTC GGCCTCCTAC CGGGGCTGGA CCGCCGCGCG GTCGGCGAGA AGGTGAGCGA ACGGTGA
|
Protein sequence | MASVIRDLLS GRKDGPGDIV QHTVGNALVL HAEETISAEA QSLALSVVKD ADNDVVVLDL SDGIPISSWE SMAGVLPRRR RGIRLMACGP HANTAAMAGQ WLSERLHRTV IAPDGDLVRG SAGGLFVHSI PGSGWVRFRP GRPPTWDAKR YPTPLWDRAA TDNRASSSTG EIEPLPGGVW IRDVREPDVV AEHRRRLIAD VPCHPETMTV LLGCPGTAPL CLDDVVRFWR NLDEDSRART RFVQYGDLRL PKGETFGQAL ADLLGTTIVC YTGVPVGGPR RFEVRTVRSD GALGWPPFAL ELRYAPRAHP NSKAHRPVVL SHRPPLAEAE EVAPRVYWYA PDAVIEVIQS GLWVRSAEEP VNAERVRGMP LDPEGSVLIY DDTIGESAER MWALAEDLAA RIDPSVGADS SLFSASALVP GQSAAGLAQA SLGEDSPRPT VTGQLSTSRP PAPEDLPAPT VDTIVIRPVV ESVAEAVPPV ATATAATVVR PDVTPETRPA GEPAPLQAEA PAPQPADEPT PQPAEAPAPQ RAEESAPQPA DEPAPRQAEA PARPAGSPTP PPVLDPTAPV AESFVKTVQD PIATVEPVEG TVAEPTRLHA VAGDEAPVQK QRRIRPATAS APVTDTDAPD VRMQPVPPAS ASALLPGRPL DEERAWLRRT LSREFDSMAS SVARIISEHP GLQASGAVGR DDVLADSVAV RLYLSRRGTG VDVGLRSGSN GPHVPFARCA VSGLSRLPSY RGATIYRTSP TEQEWQHYRT RRLVTNWAFV STLTGPCESQ DGDTDVLVWS MTARRTALLE SDGAERVEDR ALFLPGTHFK ILELQEPSGG DRGAILMREI GAKEIDGDGR VDPDRAPLDD LAIVSLRRSL ERWSTAEPTR RVGAGSTGRF GLLPGLDRRA VGEKVSER
|
| |
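The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a short sketch. It is not the tool that produced this record. It builds the codon-to-amino-acid mapping shared by the standard code and translation table 11 (the two differ in permitted start codons, which does not matter here because this gene starts with ATG), strips the spaces used to group the sequence into blocks of ten, and translates up to the first stop codon. The helper names `clean`, `translate`, and `gc_content` are hypothetical, and only short fragments copied from the start and end of the gene sequence are used.

```python
from itertools import product

# Codon table in the conventional TCAG ordering. Amino-acid assignments
# are the same for the standard code and for translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used for display grouping and upper-case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases and last 15 bases of the gene sequence in this record.
head = "ATGGCATCCG TGATCCGAGA CCTGCTGTCC"
tail = "GTGAGCGAACGGTGA"

print(translate(head))   # MASVIRDLLS
print(translate(tail))   # VSER
print(gc_content(head))  # 0.6 for this fragment only
```

The two translated fragments match the first ten and last four residues of the protein sequence listed above. The GC value printed here applies to the 30-base fragment alone. Checking the 71% figure would require running `gc_content` on the full gene sequence.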