Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_3970 |
Symbol | |
ID | 5705247 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 4508499 |
End bp | 4509545 |
Gene Length | 1047 bp |
Protein Length | 348 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641273395 |
Product | LacI family transcription regulator |
Protein accession | YP_001538751 |
Protein GI | 159039498 |
COG category | [K] Transcription |
COG ID | [COG1609] Transcriptional regulators |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.149546 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACACGGA TCGACGACGT TGCCCGGCTG GCCGGAGTCT CGACCGCCAC TGTCTCCCGG GCGCTACGCG GGCTCCCGAC GGTCTCGGCG GCGACGCGGC ACCGGGTTCT AGCCGCCGCC GAACAACTCC AGTACACCGT CTCACCGAAC GCGTCGCGGC TGGCCGGCGG GCGTACCGGC ACGGTCGCCG TGGTCGTTCC CCGGATCACC CGTTGGTTCT TCGGAGTCGT CGTCGAGACG GTCGAGGACT TCCTCCACCG AGGCGGCTAC GACCTGCTGC TGCACAATCT CGGCGGGCGG GAGCGGACCC GACAGCGGGT GCTGCGTACC GCCGACCTAC ACAAGCGGGT CGACGGAATC ATCCTGGCGG CCACCCCACT GCGGGCACCC GAGCTGGCCT TCCTGTCTGC GCTGGACCTG CCCGGGGTCA TCGTCAGCTC CGGCACGAAC GTGCCCGGCT GGCCGTGCGT ACGCATCGAC GACGTCGCCG CCGCGCGGAC CGCCACCCGC CACCTGCTCG ACCTCGGACA CCGGCGGGTC GCGCACATCT CCGGCGACCC CGACGACGAA CTCGCGTTCA CCGCCCACCT GGACCGGCGG CGCGGCTACC GGGAGGCGCT GCGCTCGGCG GGCATCCGAC CCGACCCGAG TCTCGACATC GAATCCCGGT TCGACGTCGA CGGCGGTATC CGAGCCACCG AGGAGTTACT GCGTCGGGGC GACCCTCCCA CCGCGATCTT CGCCGCCTGC GACGAGATGG CGATGGGGGC ACTGACCGCG CTGCGGGACG CCGGGCTGCG GGTGCCGGAC GACGTGAGCG TGATCGGCAT CGACAACCAC TACCTGGCGG GTGTGCTCGG ACTGACCACC GTCGCCCAGT CCCCGACCGA CCAGGGGCTG ATCGCCGCGA AGACCCTGCT CGGTGCACTG ACCGGTCGGC CTGCCGACCC ACTTCGCGCC GCGGACGGGC CGGTGGTCCT GCCCACCCGA CTGGTCGTCC GGGAAACGAC CGCGCCGCCA CGAACGCCGG AGTCCGCCGC CCGATGA
|
Protein sequence | MTRIDDVARL AGVSTATVSR ALRGLPTVSA ATRHRVLAAA EQLQYTVSPN ASRLAGGRTG TVAVVVPRIT RWFFGVVVET VEDFLHRGGY DLLLHNLGGR ERTRQRVLRT ADLHKRVDGI ILAATPLRAP ELAFLSALDL PGVIVSSGTN VPGWPCVRID DVAAARTATR HLLDLGHRRV AHISGDPDDE LAFTAHLDRR RGYREALRSA GIRPDPSLDI ESRFDVDGGI RATEELLRRG DPPTAIFAAC DEMAMGALTA LRDAGLRVPD DVSVIGIDNH YLAGVLGLTT VAQSPTDQGL IAAKTLLGAL TGRPADPLRA ADGPVVLPTR LVVRETTAPP RTPESAAR
|
| |