Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_0180 |
Symbol | |
ID | 5706337 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 192672 |
End bp | 195074 |
Gene Length | 2403 bp |
Protein Length | 800 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641269706 |
Product | XRE family transcriptional regulator |
Protein accession | YP_001535106 |
Protein GI | 159035853 |
COG category | [K] Transcription |
COG ID | [COG3620] Predicted transcriptional regulator with C-terminal CBS domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.476576 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.00129717 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGGCAGATC AACTCGGATT GCTGTTGCGT CGGCTACGAA ACCAAGCGGG ATTGACACAG GAACAGGTCG CGGAACGGTC CGGGGTGAGT GTCCGCACCA TTCGCCGCCT GGAATCGGCG AGGAACATCG ATCACCGCCT GGGCACCCTG AACCTGTTGG CTGACGCGCT GGAACTTGGA TCCGAGGATC GCGAACTCCT CGCCACCATG CTCGCGAGAA CGAGCACCCC GCCCACGTTC GCGGTTCACG CTGCTTCCAC GGAGACTCGG CCAACTGCAT CGGCCAGCGG GCCGCCCAGG GCGCCATCGA GGCAGACGTC CGTCCTGGTG CCGGCGCGTG CCGTGCTGGA CGCTGCTGAG ACGTTGGCGA GGGAAGCCAA ACGTCGGTGG CAGCGCGAGG AGGAGCAGCG CCAGGTCCAC GACCCGTTTC CGTTGCCGCT GTGCTGGCAG CCGGCCCCCG CGCAGCTGGT CGACTACGCG GCGAACGTCC AGCGCCTTCC ACCAGGAGCT ACCCCACAAC TGGATCTCAG TGGTGATATG GGCAGTATCG CCGAGGTCTA CCGAAGGATC CGGTCCGGCC GGTTGGTGAT CCTGGGCCGG GCGGGTTCGG GTAAGTCGAT CATGTTGATC AGGCTCGTCC TGGACTCGCT CGCGGCCTTC ACCCTCCCCG AGCGGGTGCC GATGATCTTC AACCTCGGAT CCTGGGACGC GACAGCCATC ACCTTGCGGG ACTGGCTGAT CGGTCAGCTG TTGCGTGACC ACCCGCACCT GGCCCGCAGG GCACCCGGCG GTCCGACGTT GGCCGCCGCA CTGGTCGACG CTGATCTCAT CCTGCCCATC CTGGACGGGT TCGACGAGAT CGCCGACGGC CTGCGGCGCG AGGCGCTCGA AGCGTTCAAC GCGACCTCAC TGCCGCTTGT GCTGACCAGC CGCCGTGACG AGTACGCCGA GGCGGTGCGC GGAGCCGGGG CCCCGCTGAA CTGGGCGGCC GGTATCGAGC TCGTGGACCT CACCCTCGAT GACCTCGCGG CCTACCTGCC CCGGACCGCC AGGCAGGCCG GCCGCGACGA CACCGTAGCG GTGTGGGATC CCGTCCTGAA GCTGCTGCAG GCCACGACGT GCCCGGCGAG CGTGAACCTC ACGAGAGTAC TGTCCACTCC TCTGATGGTC GTCCTGGCGC GGACGATGTA TAGCGAGACA CCGGAACGGG ATCCGGCCGA ACTGCTCGAC ATAACGCGGT TTCCCAGCGC GAAGTCCGTC GAGGAGCATT TACTGGCAGG ATTTGTCCCG ACGGTCTACC GACCGTCCGT CCCCGACCGG GAGGCTGGCG GCTTCCGGCA GCGGACCTGG AACCCGCACC ACGCAGAGCG TTGGCTCGGC TACCTCGCCC ATCAGTTGGT ACGGCACGGC CAGGACCGGC GGGATCTCGC GTGGTGGCAG ATCGGCGACT CCCTTCGTCG TTCGACGCGT ATCGGGACCG CAACGCTGGT TTCTGCGCTG TGCACTGCCG TGTCGGCCTG GATGACCGGG CTGGTCGCCG GGCAGGTCGA CCCCGAGCAG ATCCTGGTGG AAGGGGCCAT GATGGGGCTG TCGGCCGGCC TCGCCTTCGG AGCTGTCTAT GCGGCGATAA CCGCCTTCGG CGGCACCTTC CAGCCGACCC ACGTGCGGCT GCGACTGCGT CGCCGCCACA GCGTCGTCGC CCGCCCGCCA ATCCAGACAT TCACCGTCAG GTTCGCAGTC GTCCTGCTGG GCGGGTTCGT CATGGGCGTT GGAAGCGCCT GCGCCACCGC CCTGGTACGC GCACGGTACT GGGAAACCCC GCTCGCGAGC CTTGAGGTGA TCAGGGCGAC CCTCATCAAC ATGCTGGTCT TCGGACTGAT CTTCGGTTTG GCGGCCGGAC TGGTGTTCGG GCTCCTGGCC GCGTTGGAGG TGCCGGTGGA CGTTTCCTTC GTCGCCACTC CGGTCAGCCT ATTGTCCGCG AACCGTGCGA CCGTGAGCCA ACAGATCCTC TTCCTTGCCC CCGTTCTCGC CCTGACAATC GCCGTCGGTG GACGGCTGGT CGTCGACCTG CTCGAGGGAG GCGTCCTCGG AGAGCTGAGA TGGGCCTGGC CCGACGCGTT TCTCATCGGG GCTGCCGGGG GGCTGGGAGG CGCATTGTCG TACATGTTCT GCTTCACCGC CTGGGGGCAG TGGGTGATCC TGACCCGGGT GTGGCTACCG CTGACTGGCC GGCTGCCCTG GAACACGATG GCCTTTCTGG AAGGCGCCTA CCGACGGGGC GTGCTCCGTC AGACCGGCGC GGTTTACCAG TTCCGCCACG TTCGGCTTCA ACAGCACCTG AGTCACTCGT ACTGCGAGCG GCGGCGGAGA AGTCGACGAA CTCCCGTTCA CTCCGAGGGC TGA
|
Protein sequence | MADQLGLLLR RLRNQAGLTQ EQVAERSGVS VRTIRRLESA RNIDHRLGTL NLLADALELG SEDRELLATM LARTSTPPTF AVHAASTETR PTASASGPPR APSRQTSVLV PARAVLDAAE TLAREAKRRW QREEEQRQVH DPFPLPLCWQ PAPAQLVDYA ANVQRLPPGA TPQLDLSGDM GSIAEVYRRI RSGRLVILGR AGSGKSIMLI RLVLDSLAAF TLPERVPMIF NLGSWDATAI TLRDWLIGQL LRDHPHLARR APGGPTLAAA LVDADLILPI LDGFDEIADG LRREALEAFN ATSLPLVLTS RRDEYAEAVR GAGAPLNWAA GIELVDLTLD DLAAYLPRTA RQAGRDDTVA VWDPVLKLLQ ATTCPASVNL TRVLSTPLMV VLARTMYSET PERDPAELLD ITRFPSAKSV EEHLLAGFVP TVYRPSVPDR EAGGFRQRTW NPHHAERWLG YLAHQLVRHG QDRRDLAWWQ IGDSLRRSTR IGTATLVSAL CTAVSAWMTG LVAGQVDPEQ ILVEGAMMGL SAGLAFGAVY AAITAFGGTF QPTHVRLRLR RRHSVVARPP IQTFTVRFAV VLLGGFVMGV GSACATALVR ARYWETPLAS LEVIRATLIN MLVFGLIFGL AAGLVFGLLA ALEVPVDVSF VATPVSLLSA NRATVSQQIL FLAPVLALTI AVGGRLVVDL LEGGVLGELR WAWPDAFLIG AAGGLGGALS YMFCFTAWGQ WVILTRVWLP LTGRLPWNTM AFLEGAYRRG VLRQTGAVYQ FRHVRLQQHL SHSYCERRRR SRRTPVHSEG
|
| |