Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sbal223_1557 |
Symbol | |
ID | 7088247 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella baltica OS223 |
Kingdom | Bacteria |
Replicon accession | NC_011663 |
Strand | + |
Start bp | 1824050 |
End bp | 1825747 |
Gene Length | 1698 bp |
Protein Length | 565 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 643460458 |
Product | transcriptional regulator, AraC family |
Protein accession | YP_002357485 |
Protein GI | 217972734 |
COG category | [F] Nucleotide transport and metabolism [L] Replication, recombination and repair |
COG ID | [COG0122] 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase [COG2169] Adenosine deaminase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.486878 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.00513105 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGTACAG ATAGAGCGTC TGAGAGTGCG ATTAAGCGTG ATGAAATAGA CTCAGTGCAT TCACTGGAAT CCAGTGCGGG GCTACTTTCT GCGTCCATTT GCCGTGAGGC GAGGATGAGC CGCGATCCGC GTTTTGATGG TAAGTTTTTT GTGGGCGTAT TAACGACGGG CATTTATTGC CGTACCGTGT GTCCCGCCGT CGCGCCAAAA GAGCAGAATG TGCGTTATTT TGATTCGGCC GTGAAAGCCG CACAAGCTGG ATTACGCCCG TGTTTGCGTT GTCGGCCCGA CAGTGCGCCA GGTTCAAATC CGTGGAAAGG CACTGGCACA ACGTTGGCGA GGGCCATTAG TTTAATTGAA GCGGGCGCAT TAGCGGGGAA ATCATCGGGA GAGCCAGCGT CGACTGTGAC ACAATTGGCT GACAGATTAG GGATCAGTAG TCGTTATCTC AATAAGCTCT TTACCGAAGG TTTTGGCACT TCGCCTAAAC AATATGCCTT ATACCGGCAG TTACTGTTTG CAAAACAACT GCTACATCAA ACCCAATTAC CTATCACCCA AGTCGCGTTG GCGGCGGGTT TCAACAGTAT TCGCCGTTTC AATGAGGCAT TTAAGCAAAC CTTACAATTG ACGCCAACTC AATTAAGAAA GACCGCGTTA ACTAAGGCGG CAAAGGAGCA GGATATTGAT TCATCAACAG GATTCGGTGA AATTGATACT TTGGGTGAAT TTGTGCCGCA CTATGGATTA ACCTTATATC AATACTATCG ACCCCCATTG GATTGGGCAT CGCAGCTGGC ATTTTATCGC TTACGTGCAG TGACAGGAAT GGAGTGGTTT ACGCCGCAAA TGAGTCATCC ACAAGCGAGC GATGCAGTTC AGATTGCAGA TGAAACCTAT GTTTCAGCAG AAGCCAAGGC TGACGATAAC GGGCTCGAAT ATGGTCGCTG TTTTGCCATA GGCAAGATGC GCGGCACAGT GCAAATTATC CATGAACCTA AGCTAAATCG CTTCAAACTT GCGATTGCTT TGACCGAAGA TTCCGCCGTC GATGAGTTGC AATTGCTGGT CACTGAGGTG CGACGCATTT TAGATCTCGA TGCGGATATG CAGCAGATTG AGCAAGGTTT AAGTACGCTG CCGAGTTTAG GTTTAACACC GTTTTCGGGA TTGCGCATTC CTGGCGCAGG ATCTTTGTTT GAAGCGGTTT GCCGCGCGAT TTTGGGCCAG CAAGTGACGG TCGTGCAGGC AACTAAGTTA CTGAATATAT TGGTTGAAGC CTATGGCGAG CGCTTTAGTC TCAATGGCCG CGAGTATCGA TTATTCCCAA CACCCGAGGC GATTCGTGAG GCGAGTTTAA CTGAGCTAAA AATGCCGGGC GCCCGTAAAT TAGCCCTCAA TGCGCTCGCG GCGTTTATCT GCGAACATCC TGAGGCGAGT GTCGATGATT GGCTTAGCGT TAAAGGCATA GGTCCTTGGA CGATAGCGTA TGCCAAGCTT CGTGGCTTGG GCGATCCAAA CGTGTTTTTA CATTCGGATT TGATTGTTAA AAAGCATTTA TTGGCCTTAT ATATCAAGAA CAATAAGTTA GATGAAACGG CAGCCGCAGC AGTGAGCTAT CCCAAATTGT GCGAGCAGCT GAGTCAACAA ATTGCCCCGT GGGGCAGTTA CTTAACCTTT CAGCTCTGGC ATCAATAA
|
Protein sequence | MSTDRASESA IKRDEIDSVH SLESSAGLLS ASICREARMS RDPRFDGKFF VGVLTTGIYC RTVCPAVAPK EQNVRYFDSA VKAAQAGLRP CLRCRPDSAP GSNPWKGTGT TLARAISLIE AGALAGKSSG EPASTVTQLA DRLGISSRYL NKLFTEGFGT SPKQYALYRQ LLFAKQLLHQ TQLPITQVAL AAGFNSIRRF NEAFKQTLQL TPTQLRKTAL TKAAKEQDID SSTGFGEIDT LGEFVPHYGL TLYQYYRPPL DWASQLAFYR LRAVTGMEWF TPQMSHPQAS DAVQIADETY VSAEAKADDN GLEYGRCFAI GKMRGTVQII HEPKLNRFKL AIALTEDSAV DELQLLVTEV RRILDLDADM QQIEQGLSTL PSLGLTPFSG LRIPGAGSLF EAVCRAILGQ QVTVVQATKL LNILVEAYGE RFSLNGREYR LFPTPEAIRE ASLTELKMPG ARKLALNALA AFICEHPEAS VDDWLSVKGI GPWTIAYAKL RGLGDPNVFL HSDLIVKKHL LALYIKNNKL DETAAAAVSY PKLCEQLSQQ IAPWGSYLTF QLWHQ
|
| |