Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sbal195_2940 |
Symbol | |
ID | 5754718 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella baltica OS195 |
Kingdom | Bacteria |
Replicon accession | NC_009997 |
Strand | - |
Start bp | 3485150 |
End bp | 3486847 |
Gene Length | 1698 bp |
Protein Length | 565 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641289251 |
Product | AraC family transcriptional regulator |
Protein accession | YP_001555366 |
Protein GI | 160876050 |
COG category | [F] Nucleotide transport and metabolism [L] Replication, recombination and repair |
COG ID | [COG0122] 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase [COG2169] Adenosine deaminase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.91789 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0713336 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTACAG ATAGATCGTC TGAGAGTGCG ATTAAGCGTG ATGAAATAGA CTCAGTGCAT TCACTGGAAT CCAGTGCGGG GCTACTTTCT GCGTCTATTT GCCGTGAGGC GAGGATGAGC CGCGATCCGC GTTTTGATGG TAAGTTTTTT GTGGGCGTAT TAACGACGGG CATTTATTGC CGTACCGTGT GTCCCGCCGT CGCGCCAAAA GAGCAGAATG TGCGTTATTT TGATTCGGCC GTGAAAGCCG CACAGGCGGG ATTACGCCCG TGTTTGCGTT GTCGGCCCGA CAGTGCGCCA GGTTCAAATC CGTGGAAAGG CACAGGCACG ACATTGGCGA GGGCCATTAG TTTAATTGAA GCTGGCGCAT TAGCGGGGAA ATCATCGGGA GAGCCTGCCT CGACTGTGAC ACAATTGGCT GACAGATTAG GGATCAGTAG TCGTTATCTC AATAAGCTCT TTACCGAAGG TTTTGGCACT TCGCCTAAAC AATATGCCTT ATACAGGCAG TTACTGTTTG CAAAACAACT GCTACATCAA ACCCAATTAC CTATCACCCA AGTCGCGCTG GCGGCGGGTT TCAACAGTAT TCGCCGTTTC AATGAGGCAT TTAAGCAAAC CTTACAATTG ACGCCAACGC AATTAAGAAA GACCGCTTTA ACCAAGGCGG CAAAGGAGCA TGTTGTTGGA CGAACAACAG GATTCGGTGA AATTGATACT TTGGGTGAAT TTGTGCCGCA CTATGGATTA ACCCTATATC AATACTATCG CCCCCCATTG GATTGGGCAT CGCAGCTGGC ATTTTATCGC TTACGTGCGG TGACAGGGAT GGAGTGGTTC ACGCCGCAAA TGAGTCACCC ACAAGCGAGT GATGCAGTTC AGATTGCAGA TGAAACCAAT GTTTCAGCAG AAGCCAAGGC TGACGAGAAC GGGCTCGAAT ATGGTCGCTG TTTTGCCATC GGCAAGATGC GCGGCACAGT GCACATTATC CATGAGCCTA AGCTGAATCG TTTCAAACTT GCGATTGCTT TGACCGAAGA TTCCGCCGTT GATGAGCTGC AATTGATGGT CACTGAGGTG CGGCGCATTT TAGATCTCGA TGCGGATATG CAGCAGATTG AGCAAGGTTT AAGTACGCTG CCGAGTTTAG GTTTAACACC GTTTTCGGGA TTGCGCATTC CTGGCGCAGG ATCTTTGTTT GAAGCGGTTT GCCGCGCGAT TTTGGGCCAG CAAGTGACGG TCGTGCAGGC AACTAAGTTA CTGAATATAT TGGTTGAAGC CTATGGCGAG TGTTTTAGTC TCAATGGCCG CGAGTATCGA TTATTCCCAA CACCCGAGGC GATTCGTGAG GCGAGTTTAA CTGAGCTAAA AATGCCGGGC GCCCGTAAAT TAGCGCTCAA TGCGCTCGCG GCGTTTATCT GCGAACATCC TGAGGCGAGT GTCGATGATT GGCTTAGCGT TAAAGGCATA GGTCCTTGGA CGATAGCGTA TGCCAAGCTT CGTGGCTTGG GCGATCCAAA CGTGTTTTTA CATTCGGATT TGATTGTTAA AAAGCATTTA TTGGCCTTAT ATATCAAGAA CAATAAGTTA GATGAAACGG CCGCCGCGGC AGTGAGCTAT CCCCAATTGT GCGAGCAGCT GAGTCAACAA ATTGCCCCGT GGGGCAGTTA CTTAACCTTT CAGCTCTGGC ATCAATAA
|
Protein sequence | MSTDRSSESA IKRDEIDSVH SLESSAGLLS ASICREARMS RDPRFDGKFF VGVLTTGIYC RTVCPAVAPK EQNVRYFDSA VKAAQAGLRP CLRCRPDSAP GSNPWKGTGT TLARAISLIE AGALAGKSSG EPASTVTQLA DRLGISSRYL NKLFTEGFGT SPKQYALYRQ LLFAKQLLHQ TQLPITQVAL AAGFNSIRRF NEAFKQTLQL TPTQLRKTAL TKAAKEHVVG RTTGFGEIDT LGEFVPHYGL TLYQYYRPPL DWASQLAFYR LRAVTGMEWF TPQMSHPQAS DAVQIADETN VSAEAKADEN GLEYGRCFAI GKMRGTVHII HEPKLNRFKL AIALTEDSAV DELQLMVTEV RRILDLDADM QQIEQGLSTL PSLGLTPFSG LRIPGAGSLF EAVCRAILGQ QVTVVQATKL LNILVEAYGE CFSLNGREYR LFPTPEAIRE ASLTELKMPG ARKLALNALA AFICEHPEAS VDDWLSVKGI GPWTIAYAKL RGLGDPNVFL HSDLIVKKHL LALYIKNNKL DETAAAAVSY PQLCEQLSQQ IAPWGSYLTF QLWHQ
|
| |