Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Shewmr4_1471 |
Symbol | |
ID | 4252049 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella sp. MR-4 |
Kingdom | Bacteria |
Replicon accession | NC_008321 |
Strand | + |
Start bp | 1721359 |
End bp | 1722378 |
Gene Length | 1020 bp |
Protein Length | 339 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 638118070 |
Product | AraC family transcriptional regulator |
Protein accession | YP_733606 |
Protein GI | 113969813 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0000173794 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTACAGC GCGTCGACTA CAAGACGAAC CTGATGGTGG GTGATCAAAC CTATCCCGCA TTTGAGCTAC GTAATATTCT CGATTTTATT GCCTCGCAGT TAGGCGAAAA CGCCTTACAG CAAGTCTGTG AGCATATCGG CGTTGGCCTA GCAGAACTCA ATCATTGCCA GTTTGTGTTT GTGTGGCAGG TAGAATATGC GATGGAGTTT TTACGTTTGC AGGGCGGTGA CCCCGATATT GGCACTAAAT TAGGCCTAAG CTATCGGGTG AGTAGTTTAG ATGTGCTCTT GCCCCATCTT GCGCAGTTGA CTTCTCTACA GGCCTGCCTG CAGTTTGTGG TCAATCATCC CCAACTAGTT GGCAGTTTTA CCGATACCTT AGTGCGCCTC GAAGAAGACT GTGTCTGCAT TCGTTGGCTC AATACTGGCC GTATCGACAA AGCACAATAT GGATTTCAAT TTCTGCACAG CATAGGTTCG CTGTTAGGGC TGGCGAGAGA GCTCACGGGG GAGGCGATAA CGCTTAGGCA AATCCATTTG GCGGAACCTG TCCGCGATGA GCGTTTTTTA ACCCAGGCAA CGGGGGCTAG GGTGCAGTTT AATTGCGAAT ATTATGAATG GAGCATTTCC CTCAACCAAC TCGCGCTTGC CATCCAGTAT CCTTTCCCTG CCTACGCCGA AAAGACCGCT TCGGTATCAA CCACATCCTT TATCGAAACC GTACTGGCGG CAATCAACGA GCATTTTCCC CAAGTGCTTC ATTTAGATGA CATGGCGACG CAACTACATA TGAGCGATCG CAGCTTTCGG CGTAAGCTGG CACAGCTTGG CTCGAGTTAC CAACGTTTAG TCGATCAGGT GCGTTGCCAA AGGGCGGTCG AGTTGATTTT AGCTAACGAG TTGGATATTG AGGCGATTGC CGAGACCTTA GGCTACAGCG ATGTTAGCCA TTTCCGTCAA TCTTTTAAGC ATTGGATTGG CCATCCGCCT GGGTATTTTA GCCGGCTAAA TGTTGGGTAA
|
Protein sequence | MLQRVDYKTN LMVGDQTYPA FELRNILDFI ASQLGENALQ QVCEHIGVGL AELNHCQFVF VWQVEYAMEF LRLQGGDPDI GTKLGLSYRV SSLDVLLPHL AQLTSLQACL QFVVNHPQLV GSFTDTLVRL EEDCVCIRWL NTGRIDKAQY GFQFLHSIGS LLGLARELTG EAITLRQIHL AEPVRDERFL TQATGARVQF NCEYYEWSIS LNQLALAIQY PFPAYAEKTA SVSTTSFIET VLAAINEHFP QVLHLDDMAT QLHMSDRSFR RKLAQLGSSY QRLVDQVRCQ RAVELILANE LDIEAIAETL GYSDVSHFRQ SFKHWIGHPP GYFSRLNVG
|
| |