Gene Shewmr4_2080 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShewmr4_2080 
Symbol 
ID4252653 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sp. MR-4 
KingdomBacteria 
Replicon accessionNC_008321 
Strand
Start bp2475513 
End bp2476736 
Gene Length1224 bp 
Protein Length407 aa 
Translation table11 
GC content51% 
IMG OID638118704 
ProductAraC family transcriptional regulator 
Protein accessionYP_734210 
Protein GI113970417 
COG category[K] Transcription 
COG ID[COG2207] AraC-type DNA-binding domain-containing proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCGCGC TCGCCACATG GCAGGATAAA TGTATCGATA GCCAATTGCT GGTGGCAAGT 
TTAGTGATGT TATTTAAGCA GCGTGAATTG GATACTGATA AGTTACTGCG CGGGACGGGG
ATTTTTATCG CGGATATTCG TAAACCCGAT CACCTTATTA GTCCCAAGCA ATTATCACGA
CTGCTGGATA ATGCGCTCTC CCTGTGGCCA AGCGGCGATT TAAGTTTTTT ATTGGGTCAA
CGTTGGTTGC CATCCCAAAG CGGCGCGCTG ACGGCGGCCA TGTTTTGCGC GCAGGATTTA
CAGGCATTAA GCCGCGTTTG GCATAAGTAT CATTGGCTCA CTCAGCCTTG GCTGCAAACT
TGGTGCTGCC AAGGCGAGCA TGAGTGGCAT TATCTCTTGA GTCTAGACTT AGGTATGCAG
CGCCATCGGC AGTTTTTGAT TGAGCTGAGT CTGTCGTCGC TTACCGCAGC CTGTAAGCAG
CTGCTTGGGC AAGCTTGGCG CGGCAGTCTG TGCTTTCCCT ACCGCGCGCC GGATAATTTG
GCCCATTACT ATAAATACTT TGGTACCGAC TTAAGTTTCG ATGCGCCCCT TTGCCGGATT
TCGATGACTA AAGCCCAGTT ACGCCAACCC TCATTATTTG CGCGCCAGAG CACGGCCTAT
GCCTTATCAA CTGGGGAAGA GAGGGATGAC AATCTCATTC ATAATGGTGT GATTCATCAT
GGCGCTAGTC ATTCTGTGGG TGACTCTTTA GCCATTGCAT CACAAGCAAT AGGAACTCAA
GCCACAGCGT CTCAAACTAT TGTATCTCAA CAGAATAGGC TGGATCAGGC GCCAAATATA
ATGTCAGCGT CGAATATCGC TCCTGCGCCG AATATGACCC AGGCGGCTCG CCAAGCGTTG
TTAACACACG AGTTTCGCCT TGGGTTGGCG GGAGGGATCC GCCTTAAACT GATGCGCAGC
ACTCTTTCCT TGCCAGAGAT GGCACAATCG CTGGAGATGA GCCCCGCAAC CTTAAAACGC
CGCCTGACGG AGATGGGCTT AAGTTACGGG CAACTTGCCG ACGAAACGCG TTTGATCCGG
GCGCTGTATT TTTTAGCCGA ACCCGAGCAA GACGCGCACA CGATTGCTAA TTCTCTATCC
TTCAGTGATG CCAGTAATTT CAGGCGTTCC TTTAAACGTT GGACAGGTCA ACTGCCTGCG
TATTTCCGCT TTTGGTTAGC CTAG
 
Protein sequence
MRALATWQDK CIDSQLLVAS LVMLFKQREL DTDKLLRGTG IFIADIRKPD HLISPKQLSR 
LLDNALSLWP SGDLSFLLGQ RWLPSQSGAL TAAMFCAQDL QALSRVWHKY HWLTQPWLQT
WCCQGEHEWH YLLSLDLGMQ RHRQFLIELS LSSLTAACKQ LLGQAWRGSL CFPYRAPDNL
AHYYKYFGTD LSFDAPLCRI SMTKAQLRQP SLFARQSTAY ALSTGEERDD NLIHNGVIHH
GASHSVGDSL AIASQAIGTQ ATASQTIVSQ QNRLDQAPNI MSASNIAPAP NMTQAARQAL
LTHEFRLGLA GGIRLKLMRS TLSLPEMAQS LEMSPATLKR RLTEMGLSYG QLADETRLIR
ALYFLAEPEQ DAHTIANSLS FSDASNFRRS FKRWTGQLPA YFRFWLA