Gene Shewmr4_1471 details


Gene Information

Locus tag: Shewmr4_1471
Symbol:
ID: 4252049
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella sp. MR-4
Kingdom: Bacteria
Replicon accession: NC_008321
Strand:
Start bp: 1721359
End bp: 1722378
Gene Length: 1020 bp
Protein Length: 339 aa
Translation table: 11
GC content: 49%
IMG OID: 638118070
Product: AraC family transcriptional regulator
Protein accession: YP_733606
Protein GI: 113969813
COG category: [K] Transcription
COG ID: [COG2207] AraC-type DNA-binding domain-containing proteins
TIGRFAM ID:
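
The coordinate and length fields above are consistent with one another, and a short check makes the arithmetic explicit. This is a minimal sketch that assumes 1-based, inclusive coordinates (the record does not state its convention) and a CDS ending in a single stop codon that is not counted in the protein length. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Start bp and End bp as listed in the record.
length = gene_length(1721359, 1722378)
print(length, protein_length(length))  # prints: 1020 339
```

Under these assumptions the result matches the Gene Length (1020 bp) and Protein Length (339 aa) fields.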


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000173794
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTACAGC GCGTCGACTA CAAGACGAAC CTGATGGTGG GTGATCAAAC CTATCCCGCA 
TTTGAGCTAC GTAATATTCT CGATTTTATT GCCTCGCAGT TAGGCGAAAA CGCCTTACAG
CAAGTCTGTG AGCATATCGG CGTTGGCCTA GCAGAACTCA ATCATTGCCA GTTTGTGTTT
GTGTGGCAGG TAGAATATGC GATGGAGTTT TTACGTTTGC AGGGCGGTGA CCCCGATATT
GGCACTAAAT TAGGCCTAAG CTATCGGGTG AGTAGTTTAG ATGTGCTCTT GCCCCATCTT
GCGCAGTTGA CTTCTCTACA GGCCTGCCTG CAGTTTGTGG TCAATCATCC CCAACTAGTT
GGCAGTTTTA CCGATACCTT AGTGCGCCTC GAAGAAGACT GTGTCTGCAT TCGTTGGCTC
AATACTGGCC GTATCGACAA AGCACAATAT GGATTTCAAT TTCTGCACAG CATAGGTTCG
CTGTTAGGGC TGGCGAGAGA GCTCACGGGG GAGGCGATAA CGCTTAGGCA AATCCATTTG
GCGGAACCTG TCCGCGATGA GCGTTTTTTA ACCCAGGCAA CGGGGGCTAG GGTGCAGTTT
AATTGCGAAT ATTATGAATG GAGCATTTCC CTCAACCAAC TCGCGCTTGC CATCCAGTAT
CCTTTCCCTG CCTACGCCGA AAAGACCGCT TCGGTATCAA CCACATCCTT TATCGAAACC
GTACTGGCGG CAATCAACGA GCATTTTCCC CAAGTGCTTC ATTTAGATGA CATGGCGACG
CAACTACATA TGAGCGATCG CAGCTTTCGG CGTAAGCTGG CACAGCTTGG CTCGAGTTAC
CAACGTTTAG TCGATCAGGT GCGTTGCCAA AGGGCGGTCG AGTTGATTTT AGCTAACGAG
TTGGATATTG AGGCGATTGC CGAGACCTTA GGCTACAGCG ATGTTAGCCA TTTCCGTCAA
TCTTTTAAGC ATTGGATTGG CCATCCGCCT GGGTATTTTA GCCGGCTAAA TGTTGGGTAA
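
The GC content field can be recomputed from the gene sequence once the spaces and line breaks that group it into blocks of 10 bases are removed. The sketch below is one way to do that. The helper names are hypothetical, and the example input is only the first two blocks of the sequence, so its result is not the whole-gene figure.

```python
def clean_sequence(text: str) -> str:
    """Strip the spaces and line breaks used to group the sequence into blocks."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First two blocks of the gene sequence, used only as a small example.
example = clean_sequence("ATGTTACAGC GCGTCGACTA")
print(len(example), gc_percent(example))
```

Passing the full gene sequence through `clean_sequence` and `gc_percent` gives a value that can be compared with the 49% listed in the record.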
 
Protein sequence
MLQRVDYKTN LMVGDQTYPA FELRNILDFI ASQLGENALQ QVCEHIGVGL AELNHCQFVF 
VWQVEYAMEF LRLQGGDPDI GTKLGLSYRV SSLDVLLPHL AQLTSLQACL QFVVNHPQLV
GSFTDTLVRL EEDCVCIRWL NTGRIDKAQY GFQFLHSIGS LLGLARELTG EAITLRQIHL
AEPVRDERFL TQATGARVQF NCEYYEWSIS LNQLALAIQY PFPAYAEKTA SVSTTSFIET
VLAAINEHFP QVLHLDDMAT QLHMSDRSFR RKLAQLGSSY QRLVDQVRCQ RAVELILANE
LDIEAIAETL GYSDVSHFRQ SFKHWIGHPP GYFSRLNVG
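
The protein sequence corresponds to a translation of the gene sequence under translation table 11. The following is a minimal sketch rather than the database's own method: it builds the codon table by hand, relying on the fact that table 11 assigns the same amino acids to codons as the standard code and differs only in its permitted start codons. It does not handle alternative start codons or ambiguous bases, and the names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (third base varies fastest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop block spacing and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGTTACAGC GCGTCGACTA CAAGACGAAC"))  # prints: MLQRVDYKTN
```

The output matches the first ten residues of the protein sequence. The same function can be applied to the full gene sequence to compare against all 339 residues.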