Gene Sde_1550 details

Gene Information

Locus tag: Sde_1550
Symbol:
ID: 3965078
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Saccharophagus degradans 2-40
Kingdom: Bacteria
Replicon accession: NC_007912
Strand:
Start bp: 1995367
End bp: 1996518
Gene Length: 1152 bp
Protein Length: 383 aa
Translation table: 11
GC content: 42%
IMG OID: 637920628
Product: AraC family transcriptional regulator
Protein accession: YP_527024
Protein GI: 90021197
COG category: [K] Transcription
COG ID: [COG2207] AraC-type DNA-binding domain-containing proteins
TIGRFAM ID:
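
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive (the page does not state the convention) and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1995367, 1996518)
print(length)                  # 1152
print(protein_length(length))  # 383
```

Under these assumptions both values agree with the Gene Length and Protein Length fields of the record.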


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000434853
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATGAAT GGCTAAAGTT TTTATTTATT TGCTCCCTTT GCTTGGGGGC GTGGACGAGC 
GTGAACCTTT GGGTGAATCA CGCTGGCCCT AAAAAAGTAC GCCAGTTTAC GGGCGTATTT
GTGCTTGTAT TACTTGTTCC CCCTTTGGTT GGTTATTTGC AGCTGATTAG TGCAGAAATA
CCTGCATTTT TTTCGTTTGT GCGTTCCACG CTTACATGGT GGTATGGGCC GCTTATGTAT
TTTATTGCGC GTGAAATGTT GTTATTACCA AATACACCGC GTGGCATAGC GAACCATATG
TGCGCAGTTT GTGGTTTGTT TGTGGTCACC CAATTATTAA TAAGTAGTGC AATACCCTAT
GGTTATTTCG TTATAGTAAC CGCTGTGGTG GCTGCCTATT GTACGCTTGC GGGCCATACC
TTAGTAAAAA ACGCATCGCG CTTACGCAGG CTTAATAGTA GCTATCGAAA ATCTACTTTT
TATTGGTTGC AATATTTATT GGCAGGCTTG TTATTGCTGT GTGCAATGGA TATAGGTGTG
CTAGTAGCTC TACACTCAAA TGTGCATTTA GACTTTTTAG CGCTCAATAG CATTGCATGC
GTATTTGCTA TTTATGTAAA CGGTATAGTG TTATTTACTC TAATTAAACC CGCGCTTTTT
GAACTAGATG ACGCACAAAC AATTGAACAC GTACAAACCA ATACAGGCGA GCAGCATAAA
ATAAGCGAAG TCGCCGCAGA CGAATCACCA GCAGCCAAGA ATAATGTGCG TTACTTGGAG
TTGAGTGATC AAGTTGCAAC TACGCTAATT AATACCCTTG CAACAATAAT GGAAACTGAT
AAGCCCCACT TAGAGCCAGA CGAAAATTTA ACAAGCATGG CTGGGCGGTT GGGAATAACC
ACGCACATGT TTTCTGAGCT TTTAAACGTA CACTTACATA CCAACTTTTA CGATTGGATG
AATAGCTATC GCTTTAACGC CGCGCTGTTA TTACTGCAAG ATCAAACCGT AAATTACTCC
GTAACCGATA TTGCTTTTCA GGCAGGGTTT AATAATAGGA ATAGTTTTTA CCGTGTGTTC
AAATCGAACT TAGGCATTAC ACCCGCGCAG TATCGCAAAC AGTATAAAGC CGAATTGCAA
AAGCAGGCCT AG
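
The gene sequence is printed in blocks of 10 bases, so it has to be joined before any computation. The following is a minimal sketch of how the blocks could be concatenated and a GC fraction computed. The helper names are hypothetical, and the short example string is only the first two blocks of the sequence, not the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


example = "ATGGATGAAT GGCTAAAGTT"          # first two 10-base blocks only
print(len(clean_sequence(example)))        # 20
print(round(gc_fraction(example), 2))      # 0.35
```

Passing the full sequence text to `gc_fraction` would give a value to compare with the GC content field of the record.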
 
Protein sequence
MDEWLKFLFI CSLCLGAWTS VNLWVNHAGP KKVRQFTGVF VLVLLVPPLV GYLQLISAEI 
PAFFSFVRST LTWWYGPLMY FIAREMLLLP NTPRGIANHM CAVCGLFVVT QLLISSAIPY
GYFVIVTAVV AAYCTLAGHT LVKNASRLRR LNSSYRKSTF YWLQYLLAGL LLLCAMDIGV
LVALHSNVHL DFLALNSIAC VFAIYVNGIV LFTLIKPALF ELDDAQTIEH VQTNTGEQHK
ISEVAADESP AAKNNVRYLE LSDQVATTLI NTLATIMETD KPHLEPDENL TSMAGRLGIT
THMFSELLNV HLHTNFYDWM NSYRFNAALL LLQDQTVNYS VTDIAFQAGF NNRNSFYRVF
KSNLGITPAQ YRKQYKAELQ KQA
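
The protein sequence can be related back to the gene sequence by translation. The record lists translation table 11, which uses the same codon-to-amino-acid assignments as the standard genetic code and differs in which codons may act as start codons. The sketch below is an illustrative translator built on that assumption. It does not handle alternative start codons or ambiguous bases, and the names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon assignments in TCAG order; shared by the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a block-formatted DNA sequence, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":          # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGGATGAAT GGCTAAAG"))  # MDEWLK
print(translate("AAGCAGGCCT AG"))        # KQA
```

The two example inputs are the first 18 bases and the last 12 bases of the gene sequence. Their translations match the beginning (MDEWLK) and end (KQA) of the protein sequence.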