Gene Sde_3153 details

Gene Information

Locus tag: Sde_3153
Symbol:
ID: 3965587
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Saccharophagus degradans 2-40
Kingdom: Bacteria
Replicon accession: NC_007912
Strand:
Start bp: 4029857
End bp: 4030951
Gene Length: 1095 bp
Protein Length: 364 aa
Translation table: 11
GC content: 45%
IMG OID: 637922250
Product: AraC family transcriptional regulator
Protein accession: YP_528622
Protein GI: 90022795
COG category: [K] Transcription
COG ID: [COG2207] AraC-type DNA-binding domain-containing proteins
TIGRFAM ID:
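
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end coordinates are both inclusive and that the gene length includes one terminal stop codon. The function names gene_length and protein_length are hypothetical helpers, not part of any database API.

```python
# Values copied from the record above.
START_BP = 4029857
END_BP = 4030951


def gene_length(start, end):
    """Length in bp of a feature, assuming both coordinates are inclusive."""
    return end - start + 1


def protein_length(cds_length):
    """Residues encoded by a CDS, assuming it is complete and ends in one stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1  # the stop codon encodes no residue


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints "1095 364"
```

Under those assumptions the computed values agree with the reported Gene Length (1095 bp) and Protein Length (364 aa).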


Plasmid Coverage information

Num covering plasmid clones: 38
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTTGCGC ACCTAAGCTT AAGATTTTTT CTGTTATATC TTCTACTTAA AGCTTTGAAC 
TTTGGCTTTG AATGGTTGCT AGTGAACCCC GATACCCCCT ACAAAGCCGC ATGGCTGGCT
TTGCTTATGG CCAGTTCATT TTTGATGGCG CCGTGCGTGT GGTTGCTTGC ACGGGAAATT
GATCGCAATG CGCCACCGCG CTTATGGCCA ATTGCCTGGG GCCAATGCGC GGTTGTATTG
GCGGGCTTTA TATTATGTAT ACCGCTATTT TTGGCAGCCA ATCAAAGCGT TTTGCTAGTA
GATCCATCCC GTGCGAAACC CCACTGGTTC AATCTCACCC ATACCACCAT GGTAGGCGCA
GTATTGCTTT ATCTCGTCCA AGTCCCCTGG TATTTATCGC GCAGCGTAAG CCTATTTCGC
GAACGCCTAC GCATTAACAA GTTTTTATTT TCTAACATTG ATGAGCCGGC CCTTAACGCT
TTGCGCGCTT TAATTTGGGT TATGGCTGCA AGCTGGCTGT TTAACTTACT ACGCATGCTG
CATACGATGA TTTTAGAGCC ATCACAAGTA TGGAACCTGC TAATAAGCGC CTGTGAAATA
GGGGTAACCA TTACGGCGCT GTACGTTATT TTTAAACGCT GTTGGCAATA CAGTGTTGAC
GATCAAACTA TGGTTGAATC CGTAAGCCCC GAGCTAAAAG AACAGGCTTC GCTCCTACAG
GGCGACAAGT GCGCCAAGTA CGCCAAGTCG TCGCTTGATC AAACTACACG AACTCGGGTA
GCTAAAAAGA TTCTAGCTCA GTTTGAAGAA GAGAAAATAT ACCGTAAGAA TGGCCTAAAG
CTGCAAGATT TATGCGTTGC CACAAATGAG AGCGCCCACT ATATATCACA GGTAATAAAC
CAAGAGCTGG GGTTCAGCTT TTTTGATTTA GTGAACAAAT ATCGAATTGA AGAGGCACAA
CAGAAGCTAA AACAAAATCG CGACCTACCT ATTTTAGATA TTGCCCTAGA GGTTGGGTTT
AATTCGAAAC CTACTTTTAA TAAAGCCTTT AAACTGCGAG TTGGGCAAAC ACCCAGTGAA
TTCCGAGCAA AGTAG
 
Protein sequence
MFAHLSLRFF LLYLLLKALN FGFEWLLVNP DTPYKAAWLA LLMASSFLMA PCVWLLAREI 
DRNAPPRLWP IAWGQCAVVL AGFILCIPLF LAANQSVLLV DPSRAKPHWF NLTHTTMVGA
VLLYLVQVPW YLSRSVSLFR ERLRINKFLF SNIDEPALNA LRALIWVMAA SWLFNLLRML
HTMILEPSQV WNLLISACEI GVTITALYVI FKRCWQYSVD DQTMVESVSP ELKEQASLLQ
GDKCAKYAKS SLDQTTRTRV AKKILAQFEE EKIYRKNGLK LQDLCVATNE SAHYISQVIN
QELGFSFFDL VNKYRIEEAQ QKLKQNRDLP ILDIALEVGF NSKPTFNKAF KLRVGQTPSE
FRAK
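
The relationship between the gene sequence and the protein sequence can be sketched as a codon-by-codon translation. This is a minimal illustration, not the pipeline that produced the record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in permitted start codons). The names clean, translate and gc_content are hypothetical helpers introduced here.

```python
# Codon table in NCBI order (bases T, C, A, G); "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to format the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE.get(dna[i:i + 3], "X")  # "X" for ambiguous codons
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_content(dna):
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence above.
print(translate("ATGTTTGCGC ACCTAAGCTT AAGATTTTTT"))  # prints "MFAHLSLRFF"
# Last line of the gene sequence above, ending in the TAG stop codon.
print(translate("TTCCGAGCAA AGTAG"))  # prints "FRAK"
```

The two outputs match the first ten and last four residues of the protein sequence listed above. Passing the full gene sequence to gc_content would give a value to compare with the reported GC content.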