Gene Sde_1391 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSde_1391 
Symbol 
ID3968660 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSaccharophagus degradans 2-40 
KingdomBacteria 
Replicon accessionNC_007912 
Strand
Start bp1796779 
End bp1797918 
Gene Length1140 bp 
Protein Length379 aa 
Translation table11 
GC content41% 
IMG OID637920467 
ProductAraC family transcriptional regulator 
Protein accessionYP_526865 
Protein GI90021038 
COG category[K] Transcription 
COG ID[COG2207] AraC-type DNA-binding domain-containing proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones52 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCGATC GAATTTTTAA TGTATACGAT GTTTTCTTGA TTATTGCAGT GTTTGAAGCA 
TTGTTATTAG CTGTTTTTCG GATGGTGCTG CCGGCCAGCA ATCGCAGTGG TAAATGGGGT
AGTTATTTTC TCGCTGCTTT TTTAATCGTT GTTTCCGTCG ATTTCATCAC CGGCTTGCTA
ATGTGGAATG ACGCCATTCC CTTATCGCAA TCCTTTTACT CTAATGGTTT GGTGCTGCTA
TTTACATTCA GTCATTTTGC CCGTGGTCCG TTATTTTATT TTTATGTGCG AGCGTTGCTG
TTTTCTGGTT TGCAATTGCG AGCAAGGCAT TGCATTCACG CTTTACCGGC GTTGCTGGCT
GTTGTGGGTG TGTGTGTATT TGGTATTACT ACAGAAGACT TGCAGTCGCG ATCTGGCGAT
GTGGATACTA CCCAAATGGC GAGTGTTATC TGGTATGCGT CTAATGGTTT GTCCATTTTT
TACGCGGCGT ATGCGTTGGT GTGGCTGCAA AAATATTTAC AGCGTTTAAA AGAGCAGTTC
TCTAGTATTT CGAGCATAGA AATTAGTTGG TTAATGGTGC TTAGTGGCTG TTTTTTAATT
AGTTGGAGCT GGTCTATTTT AATAAACTTA AGTGCAGATC TAATTGGTGG TGGTTTTGCT
GATGCGATGG GCACCAGTCA CAATTTAATA CGTTTTATGC TAATGAACGG GCTGGTATTT
TATAGCTTGG TATGCACCAG TAAAATTGTG AATGTACGTT ATAGCGAGCC GAAGCAAATA
GCAGTAAAAG ACTCTAAAGA TGTGGTTGAA CAAATTGAGA GGGGTATCCA TGAGTTGCAG
TTACATTTGT ACCCAAGTAT CAATATTGAT CAATACGCCG AGAAAATAGG TGTAAATGCT
AAAGCAGTGT CTAACGCTTT AAATAGAGAC CTTAAAACAC GCTTTTTCGA GTTTATTAAC
GCGCATCGGG TAGAGGAGGC TAAGCGCCTG TTGGAGGATA GTAGTAAAAA AAATTTGTCC
ATTGCACAAA TTTATACTGC TGCAGGGTTC AATAATAAAT CGTCATTTCA TCGGTTTTTT
AGTCGTTTAG TGGGTATGTC GCCTTCAGAG TATCGGCAAC AAGCACAGCG CGGTAAATAA
 
Protein sequence
MGDRIFNVYD VFLIIAVFEA LLLAVFRMVL PASNRSGKWG SYFLAAFLIV VSVDFITGLL 
MWNDAIPLSQ SFYSNGLVLL FTFSHFARGP LFYFYVRALL FSGLQLRARH CIHALPALLA
VVGVCVFGIT TEDLQSRSGD VDTTQMASVI WYASNGLSIF YAAYALVWLQ KYLQRLKEQF
SSISSIEISW LMVLSGCFLI SWSWSILINL SADLIGGGFA DAMGTSHNLI RFMLMNGLVF
YSLVCTSKIV NVRYSEPKQI AVKDSKDVVE QIERGIHELQ LHLYPSINID QYAEKIGVNA
KAVSNALNRD LKTRFFEFIN AHRVEEAKRL LEDSSKKNLS IAQIYTAAGF NNKSSFHRFF
SRLVGMSPSE YRQQAQRGK