Gene Sde_3858 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSde_3858 
Symbol 
ID3967007 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSaccharophagus degradans 2-40 
KingdomBacteria 
Replicon accessionNC_007912 
Strand
Start bp4862147 
End bp4863328 
Gene Length1182 bp 
Protein Length393 aa 
Translation table11 
GC content44% 
IMG OID637922955 
ProductAraC family transcriptional regulator 
Protein accessionYP_529325 
Protein GI90023498 
COG category[K] Transcription 
COG ID[COG2207] AraC-type DNA-binding domain-containing proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATACTT TGCAACTTTC TATATACGCC ATGGCTTTGG GGTTGTGTGC CTTTACTGGG 
CTGCTTGCTT GGCGGGCGTC GTTTCGCAAT AGGTACTTTT TAGTATTTAT GACCCTACTG
GTGTTAATGC TAGGTTGCGA TTGGTTAATG CACCACCCTA GTACACCGCT TAAGAATTTA
TGGCTTGTTA TTTTAATGGC GAGTGCACTA TTGGTTGGCC CATGCGCGTT GCTATTAGCT
AATTCTGTAG GCAATGCGGA TTTGCACATT AATTGGCGCA GTCACGCAGT GTTGGTTATT
GCGGCTTGGT TACTGTTAAC GCCGCTGGCA ACGTCTATTC ACTTCGGCAC CGAATTCGTC
AATGCAGCTG CCCCTGTAAC CAAGGCGTAT GCTTTTTTTA TTCACACTGG TATGTCGCTT
ACGGTTGGTT TGTTTTTATT GCAAACCCTG TGGGTGCTGC GCACTTGCTA TGCCTTGCTG
CAGCGGCGCA ATGTACAAAA CAAGTGGCTG TTTTCTGAGT TAGCCGACCC CGGCCTAAAT
TTGTTGCGCA TTTTAGTGTT GGCCATTGTT ATTAATGCTG TGGTTTCTAT AGCAAAGGTG
CTTTATTGCG CCCTGTTAGA TGGGGTGTAT ATGCCCATTA ATATTGTTAT ATCTGGTATT
CATTTATTAA TGGCCATATT TTTGGCCAGC TCTTATATTA GTTTGGTTGT TGGGGCACAA
GGTAAGGCAG AAGCAATAAG GCAAACACTA TTTAAGCCAG AGGCACATCC AACCACACAA
AGTGATACCA CGGCCAGTAA AAATTTTGAG CCAAGCAGTA ATAGCAATAA CAAAACTACT
GCCGAACTTA CCGGAAAGCA GCAAGCGCTG CTAAAACAAA TTAAAGCGGC AATGGATGTA
GAGCATTTGT ATAAAAAACC CAGCTTAAGC CTACGCGATT TATGCGACCA CCTAAACGAA
AGCCCCCACA ATATATCGCA GGTAATTAAC GAAAGTGATT TAGGTAATTT TTACGATATG
GTGAATAGCC GCAGGGTGGC GCTGGCATCC CAGCTTTTAC AACAAAACCC ACAACGCACG
GTGTTGGATA TTGCTTTCGA TTGCGGGTTT AATTCTAAGT CTTCTTTTAA TAGCGTGTTT
AAGCGGTATA CGGGGGTAAC GCCTAGTCAG TACCGCGCTT GA
 
Protein sequence
MNTLQLSIYA MALGLCAFTG LLAWRASFRN RYFLVFMTLL VLMLGCDWLM HHPSTPLKNL 
WLVILMASAL LVGPCALLLA NSVGNADLHI NWRSHAVLVI AAWLLLTPLA TSIHFGTEFV
NAAAPVTKAY AFFIHTGMSL TVGLFLLQTL WVLRTCYALL QRRNVQNKWL FSELADPGLN
LLRILVLAIV INAVVSIAKV LYCALLDGVY MPINIVISGI HLLMAIFLAS SYISLVVGAQ
GKAEAIRQTL FKPEAHPTTQ SDTTASKNFE PSSNSNNKTT AELTGKQQAL LKQIKAAMDV
EHLYKKPSLS LRDLCDHLNE SPHNISQVIN ESDLGNFYDM VNSRRVALAS QLLQQNPQRT
VLDIAFDCGF NSKSSFNSVF KRYTGVTPSQ YRA