Gene Sde_1661 details

Gene Information

Locus tag: Sde_1661
Symbol:
ID: 3965138
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Saccharophagus degradans 2-40
Kingdom: Bacteria
Replicon accession: NC_007912
Strand:
Start bp: 2129078
End bp: 2130079
Gene Length: 1002 bp
Protein Length: 333 aa
Translation table: 11
GC content: 47%
IMG OID: 637920742
Product: AraC family transcriptional regulator
Protein accession: YP_527133
Protein GI: 90021306
COG category: [K] Transcription
COG ID: [COG2207] AraC-type DNA-binding domain-containing proteins
TIGRFAM ID:
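
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic, and a quick check can confirm they agree. The sketch below is illustrative only: the function name `cds_lengths` is hypothetical, and it assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon,
    which is counted in the gene length but not in the protein length.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1  # drop the stop codon
    return gene_length, protein_length


# Coordinates taken from the record above.
print(cds_lengths(2129078, 2130079))  # (1002, 333)
```

Both values match the Gene Length (1002 bp) and Protein Length (333 aa) fields listed in the record.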


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000000000279685
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACCGTTT TGAAGCTTGT CCAAAGTAGT AGTCTCCCGG AGGCAGAAGA AGTTTATTTG 
CCGAGTCAAG ACAGTTACGA TATGAAGAGC CCGTTGGAGG CAATGGTCTA TTCATATCGT
ATTCTAGCCC ACAGTTATAA ATACGATTCC TTCAACCAAG CGATAACCAG CCTGCTTGAT
GCCGGTATTA AAAAGCTCGG CATGGAAACA GCATTGCTTA CGCGCCCAGT TTCGGCCCAA
ATTTTTGAGG TTGTAGCCTG TGGTGGTAAG TGTGAAGGTT TCTATGTTGG TCAGCATTTA
AGCCTGCAAG AAACACCTTG TTTGACTGTA TTTCAGAAAA ATGAGACGTG TGCATACACC
AATGTTGAGC GGATGTGTGG CAAAGTGCCA GCTATGGCAT ATAATCAAAC ACAAGTAGGT
GCGTATCTCG GAACCTACGT GCAGCCCCAC TTCGCTGAGC CGGGTGTAAT GTGTTTTACG
GCGCCAGAAG CGAGGCTAAC GGAGTTTAGT GCCGAGGACG TGGTATTTAT CGAACTATTG
GCTGAAGGAG TGGCCTTTAT GACTGATCAA CTAAGAGCGC AAGCTCAACG TAAGTTAACT
GACCAGGCGA TGTTTGCCCT GGGTTCTGTG AAAACATTGG ATGAGTATCT CGAACAGGCA
AGGTTGCCTG AGGTGTTTGG GGTGCCCGCA AGAGTAGTGG AGGTGCTTCA GCGCCGAATT
GGTCATGCTC CCCTAAGTAT TGGCCACGTT GCGGAAGAGT TAAATCTTTC AAAACGTACT
CTTCAGCGTC GCTTACAGCA GCAAGATGTA AACTTTGCTG AACTGCGTGA CCAAGTCCGG
TTTCACTATT CCATCGATTA CCTTATTAAG CAGCATCAAA GCATCGACAG TATCTCTGCA
TCGTTAGATT TTTCTGATCG AACTAGCTTT ACCAACGCCT TTAAACGTTG GACAGGTCTT
TCTCCCAGTA CTTTTAGAAA GCTTTTCCGC GATTACGTTT AG
 
Protein sequence
MTVLKLVQSS SLPEAEEVYL PSQDSYDMKS PLEAMVYSYR ILAHSYKYDS FNQAITSLLD 
AGIKKLGMET ALLTRPVSAQ IFEVVACGGK CEGFYVGQHL SLQETPCLTV FQKNETCAYT
NVERMCGKVP AMAYNQTQVG AYLGTYVQPH FAEPGVMCFT APEARLTEFS AEDVVFIELL
AEGVAFMTDQ LRAQAQRKLT DQAMFALGSV KTLDEYLEQA RLPEVFGVPA RVVEVLQRRI
GHAPLSIGHV AEELNLSKRT LQRRLQQQDV NFAELRDQVR FHYSIDYLIK QHQSIDSISA
SLDFSDRTSF TNAFKRWTGL SPSTFRKLFR DYV
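
The protein sequence can be reproduced from the gene sequence with translation table 11, as listed in the record. The following is a minimal sketch, not the database's own method: `translate_cds` is a hypothetical helper, it assumes the sequence is given in the coding orientation, and it assumes the first codon is a valid start codon and is therefore read as M (here GTG, which would otherwise encode V). Table 11 uses the same amino acid assignments as the standard code, so the compact 64-letter table below covers it.

```python
from itertools import product

# Codon table in TCAG order; amino acid assignments are shared by the
# standard code (table 1) and the bacterial code (table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon.

    Assumption: the first codon is a start codon and is translated as M.
    """
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # ignore any trailing partial codon
    protein = []
    for index in range(0, usable, 3):
        aa = CODON_TABLE[seq[index:index + 3]]
        if aa == "*":
            break
        protein.append("M" if index == 0 else aa)
    return "".join(protein)


# First line of the gene sequence above (60 nt, 20 codons).
first_line = "GTGACCGTTT TGAAGCTTGT CCAAAGTAGT AGTCTCCCGG AGGCAGAAGA AGTTTATTTG"
print(translate_cds(first_line))  # MTVLKLVQSSSLPEAEEVYL
```

The output corresponds to the first 20 residues of the protein sequence listed above. Passing the full gene sequence should give the complete protein, with translation ending at the terminal TAG.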