Gene Sde_3059 details


Gene Information

Locus tag: Sde_3059
Symbol:
ID: 3967664
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Saccharophagus degradans 2-40
Kingdom: Bacteria
Replicon accession: NC_007912
Strand:
Start bp: 3911426
End bp: 3912847
Gene Length: 1422 bp
Protein Length: 473 aa
Translation table: 11
GC content: 52%
IMG OID: 637922156
Product: GntR family transcriptional regulator
Protein accession: YP_528528
Protein GI: 90022701
COG category: [E] Amino acid transport and metabolism; [K] Transcription
COG ID: [COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs
TIGRFAM ID:
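
The length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and rests on two assumptions that the record does not state: that the start and end coordinates are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3911426
end_bp = 3912847

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)      # 1422, matching "Gene Length"
print(protein_length_aa)   # 473, matching "Protein Length"
```

Under these assumptions the computed values agree with the gene length and protein length listed in the record.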


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.0865812
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000973023
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGTCGCACC TTTACCAAAC CTTGGCCACC GATATAGTTA AACAAATAGA AAACGGCCAT 
TACCGCCCCG GCGATAAGCT GCCCGGTATT CGCGCAATAA GCCGCCAGCG CAATGTAAGT
ATTGCCACTA CCCAGTCGGC ATTCAGGGTA TTGGAAGATG ATGGCTGGGT AGACGTTCGC
CCGCGCTCAG GCTTTTATGT GCGCGCTCGC CAAACCGTAA GCCCTATCGT TGCAGCGGCA
CAGTCAGACC CAAAGCCCAG CAATGTTACC GGCCAAGATA TGGCCCTAAG CCTGATAAAA
GCGGCCAATA AACACCATAT ATTACAGCTG GGCGCCGCAG TGCCAGACTT GTCTTTTTTA
CCTGTTCAGC CCATAAGCCA AGCCACCAGC CAAGCCCAAA AGCGCTTTGG CGACCGCGCA
TTAAGCTACG AAATGCCGCC GGGTGCGCTA GAACTTCGCC GCCAAATAGC ACGCCGTATG
GGCGAGGCGG GTTGTGCGGT AAGCAGCGAA GAAATTGTAA TTACAAGCGG ATGCCAGGAA
GCGCTTACTA TCGCCCTAAA GGCCGTCACT AAAGCGGGAG ACATTGTAGC TATAGAATCC
CCCACCTTTT ACGGGCTGTT GCAGGTTATA GAATCTCTTG GCTTAGAAGC CTTAGAAATA
CCCGTAGACC CACAAACAGG TATGAGCCTA GATGCCCTAA AATTGGCCCT AGAGCGCTGG
CCAATTAAAG CCTGTGTTGT AGTGGCCAAC TGCAGTAACC CCTTGGGCTA CACCATGCCC
GACGAGAGTA AGTTGGCACT TACATCACTA CTTACCGAGC ACAACGTGCC CCTTATAGAA
GACGATGTGT ACGGCGACTT AAGCTTCGAT AAGCACCGCC CAGCACTGTG CCGCAGTTTG
GCACCGCAGG CCGATATTAT TTACTGCAGC TCGTTTTCTA AAACCTTAAG CCCTGGGTTG
CGCGTTGGCT GGATAGCCGC CGGTAAGCAC TTGGCGCGCG TAGAGTACTT AAAGTACGTA
AGCAATATAG CCAGCGCTAC CTTGCCGCAA CTTACGGTAG CTCACTTTTT AGAAAGCGGC
CGATACGACC GCTACCTGCG CCAAGCTCGG GGGCAGTACG CGCGTGCGGT TAGCCGCATG
ACCGACGCCG TTTTGCGCTA CTTCCCTGAG GGTACAAGAG TAAGCCAACC CAGAGGCGGC
TTTGTAATAT GGGTGGAACT GCCCTGCGCT ATTAATACAT TTGAACTTGC CCAGCTGGCC
TTAGGCCAAG GTATTAGCAT TGCACCCGGC CCTATATTTT CAGCAAAGCA AAAGTTTAAA
AACGCTATGC GGCTGTCTTG CGCCTGCAAG TGGGATGAAA AAGTAGAAAA AGGCTTAGCC
TGGTTGGGCG CTAGAATTGA ACATATGCAA TTGAGCAAAT AA
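
The gene sequence above is laid out in space-separated blocks of 10 bases, so the whitespace has to be removed before the sequence is measured or analysed. The following is a minimal sketch of how that could be done, together with one common definition of GC content (the percentage of bases that are G or C). The function names are hypothetical, and the record does not say how its own GC content figure was calculated or rounded.

```python
def clean_sequence(text: str) -> str:
    """Remove the whitespace between blocks and lines, and uppercase the bases."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Return the percentage of G and C bases in the cleaned sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence from this record: six blocks of 10 bases.
first_line = "ATGTCGCACC TTTACCAAAC CTTGGCCACC GATATAGTTA AACAAATAGA AAACGGCCAT"
print(len(clean_sequence(first_line)))  # 60
```

Passing the full gene sequence to `gc_percent` would give a value that can be compared against the GC content field of the record.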
 
Protein sequence
MSHLYQTLAT DIVKQIENGH YRPGDKLPGI RAISRQRNVS IATTQSAFRV LEDDGWVDVR 
PRSGFYVRAR QTVSPIVAAA QSDPKPSNVT GQDMALSLIK AANKHHILQL GAAVPDLSFL
PVQPISQATS QAQKRFGDRA LSYEMPPGAL ELRRQIARRM GEAGCAVSSE EIVITSGCQE
ALTIALKAVT KAGDIVAIES PTFYGLLQVI ESLGLEALEI PVDPQTGMSL DALKLALERW
PIKACVVVAN CSNPLGYTMP DESKLALTSL LTEHNVPLIE DDVYGDLSFD KHRPALCRSL
APQADIIYCS SFSKTLSPGL RVGWIAAGKH LARVEYLKYV SNIASATLPQ LTVAHFLESG
RYDRYLRQAR GQYARAVSRM TDAVLRYFPE GTRVSQPRGG FVIWVELPCA INTFELAQLA
LGQGISIAPG PIFSAKQKFK NAMRLSCACK WDEKVEKGLA WLGARIEHMQ LSK