Gene Information |
Locus tag | Sde_3858 |
Symbol | |
ID | 3967007 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Saccharophagus degradans 2-40 |
Kingdom | Bacteria |
Replicon accession | NC_007912 |
Strand | + |
Start bp | 4862147 |
End bp | 4863328 |
Gene Length | 1182 bp |
Protein Length | 393 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 637922955 |
Product | AraC family transcriptional regulator |
Protein accession | YP_529325 |
Protein GI | 90023498 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
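The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon (the gene sequence in this record ends in TGA). The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above (Sde_3858).
length_bp = gene_length(4862147, 4863328)
print(length_bp)                  # 1182
print(protein_length(length_bp))  # 393
```

Both results match the Gene Length (1182 bp) and Protein Length (393 aa) fields listed in the record.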
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAATACTT TGCAACTTTC TATATACGCC ATGGCTTTGG GGTTGTGTGC CTTTACTGGG CTGCTTGCTT GGCGGGCGTC GTTTCGCAAT AGGTACTTTT TAGTATTTAT GACCCTACTG GTGTTAATGC TAGGTTGCGA TTGGTTAATG CACCACCCTA GTACACCGCT TAAGAATTTA TGGCTTGTTA TTTTAATGGC GAGTGCACTA TTGGTTGGCC CATGCGCGTT GCTATTAGCT AATTCTGTAG GCAATGCGGA TTTGCACATT AATTGGCGCA GTCACGCAGT GTTGGTTATT GCGGCTTGGT TACTGTTAAC GCCGCTGGCA ACGTCTATTC ACTTCGGCAC CGAATTCGTC AATGCAGCTG CCCCTGTAAC CAAGGCGTAT GCTTTTTTTA TTCACACTGG TATGTCGCTT ACGGTTGGTT TGTTTTTATT GCAAACCCTG TGGGTGCTGC GCACTTGCTA TGCCTTGCTG CAGCGGCGCA ATGTACAAAA CAAGTGGCTG TTTTCTGAGT TAGCCGACCC CGGCCTAAAT TTGTTGCGCA TTTTAGTGTT GGCCATTGTT ATTAATGCTG TGGTTTCTAT AGCAAAGGTG CTTTATTGCG CCCTGTTAGA TGGGGTGTAT ATGCCCATTA ATATTGTTAT ATCTGGTATT CATTTATTAA TGGCCATATT TTTGGCCAGC TCTTATATTA GTTTGGTTGT TGGGGCACAA GGTAAGGCAG AAGCAATAAG GCAAACACTA TTTAAGCCAG AGGCACATCC AACCACACAA AGTGATACCA CGGCCAGTAA AAATTTTGAG CCAAGCAGTA ATAGCAATAA CAAAACTACT GCCGAACTTA CCGGAAAGCA GCAAGCGCTG CTAAAACAAA TTAAAGCGGC AATGGATGTA GAGCATTTGT ATAAAAAACC CAGCTTAAGC CTACGCGATT TATGCGACCA CCTAAACGAA AGCCCCCACA ATATATCGCA GGTAATTAAC GAAAGTGATT TAGGTAATTT TTACGATATG GTGAATAGCC GCAGGGTGGC GCTGGCATCC CAGCTTTTAC AACAAAACCC ACAACGCACG GTGTTGGATA TTGCTTTCGA TTGCGGGTTT AATTCTAAGT CTTCTTTTAA TAGCGTGTTT AAGCGGTATA CGGGGGTAAC GCCTAGTCAG TACCGCGCTT GA
Protein sequence | MNTLQLSIYA MALGLCAFTG LLAWRASFRN RYFLVFMTLL VLMLGCDWLM HHPSTPLKNL WLVILMASAL LVGPCALLLA NSVGNADLHI NWRSHAVLVI AAWLLLTPLA TSIHFGTEFV NAAAPVTKAY AFFIHTGMSL TVGLFLLQTL WVLRTCYALL QRRNVQNKWL FSELADPGLN LLRILVLAIV INAVVSIAKV LYCALLDGVY MPINIVISGI HLLMAIFLAS SYISLVVGAQ GKAEAIRQTL FKPEAHPTTQ SDTTASKNFE PSSNSNNKTT AELTGKQQAL LKQIKAAMDV EHLYKKPSLS LRDLCDHLNE SPHNISQVIN ESDLGNFYDM VNSRRVALAS QLLQQNPQRT VLDIAFDCGF NSKSSFNSVF KRYTGVTPSQ YRA